Streamline `sdc` for real sample density compensation factors (DCFs) #160

JeffFessler · 2026-01-03T19:36:08Z

This PR has the following changes:

- Extend "Allow convolve to work with Real arrays with arbitrary precision" (#159) to also work with GPU arrays, i.e., convolve! now works with real (GPU) array types.
- Use the new Real capabilities of convolve! so that weights are always real in sdc, avoiding abs and other conversions between real and complex types.
- Solve analytically for the real weights scaling factor c in sdc to avoid calling \.
- Comment out the line weights_tmp .+= eps(T), which seems unnecessary. (Are there corner cases where it is needed? If so, then the tests should be expanded to cover such a case.)
- Make an (almost) non-allocating version sdc!.
- Offer two versions of sdc!: one with and one without the final "scaling" step, which is not always essential.
- Purge residual code (in comments) about "conversion to Array is a workaround for CuNFFT".
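For readers wondering what "solve analytically" can look like, here is a minimal sketch. It assumes the scaling step is a scalar least-squares fit between two vectors; the names `b`, `y`, and `real_scale` are hypothetical and not taken from the PR. For vectors, `b \ y` returns the scalar c that minimizes ‖c·b − y‖, and restricting c to be real gives c = Re(b'y) / ‖b‖².

```julia
using LinearAlgebra: dot

# Hypothetical helper (not from the PR): closed-form real scalar c
# minimizing norm(c .* b .- y), i.e. c = Re(b' * y) / ||b||^2.
real_scale(b, y) = real(dot(b, y)) / sum(abs2, b)

# Hypothetical data: y is roughly 2.5 times b plus a small perturbation.
b = ComplexF64[1 + 2im, 3 - 1im, -2 + 0.5im]
y = 2.5 .* b .+ ComplexF64[0.1, -0.2im, 0.05]

c_backslash = b \ y        # complex scalar least-squares solution
c_real = real_scale(b, y)  # same fit, constrained to a real scalar

# The real-constrained minimizer equals the real part of the complex one.
@assert c_real ≈ real(c_backslash)
```

The closed form needs only one inner product and one sum of squares, so it avoids the generic `\` code path.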

The one aspect I could not figure out how to address here is how to recycle the working space in the complex array p.tmpVec as a real array. My attempts to do this using reinterpret have failed. It could be refined with another PR later if anyone has ideas.
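For context, this is a sketch of the kind of reuse meant here, shown for a plain CPU `Vector` only. The variable names are hypothetical, and nothing here is claimed to work for GPU array types or for functions that dispatch on a specific array type, which may be where the attempts above ran into trouble.

```julia
# Hypothetical stand-in for a complex scratch buffer such as p.tmpVec.
tmp = zeros(ComplexF64, 4)

# Reinterpret the same memory as real numbers: twice as many elements, no copy.
r = reinterpret(Float64, tmp)
@assert length(r) == 2 * length(tmp)

# Writes through the real view land in the complex buffer (real part, then imaginary part).
r[1] = 1.0
r[2] = 2.0
@assert tmp[1] == 1.0 + 2.0im

# A real workspace with as many elements as tmp, using the first half of its memory.
work = view(r, 1:length(tmp))
fill!(work, 3.0)
```

Note that `r` is a `ReinterpretArray` wrapper rather than a plain `Vector{Float64}`, so any method that requires a concrete array type will not accept it directly.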

nHackel · 2026-01-05T14:56:02Z

@JeffFessler I took a look at the allocations with

Profile.Allocs.@profile sample_rate=1.0 NFFTTools.sdc!(p, 10,
            w2,
            weights_tmp,
            workg,
            workf,
            workv,
)

and I think the allocations are fairly minimal.

Most allocations seem to happen due to the multi-threading book-keeping, either in FFTW or our @cthreads via OhMyThreads.jl. I think with the example the ratio of data to bookkeeping is making this seem worse than things are.

Regarding reusing p.tmpVec, I don't have a solution yet, but something I noticed while reviewing is that the sdc code dispatches on p::AbstractNFFTPlans but uses things like p.J, p.N, and p.tmpVec, which aren't part of the abstract interface. That is of course already an existing problem. I don't think we have the necessary metadata in the abstract interface to write this function in a proper abstract way. In particular, we have size, size_in and size_out, but those don't give us all the required values.

Maybe I am missing something there, though, and it is possible.

JeffFessler · 2026-01-07T04:21:45Z

Thanks for the notes about allocs - makes sense.
I updated to use size_* where possible, but you are right that we can't eliminate p.tmpVec because we need its type to do GPU stuff, since the JLArray type is not part of the Plan type. And I don't see how to avoid p.Ñ either. Seems odd that the abstract interface requires convolve! but doesn't provide the key ingredients needed for that method.

Would it make sense to retreat from AbstractNFFTPlans and just use NFFTPlan? I'm happy to make that change, and if someone later wants a more general version they can tackle the challenge of convolve! etc. Thoughts?

JeffFessler · 2026-01-07T13:37:35Z

tests are failing because "NFFTPlan has no field size_in" sigh.

nHackel · 2026-01-07T14:17:07Z

@JeffFessler size_in and size_out are both functions you apply to the plans.
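To illustrate the distinction, here is a self-contained sketch using a hypothetical `ToyPlan` type. It is not the NFFT.jl plan type, and the `size_in`/`size_out` methods below are local stand-ins that only mimic the accessor-function style.

```julia
# Hypothetical toy plan: the sizes are stored in fields named N and M.
struct ToyPlan
    N::NTuple{2,Int}  # size of the input (image) array
    M::Int            # number of nonuniform samples
end

# Accessors are functions applied to the plan, not fields of the plan.
size_in(p::ToyPlan) = p.N
size_out(p::ToyPlan) = (p.M,)

p = ToyPlan((8, 8), 20)

@assert size_in(p) == (8, 8)       # function call works
@assert size_out(p) == (20,)
@assert !hasproperty(p, :size_in)  # so p.size_in would throw a "no field" error
```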

I think it makes sense to retreat from AbstractNFFT and change both the types in the methods and the dependency for NFFTPlans/NFFT.jl for now. I'm not sure if I'd even classify this as a breaking change, because the code as written never worked before 😅

Going forward, we could "fix" this by:

- Extending the AbstractNFFT interface to define methods for the metadata we need to allocate the sdc vectors
- Giving sdc an additional parameter which determines the array type, with a default value of Array
- Providing a package extension on NFFT.jl which has a default value of tmpVec (and potentially reuses tmpVec as an sdc! input)
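The second option could look roughly like the sketch below. The function name `alloc_weights` and the keyword `arrayType` are hypothetical and only illustrate how an array-type parameter with a default of `Array` lets the caller choose where the work vectors live; this is not the signature used in NFFTTools.

```julia
# Hypothetical helper: allocate a real weights vector of length M
# using whatever array constructor the caller passes in (default: Array).
function alloc_weights(::Type{T}, M::Integer; arrayType = Array) where {T<:Real}
    w = arrayType{T}(undef, M)  # e.g. Array{Float32}(undef, M)
    fill!(w, one(T))            # start from uniform weights
    return w
end

w = alloc_weights(Float32, 5)
@assert w isa Vector{Float32}
@assert all(==(1.0f0), w)
```

A GPU array type could then be passed as `arrayType`, assuming it supports the same `T{ElType}(undef, n)` constructor form.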

If I remember correctly, AbstractNFFT.jl has @mustimplement convolve! but with a docstring that states that the operation is optional.

Still, it would be good to have fitting functions to retrieve the values we need, if those are generally required for an NFFT.

nHackel · 2026-01-07T14:22:15Z

I think the missing value is only Ñ right? Which is roughly σN with some rounding logic and probably common to all implementations unless I got lost in the maths again.

As far as I know, at the moment only NonuniformFFTs.jl would be a possible candidate for another package which could be used for NFFTTools and I don't think it implements the convolve! calls

JeffFessler · 2026-01-08T19:07:16Z

@nHackel there is a problem with our plan of removing AbstractNFFTs from sdc.
The NFFT tests also call sdc with NFFTGPUArraysExt.GPU_NFFTPlan which is why the tests fail now.
So I think we need to revert to AbstractNFFT for now. Agreed?

nHackel · 2026-01-09T11:23:25Z

@JeffFessler ah good catch! I didn't think about that. Some day I might find the necessary time (and understanding of the maths) to port more convolution options than just NFFT.full to the GPU and then we can have only one NFFTPlan type.

Or as a temporary workaround, I could try to make the NFFTPlan parametric enough for GPU arrays and then just leave most of the unused fields empty.

Either way, I think for this PR it's good enough if we simply:

- Change the dependency from AbstractNFFTs to NFFT
- Leave the interface as AbstractNFFT (but get there via NFFT and not AbstractNFFTs)
- Make a comment in the source code that this only works for plans from NFFT.jl

JeffFessler · 2026-01-09T14:54:00Z

I think I have made the suggested changes, except for this one that I didn't grok:

"Change the dependency from AbstractNFFTs to NFFT"

I think we still need the AbstractNFFTs dependence.

So now my main remaining question is whether I can cut the commented-out code about "conversion to Array is a workaround for CuNFFT".

nHackel · 2026-01-17T20:25:13Z

@JeffFessler I just had time to take a look at this again. I think I meant dropping the AbstractNFFT dependency and just doing:

# in NFFTTools.jl
using NFFT, NFFT.AbstractNFFTs

but ultimately it doesn't really make a difference, and we can just leave it as it is now.

I think we can cut the comment; in the "worst" case it's still referred to in the PR and releases. If the bug turns up again, we should try to turn it into a MWE.

Lastly, we also need to update the Project.toml of NFFT.jl for your fix to the GPU extension.

JeffFessler · 2026-01-18T21:12:22Z

"we also need to update the project.toml of NFFT.jl for your fix to the GPU extension"

I think you mean a version bump, right? I bumped versions for both NFFTTools and NFFT.
@nHackel Please check if I did it right because I am less sure how version changes work with extensions.

JeffFessler added 11 commits December 29, 2025 23:27

Ensure weights remain real and positive

6d1843b

Merge branch 'master' into jf/sdc-real

5f93298

Use real weights with real convolve!

2ec58f8

Odd-sized test

b532dcc

Try to make it non-allocating

34b62f6

Try to test non-allocation

e953b33

Allow GPU convolve! with real arrays

2a6994a

Add GPU tests for convolve!

d1dd089

GPU test of sdc, with explicit imports

8b39fd9

Fix allocation, remove eps()

ec4946d

comment

a9d3ea4

JeffFessler marked this pull request as draft January 3, 2026 19:36

JeffFessler requested review from nHackel and tknopp January 3, 2026 19:37

JeffFessler added 4 commits January 6, 2026 23:09

fix docstring for size_in -> D

6d2be2c

Use size_in not J

e9fa756

Use size_in and size_out not p.J and p.N

3f354be

Fix comment about allocs

c0bf3b7

JeffFessler added 6 commits January 7, 2026 09:57

abandon AbstractNFFTs, fix size_*

fadf60a

Fix? imports - untested WIP

557e84d

import size_

ca32366

fix size_ and minimize dependence on p.tmpVec

3ef99e2

add deps for NFFT

bfbecdf

alloc comments

e2ea260

JeffFessler added 2 commits January 9, 2026 07:20

Back to AbstractNFFTs

e0edf97

Comment with warning about AbstractNFFTs interface; clean up

f1cfde7

JeffFessler marked this pull request as ready for review January 9, 2026 14:54

Isolate non-essential scaling step

1a75f89

JeffFessler added 3 commits January 18, 2026 13:01

Remove old commented code

a6bb78c

v0.2.7 to v0.3 (sdc changes slightly break)

67b5ce3

v0.14.2 to v0.14.3 due to GPU extension changes

012688b

nHackel merged commit cb3cac0 into master Jan 21, 2026
5 of 6 checks passed

JeffFessler deleted the jf/sdc-real branch January 21, 2026 14:08
