Fix CUDA out-of-memory errors during W-Tiling
The std::shared_ptr for the tile FFT was used incorrectly, such that planning the FFT for subsequent batches of tiles could fail in some situations. To make issues like this easier to debug, the error reporting in CU and CUFFT has been updated as well.
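The description above does not spell out how the std::shared_ptr was misused, so the following is only a sketch of one way such a failure can arise, not the actual code from this change. It assumes that a plan holds device memory for as long as it is alive. MockDevice, MockPlan, replan_keeping_old and replan_after_reset are hypothetical names that stand in for the real device and FFT plan, and no CUDA or cuFFT calls are used. The point it illustrates is plain C++ behaviour: in `plan = std::make_shared<...>(...)` the new object is constructed before the old one is released, so both briefly exist at the same time.

```cpp
#include <cassert>
#include <cstddef>
#include <memory>
#include <stdexcept>

// Hypothetical stand-in for device memory: tracks bytes in use against a fixed capacity.
struct MockDevice {
  explicit MockDevice(std::size_t cap) : capacity(cap), used(0) {}
  std::size_t capacity;
  std::size_t used;
};

// Hypothetical stand-in for an FFT plan: reserves work memory when constructed
// and gives it back when destroyed.
class MockPlan {
 public:
  MockPlan(MockDevice& device, std::size_t bytes)
      : device_(device), bytes_(bytes) {
    if (device_.used + bytes_ > device_.capacity) {
      throw std::runtime_error("mock out of memory while planning FFT");
    }
    device_.used += bytes_;
  }
  ~MockPlan() { device_.used -= bytes_; }
  MockPlan(const MockPlan&) = delete;
  MockPlan& operator=(const MockPlan&) = delete;

 private:
  MockDevice& device_;
  std::size_t bytes_;
};

// Replans while the previous plan is still held: the new plan is constructed
// first, so both plans need to fit in memory at the same time.
bool replan_keeping_old(MockDevice& device, std::shared_ptr<MockPlan>& plan,
                        std::size_t bytes) {
  try {
    plan = std::make_shared<MockPlan>(device, bytes);
    return true;
  } catch (const std::runtime_error&) {
    return false;  // planning failed, the previous plan is left untouched
  }
}

// Releases the previous plan before creating the next one, so only one plan
// occupies memory at any time.
bool replan_after_reset(MockDevice& device, std::shared_ptr<MockPlan>& plan,
                        std::size_t bytes) {
  plan.reset();
  try {
    plan = std::make_shared<MockPlan>(device, bytes);
    return true;
  } catch (const std::runtime_error&) {
    return false;
  }
}
```

With a mock capacity of 100 bytes and a 60-byte plan, the first call to replan_keeping_old succeeds and the second fails, while replan_after_reset succeeds because the old plan is released first. Whether the real fix resets the pointer, reuses the existing plan, or does something else is not stated in the description.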