Hi, I'm new to k-wave and noticed that when running on the CPU c++ code pre-rpocessing is instant and then my simulation takes about 47 seconds, however when running the CUDA code, the simulation takes about 10 seconds, but 80 seconds is spent on pre-processing the FFT. Is this expected?