[Feature] Add Batch FFT for CUDA #6512

Critsium-xy · 2025-09-17T07:37:55Z

According to tests from 沐曦's engineers, using batch fft on GPU to replace some FFT loop may make caculation faster in some cases. This PR added a input parameter which allows users to choose whether to use batch fft.

Flying-dragon-boxing · 2025-09-20T13:17:30Z

I'm not sure if the batch size should be a input parameter or merely a function parameter that can be differently decided when developers use batch FFT to do different things. As for me, this parameter might be a situation-specific one.

Critsium-xy · 2025-09-23T06:46:16Z

I'm not sure if the batch size should be a input parameter or merely a function parameter that can be differently decided when developers use batch FFT to do different things. As for me, this parameter might be a situation-specific one.

I've added it as a function parameter in FFT module. But according to previous test, setting batch_size to different size leads to different caculation speed. I cannot easily figure out which size is the best so maybe leave this problem to user is an option😤

Critsium-xy added 4 commits September 17, 2025 15:33

Add fft_batch parameter

dd475a5

Add batch_size parameter in CUDA FFT

bc5fd88

Add batch function in FFT module

6785bdd

Added ;

c890b7f

Merge branch 'develop' into gpu_fftbatch

65953e3

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[Feature] Add Batch FFT for CUDA #6512

[Feature] Add Batch FFT for CUDA #6512

Uh oh!

Critsium-xy commented Sep 17, 2025

Uh oh!

Flying-dragon-boxing commented Sep 20, 2025

Uh oh!

Critsium-xy commented Sep 23, 2025

Uh oh!

Uh oh!

[Feature] Add Batch FFT for CUDA #6512

Are you sure you want to change the base?

[Feature] Add Batch FFT for CUDA #6512

Uh oh!

Conversation

Critsium-xy commented Sep 17, 2025

Uh oh!

Flying-dragon-boxing commented Sep 20, 2025

Uh oh!

Critsium-xy commented Sep 23, 2025

Uh oh!

Uh oh!