This repository was archived by the owner on Apr 22, 2025. It is now read-only.
I benchmarked bn_fusion with resnet50 and resnet101. Performance improves on CPU (about 7%), but there is no improvement with CUDA.
There is no noticeable difference between setting torch.backends.cudnn.benchmark to true and false.
My guess is that cuDNN already optimizes this case very well.
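
For readers unfamiliar with what is being benchmarked, here is a minimal sketch of the arithmetic behind batch-norm fusion in inference mode. It is written in plain Python on a tiny linear layer (equivalent to a 1x1 convolution at a single pixel) to keep it dependency-free. It is not this repository's implementation, and the function names `fold_bn`, `linear`, and `batch_norm` are hypothetical.

```python
import math

def linear(weight, bias, x):
    # One output per row of `weight`: y_c = sum_i w_ci * x_i + b_c
    return [sum(w * xi for w, xi in zip(row, x)) + b
            for row, b in zip(weight, bias)]

def batch_norm(y, gamma, beta, mean, var, eps=1e-5):
    # Inference-mode batch norm using fixed running statistics per channel.
    return [g * (yi - m) / math.sqrt(v + eps) + bt
            for yi, g, bt, m, v in zip(y, gamma, beta, mean, var)]

def fold_bn(weight, bias, gamma, beta, mean, var, eps=1e-5):
    # Fold the per-channel BN scale and shift into the preceding layer,
    # so a single layer reproduces layer + BN.
    fused_w, fused_b = [], []
    for row, b, g, bt, m, v in zip(weight, bias, gamma, beta, mean, var):
        scale = g / math.sqrt(v + eps)
        fused_w.append([scale * w for w in row])
        fused_b.append(scale * (b - m) + bt)
    return fused_w, fused_b

# Illustrative values only (2 output channels, 3 inputs).
W = [[0.5, -1.0, 2.0], [1.5, 0.25, -0.75]]
b = [0.1, -0.2]
gamma, beta = [1.2, 0.8], [0.05, -0.1]
mean, var = [0.3, -0.4], [0.9, 1.7]
x = [1.0, 2.0, -3.0]

unfused = batch_norm(linear(W, b, x), gamma, beta, mean, var)
fused_W, fused_b = fold_bn(W, b, gamma, beta, mean, var)
fused = linear(fused_W, fused_b, x)
```

The fused layer removes one elementwise pass over the activations, which is consistent with a modest gain on CPU. Whether the GPU path already hides that cost is only the guess stated above, not something this sketch demonstrates.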