TurboDiffusion: 100–200× Acceleration for Video Diffusion Models

github.com

15 points by meander_water 3 hours ago


jjcm - 13 minutes ago

Looks like there is some quality reduction, but nonetheless 2s to generate a 5s video on a 5090 for WAN 2.1 is absolutely crazy. Excited to see more optimizations like this moving into 2026.

villgax - 12 minutes ago

I mean, the baselines were deliberately weak and not how anyone would actually run these models to begin with, except maybe noobs. The quoted number is also only for the DiT steps, not the other encoding and decoding steps, which still take quite a lot of time. There's no actual use of FA4/CUTLASS-based kernels or TRT at any point either.
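
To put rough numbers on the point about DiT-only speedups: a minimal Amdahl-style sketch, assuming the pipeline splits into one accelerated stage (the DiT steps) and one untouched stage (encoding and decoding). The function name and all timings below are hypothetical placeholders for illustration, not measurements from the project.

```python
def end_to_end_speedup(accelerated_s: float, fixed_s: float, stage_speedup: float) -> float:
    """Overall speedup when only one stage of a pipeline gets faster.

    accelerated_s: baseline time of the stage being sped up (here, the DiT steps)
    fixed_s:       baseline time of everything else (here, encode + decode)
    stage_speedup: speedup factor applied to the accelerated stage only
    """
    before = accelerated_s + fixed_s
    after = accelerated_s / stage_speedup + fixed_s
    return before / after


# Hypothetical timings chosen only to illustrate the effect.
dit_seconds = 180.0    # assumed baseline DiT time
other_seconds = 4.0    # assumed encode + decode time, left unchanged

overall = end_to_end_speedup(dit_seconds, other_seconds, stage_speedup=100.0)
print(f"end-to-end speedup: {overall:.1f}x")
```

With these made-up inputs the 100x stage speedup shrinks to roughly 32x end to end, since the unaccelerated stages then dominate the remaining runtime.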