Step-Video-T2V

Step-Video-T2V

Open-Source, 204-Frame Video Generation from Text.

10 followers

Step-Video-T2V is the open-source text-to-video model series from StepFun. Up to 204-frame generation, high compression Video-VAE, and video-based DPO for enhanced quality. Achieves SOTA on Step-Video-T2V-Eval.
Step-Video-T2V gallery image
Step-Video-T2V gallery image
Step-Video-T2V gallery image
Step-Video-T2V gallery image
Step-Video-T2V gallery image
Step-Video-T2V gallery image
Step-Video-T2V gallery image
Free
Launch Team