CRAFT uses video diffusion to generate diverse, photorealistic bimanual robot training data, enhancing learning with scalable and action-consistent demos.
Discover TempoControl, a method for fine-grained temporal control in text-to-video models enabling precise timing of visual elements without retraining.
DiReCT enhances video generation by disentangling contrastive trajectories, improving physics consistency and visual quality without extra training time.