Nano-World-Model Collection 🌍 A minimalist repository for training video world models based on diffusion-forcing. • 17 items • Updated 5 days ago • 4
Tuna-2: Pixel Embeddings Beat Vision Encoders for Multimodal Understanding and Generation Paper • 2604.24763 • Published 8 days ago • 68
Unified 4D World Action Modeling from Video Priors with Asynchronous Denoising Paper • 2604.26694 • Published 6 days ago • 6