CoF-T2I: Video Models as Pure Visual Reasoners for Text-to-Image Generation • Paper • 2601.10061 • Published 1 day ago • 20
UniAVGen: Unified Audio and Video Generation with Asymmetric Cross-Modal Interactions • Paper • 2511.03334 • Published Nov 5, 2025 • 52
moonshotai/Kimi-Linear-48B-A3B-Instruct • Text Generation • 49B • Updated about 1 month ago • 30.2k • 524
VMoBA: Mixture-of-Block Attention for Video Diffusion Models • Paper • 2506.23858 • Published Jun 30, 2025 • 31