LongVideoAgent: Multi-Agent Reasoning with Long Videos Paper • 2512.20618 • Published Dec 23, 2025 • 55
Pico-Banana-400K: A Large-Scale Dataset for Text-Guided Image Editing Paper • 2510.19808 • Published Oct 22, 2025 • 30
Wan-Move: Motion-controllable Video Generation via Latent Trajectory Guidance Paper • 2512.08765 • Published Dec 9, 2025 • 132
MultiShotMaster: A Controllable Multi-Shot Video Generation Framework Paper • 2512.03041 • Published Dec 2, 2025 • 64
Can World Simulators Reason? Gen-ViRe: A Generative Visual Reasoning Benchmark Paper • 2511.13853 • Published Nov 17, 2025 • 36
MagicQuill 🪶: Edit images using scribbles, color hints, and text prompts • Running on L4 • Featured • 2.22k
MultiBooth: Towards Generating All Your Concepts in an Image from Text Paper • 2404.14239 • Published Apr 22, 2024 • 9