FORT-Searcher: Synthesizing Shortcut-Resistant Search Tasks for Training Deep Search Agents Paper • 2606.12087 • Published 8 days ago • 73
WeaveBench: A Long-Horizon, Real-World Benchmark for Computer-Use Agents with Hybrid Interfaces Paper • 2606.09426 • Published 10 days ago • 100
i1: A Simple and Fully Open Recipe for Strong Text-to-Image Models Paper • 2606.11289 • Published 9 days ago • 13
World Pilot: Steering Vision-Language-Action Models with World-Action Priors Paper • 2606.12403 • Published 8 days ago • 25
VideoMDM: Towards 3D Human Motion Generation From 2D Supervision Paper • 2606.13364 • Published 7 days ago • 20
InternVideo3: Agentify Foundation Models with Multimodal Contextual Reasoning Paper • 2606.12195 • Published 8 days ago • 22
OmniVideo-100K: A Dataset for Audio-Visual Reasoning through Structured Scripts and Evidence Chains Paper • 2606.14702 • Published 6 days ago • 27
World Model Self-Distillation: Training World Models to Solve General Tasks Paper • 2606.12072 • Published 8 days ago • 13
LatentSkill: From In-Context Textual Skills to In-Weight Latent Skills for LLM Agents Paper • 2606.06087 • Published 14 days ago • 63
Flow-DPPO: Divergence Proximal Policy Optimization for Flow Matching Models Paper • 2606.11025 • Published 9 days ago • 41
FlashMemory-DeepSeek-V4: Lightning Index Ultra-Long Context via Lookahead Sparse Attention Paper • 2606.09079 • Published 10 days ago • 62