Youtu-LLM: Unlocking the Native Agentic Potential for Lightweight Large Language Models Paper • 2512.24618 • Published 3 days ago • 57
TACO: Think-Answer Consistency for Optimized Long-Chain Reasoning and Efficient Data Learning via Reinforcement Learning in LVLMs Paper • 2505.20777 • Published May 27, 2025
Youtu-Agent: Scaling Agent Productivity with Automated Generation and Hybrid Policy Optimization Paper • 2512.24615 • Published 3 days ago • 1
SmartSnap: Proactive Evidence Seeking for Self-Verifying Agents Paper • 2512.22322 • Published 7 days ago • 35
ActiveVLN: Towards Active Exploration via Multi-Turn RL in Vision-and-Language Navigation Paper • 2509.12618 • Published Sep 16, 2025 • 1
LTD-Bench: Evaluating Large Language Models by Letting Them Draw Paper • 2511.02347 • Published Nov 4, 2025 • 8
Adaptive Dual Reasoner: Large Reasoning Models Can Think Efficiently by Hybrid Reasoning Paper • 2510.10207 • Published Oct 11, 2025
RoRecomp: Enhancing Reasoning Efficiency via Rollout Response Recomposition in Reinforcement Learning Paper • 2509.25958 • Published Sep 30, 2025
SSA: Sparse Sparse Attention by Aligning Full and Sparse Attention Outputs in Feature Space Paper • 2511.20102 • Published Nov 25, 2025 • 27
SoftCLIP: Softer Cross-modal Alignment Makes CLIP Stronger Paper • 2303.17561 • Published Mar 30, 2023
VITA-E: Natural Embodied Interaction with Concurrent Seeing, Hearing, Speaking, and Acting Paper • 2510.21817 • Published Oct 21, 2025 • 41
VITA-VLA: Efficiently Teaching Vision-Language Models to Act via Action Expert Distillation Paper • 2510.09607 • Published Oct 10, 2025 • 2
Aligning and Prompting Everything All at Once for Universal Visual Perception Paper • 2312.02153 • Published Dec 4, 2023
MAC-SQL: A Multi-Agent Collaborative Framework for Text-to-SQL Paper • 2312.11242 • Published Dec 18, 2023
MAC-SQL: A Multi-Agent Collaborative Framework for Text-to-SQL Paper • 2312.11242 • Published Dec 18, 2023
Learn the Ropes, Then Trust the Wins: Self-imitation with Progressive Exploration for Agentic Reinforcement Learning Paper • 2509.22601 • Published Sep 26, 2025 • 29
TPLA: Tensor Parallel Latent Attention for Efficient Disaggregated Prefill \& Decode Inference Paper • 2508.15881 • Published Aug 21, 2025 • 9
Youtu-GraphRAG: Vertically Unified Agents for Graph Retrieval-Augmented Complex Reasoning Paper • 2508.19855 • Published Aug 27, 2025 • 7