R$^3$-SQL: Ranking Reward and Resampling for Text-to-SQL Paper • 2604.25325 • Published 15 days ago • 2 • 2
PACEvolve++: Improving Test-time Learning for Evolutionary Search Agents Paper • 2605.07039 • Published 6 days ago • 3 • 2
HumanNet: Scaling Human-centric Video Learning to One Million Hours Paper • 2605.06747 • Published 6 days ago • 46 • 1
Steering Visual Generation in Unified Multimodal Models with Understanding Supervision Paper • 2605.05781 • Published 6 days ago • 3 • 2
CGM-JEPA: Learning Consistent Continuous Glucose Monitor Representations via Predictive Self-Supervised Pretraining Paper • 2605.00933 • Published 12 days ago • 2 • 2
Listwise Policy Optimization: Group-based RLVR as Target-Projection on the LLM Response Simplex Paper • 2605.06139 • Published 6 days ago • 62 • 2
MDN: Parallelizing Stepwise Momentum for Delta Linear Attention Paper • 2605.05838 • Published 6 days ago • 4 • 2
From Storage to Experience: A Survey on the Evolution of LLM Agent Memory Mechanisms Paper • 2605.06716 • Published 6 days ago • 5 • 2
Scaling Continual Learning to 300+ Tasks with Bi-Level Routing Mixture-of-Experts Paper • 2602.03473 • Published 5 days ago • 11 • 2
HyperEyes: Dual-Grained Efficiency-Aware Reinforcement Learning for Parallel Multimodal Search Agents Paper • 2605.07177 • Published 5 days ago • 57 • 2
IntentGrasp: A Comprehensive Benchmark for Intent Understanding Paper • 2605.06832 • Published 6 days ago • 6 • 2
Empirical Evidence for Simply Connected Decision Regions in Image Classifiers Paper • 2605.06380 • Published 6 days ago • 3 • 2
Mean Mode Screaming: Mean--Variance Split Residuals for 1000-Layer Diffusion Transformers Paper • 2605.06169 • Published 6 days ago • 116 • 2
SkCC: Portable and Secure Skill Compilation for Cross-Framework LLM Agents Paper • 2605.03353 • Published 8 days ago • 6 • 4
Discovering Reinforcement Learning Interfaces with Large Language Models Paper • 2605.03408 • Published 8 days ago • 3 • 2
Q-RAG: Long Context Multi-step Retrieval via Value-based Embedder Training Paper • 2511.07328 • Published 9 days ago • 12 • 2
SpecBlock: Block-Iterative Speculative Decoding with Dynamic Tree Drafting Paper • 2605.07243 • Published 5 days ago • 3 • 3
A$^2$RD: Agentic Autoregressive Diffusion for Long Video Consistency Paper • 2605.06924 • Published 6 days ago • 13 • 2
Beyond Retrieval: A Multitask Benchmark and Model for Code Search Paper • 2605.04615 • Published 7 days ago • 22 • 2