Anti-Self-Distillation for Reasoning RL via Pointwise Mutual Information Paper • 2605.11609 • Published 15 days ago • 191
SkillsVote: Lifecycle Governance of Agent Skills from Collection, Recommendation to Evolution Paper • 2605.18401 • Published 9 days ago • 125
Boosting Reinforcement Learning with Verifiable Rewards via Randomly Selected Few-Shot Guidance Paper • 2605.15012 • Published 13 days ago • 4
Mean Mode Screaming: Mean–Variance Split Residuals for 1000-Layer Diffusion Transformers Paper • 2605.06169 • Published 20 days ago • 228
HERMES++: Toward a Unified Driving World Model for 3D Scene Understanding and Generation Paper • 2604.28196 • Published 27 days ago • 71
Seeing Isn't Believing: Uncovering Blind Spots in Evaluator Vision-Language Models Paper • 2604.21523 • Published Apr 23 • 3
openai/whisper-large-v3-turbo Automatic Speech Recognition • 0.8B • Updated Oct 4, 2024 • 7.91M • 3.04k
hector-gr/RLCR-v4-ks-uniqueness-cov0-gapece-cold-math Text Generation • 8B • Updated Apr 10 • 34 • 1
UniRecGen: Unifying Multi-View 3D Reconstruction and Generation Paper • 2604.01479 • Published Apr 1 • 7
Paper Circle: An Open-source Multi-agent Research Discovery and Analysis Framework Paper • 2604.06170 • Published Apr 7 • 31
Adam's Law: Textual Frequency Law on Large Language Models Paper • 2604.02176 • Published Apr 2 • 504