Attention Amnesia in Hybrid LLMs: When CoT Fine-Tuning Breaks Long-Range Recall, and How to Fix It Paper • 2606.11052 • Published 7 days ago • 15
EnvFactory: Scaling Tool-Use Agents via Executable Environments Synthesis and Robust RL Paper • 2605.18703 • Published 29 days ago • 50
Accordion-Thinking Collection Accordion-Thinking: Self-Regulated Step Summaries for Efficient and Readable LLM Reasoning (https://arxiv.org/abs/2602.03249) • 5 items • Updated Apr 10 • 1