Reading list
LoFT: Parameter-Efficient Fine-Tuning for Long-tailed Semi-Supervised Learning in Open-World Scenarios
Paper
• 2509.09926
• Published
• 14
What Breaks Knowledge Graph based RAG? Empirical Insights into Reasoning under Incomplete Knowledge
Paper
• 2508.08344
• Published
MemMamba: Rethinking Memory Patterns in State Space Model
Paper
• 2510.03279
• Published
• 73
When Thoughts Meet Facts: Reusable Reasoning for Long-Context LMs
Paper
• 2510.07499
• Published
• 48
TokDrift: When LLM Speaks in Subwords but Code Speaks in Grammar
Paper
• 2510.14972
• Published
• 35
ReCode: Unify Plan and Action for Universal Granularity Control
Paper
• 2510.23564
• Published
• 122
Code Aesthetics with Agentic Reward Feedback
Paper
• 2510.23272
• Published
• 9
BAPO: Stabilizing Off-Policy Reinforcement Learning for LLMs via Balanced Policy Optimization with Adaptive Clipping
Paper
• 2510.18927
• Published
• 84
ToolScope: An Agentic Framework for Vision-Guided and Long-Horizon Tool Use
Paper
• 2510.27363
• Published
• 23
Unlocking the conversion of Web Screenshots into HTML Code with the WebSight Dataset
Paper
• 2403.09029
• Published
• 56
Physics of Language Models: Part 4.1, Architecture Design and the Magic of Canon Layers
Paper
• 2512.17351
• Published
• 28
Memory in the Age of AI Agents
Paper
• 2512.13564
• Published
• 151