M Saad Salman

MSS444

AI & ML interests

None yet

Recent Activity

upvoted a paper about 20 hours ago

Reaching Beyond the Mode: RL for Distributional Reasoning in Language Models

upvoted a paper about 20 hours ago

Voxtral TTS

upvoted a paper about 20 hours ago

Trace2Skill: Distill Trajectory-Local Lessons into Transferable Agent Skills

Organizations

None yet

upvoted 3 papers about 20 hours ago

upvoted 2 papers 4 days ago

StreamingClaw Technical Report

Paper • 2603.22120 • Published 7 days ago • 9

Why Does Self-Distillation (Sometimes) Degrade the Reasoning Capability of LLMs?

Paper • 2603.24472 • Published 5 days ago • 41

upvoted 5 papers 6 days ago

Regulating AI Agents

Paper • 2603.23471 • Published 6 days ago • 4

From Static Templates to Dynamic Runtime Graphs: A Survey of Workflow Optimization for LLM Agents

Paper • 2603.22386 • Published 7 days ago • 54

SpecEyes: Accelerating Agentic Multimodal LLMs via Speculative Perception and Planning

Paper • 2603.23483 • Published 6 days ago • 58

UniGRPO: Unified Policy Optimization for Reasoning-Driven Visual Generation

Paper • 2603.23500 • Published 6 days ago • 35

Sparse but Critical: A Token-Level Analysis of Distributional Shifts in RLVR Fine-Tuning of LLMs

Paper • 2603.22446 • Published 7 days ago • 6

upvoted 10 papers 7 days ago

SWE-Skills-Bench: Do Agent Skills Actually Help in Real-World Software Engineering?

Paper • 2603.15401 • Published 15 days ago • 18

InCoder-32B: Code Foundation Model for Industrial Scenarios

Paper • 2603.16790 • Published 13 days ago • 304

AI Scientist via Synthetic Task Scaling

Paper • 2603.17216 • Published 13 days ago • 4

Efficient Exploration at Scale

Paper • 2603.17378 • Published 13 days ago • 13

Complementary Reinforcement Learning

Paper • 2603.17621 • Published 13 days ago • 36

MetaClaw: Just Talk -- An Agent That Meta-Learns and Evolves in the Wild

Paper • 2603.17187 • Published 13 days ago • 134

Efficient Reasoning with Balanced Thinking

Paper • 2603.12372 • Published 18 days ago • 143

What Really Controls Temporal Reasoning in Large Language Models: Tokenisation or Representation of Time?

Paper • 2603.19017 • Published 12 days ago • 3

Human-AI Synergy in Agentic Code Review

Paper • 2603.15911 • Published 14 days ago • 4

Teaching an Agent to Sketch One Part at a Time

Paper • 2603.19500 • Published 11 days ago • 5
