1 16 13

Norris Wheeler

wheeler404

AI & ML interests

None yet

Recent Activity

upvoted a paper 6 days ago

From Reasoning to Agentic: Credit Assignment in Reinforcement Learning for Large Language Models

upvoted a paper 6 days ago

Mobile GUI Agent Privacy Personalization with Trajectory Induced Preference Optimization

upvoted a paper 6 days ago

TRACE: Capability-Targeted Agentic Training

View all activity

Organizations

None yet

upvoted 14 papers 6 days ago

From Reasoning to Agentic: Credit Assignment in Reinforcement Learning for Large Language Models

Paper • 2604.09459 • Published 8 days ago • 13

Mobile GUI Agent Privacy Personalization with Trajectory Induced Preference Optimization

Paper • 2604.11259 • Published 8 days ago • 12

TRACE: Capability-Targeted Agentic Training

Paper • 2604.05336 • Published 14 days ago • 13

Agentic Aggregation for Parallel Scaling of Long-Horizon Agentic Tasks

Paper • 2604.11753 • Published 8 days ago • 14

AgentSwing: Adaptive Parallel Context Management Routing for Long-Horizon Web Agents

Paper • 2603.27490 • Published 23 days ago • 17

KnowRL: Boosting LLM Reasoning via Reinforcement Learning with Minimal-Sufficient Knowledge Guidance

Paper • 2604.12627 • Published 7 days ago • 97

Rethinking On-Policy Distillation of Large Language Models: Phenomenology, Mechanism, and Recipe

Paper • 2604.13016 • Published 7 days ago • 83

Attention Sink in Transformers: A Survey on Utilization, Interpretation, and Mitigation

Paper • 2604.10098 • Published 10 days ago • 75

The Past Is Not Past: Memory-Enhanced Dynamic Reward Shaping

Paper • 2604.11297 • Published 8 days ago • 136

ClawGUI: A Unified Framework for Training, Evaluating, and Deploying GUI Agents

Paper • 2604.11784 • Published 8 days ago • 140

liked a model about 1 month ago

docling-project/SmolDocling-256M-preview

Image-Text-to-Text • Updated Sep 17, 2025 • 37.9k • 1.61k

liked a dataset 4 months ago

opencsg/Fineweb-Edu-Chinese-V2.1

Viewer • Updated Jan 28 • 958M • 11.2k • 73

liked a model 5 months ago

Qwen/Qwen3-VL-2B-Instruct

Image-Text-to-Text • 2B • Updated Oct 23, 2025 • 72.4M • 370

updated a dataset 5 months ago

wheeler404/catgirl_sft_15k

Viewer • Updated Nov 14, 2025 • 15k • 31 • 1

published a dataset 5 months ago

wheeler404/catgirl_sft_15k

Viewer • Updated Nov 14, 2025 • 15k • 31 • 1

liked a dataset 5 months ago

databricks/databricks-dolly-15k

Viewer • Updated Jun 30, 2023 • 15k • 33.4k • 951

Norris Wheeler

AI & ML interests

Recent Activity

Organizations

wheeler404's activity