3 25 22

Andrew Zhao

andrewzh

https://andrewzh112.github.io/

AI & ML interests

Reinforcement Learning, Agents

Recent Activity

upvoted a paper 19 days ago

Native Parallel Reasoner: Reasoning in Parallelism via Self-Distilled Reinforcement Learning

upvoted a paper 3 months ago

Vision-Zero: Scalable VLM Self-Improvement via Strategic Gamified Self-Play

upvoted a paper 3 months ago

GEM: A Gym for Agentic LLMs

View all activity

Organizations

None yet

upvoted a paper 19 days ago

Native Parallel Reasoner: Reasoning in Parallelism via Self-Distilled Reinforcement Learning

Paper • 2512.07461 • Published 23 days ago • 74

upvoted 3 papers 3 months ago

upvoted a paper 4 months ago

A Survey of Reinforcement Learning for Large Reasoning Models

Paper • 2509.08827 • Published Sep 10 • 190

upvoted a paper 5 months ago

A Survey of Self-Evolving Agents: On Path to Artificial Super Intelligence

Paper • 2507.21046 • Published Jul 28 • 82

upvoted a paper 6 months ago

SPIRAL: Self-Play on Zero-Sum Games Incentivizes Reasoning via Multi-Agent Multi-Turn Reinforcement Learning

Paper • 2506.24119 • Published Jun 30 • 50

upvoted 2 papers 7 months ago

Beyond the 80/20 Rule: High-Entropy Minority Tokens Drive Effective Reinforcement Learning for LLM Reasoning

Paper • 2506.01939 • Published Jun 2 • 187

Seek in the Dark: Reasoning via Test-Time Instance-Level Policy Gradient in Latent Space

Paper • 2505.13308 • Published May 19 • 27

authored a paper 8 months ago

Absolute Zero: Reinforced Self-play Reasoning with Zero Data

Paper • 2505.03335 • Published May 6 • 188

updated a collection 8 months ago

Absolute Zero Reasoner

Collection

6 items • Updated May 9 • 56

upvoted a paper 8 months ago

Absolute Zero: Reinforced Self-play Reasoning with Zero Data

Paper • 2505.03335 • Published May 6 • 188

commented a paper 8 months ago

Absolute Zero: Reinforced Self-play Reasoning with Zero Data

Paper • 2505.03335 • Published May 6 • 188 •

updated 2 models 8 months ago

andrewzh/Absolute_Zero_Reasoner-Coder-14b

15B • Updated May 6 • 16 • 29

andrewzh/Absolute_Zero_Reasoner-Coder-3b

3B • Updated May 6 • 30 • 14

updated a collection 8 months ago

Absolute Zero Reasoner

Collection

6 items • Updated May 9 • 56

published 2 models 8 months ago

andrewzh/Absolute_Zero_Reasoner-Coder-14b

15B • Updated May 6 • 16 • 29

andrewzh/Absolute_Zero_Reasoner-Coder-3b

3B • Updated May 6 • 30 • 14

updated a collection 8 months ago

Absolute Zero Reasoner

Collection

6 items • Updated May 9 • 56

Andrew Zhao

AI & ML interests

Recent Activity

Organizations

andrewzh's activity