P Lin
pufflin
AI & ML interests
None yet
Recent Activity
upvoted a paper 17 days ago
Guidance Contrastive Token Credit Assignment for Discrete Policy Optimization
upvoted a paper 17 days ago
Harness-1: Reinforcement Learning for Search Agents with State-Externalizing Harnesses
liked a model 17 days ago
pat-jj/harness-1
Organizations
None yet