Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
5
liyaxuan
lllyx
Follow
0 followers
ยท
1 following
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
10 minutes ago
GDPO: Group reward-Decoupled Normalization Policy Optimization for Multi-reward RL Optimization
upvoted
a
paper
17 days ago
Pre-training Distillation for Large Language Models: A Design Space Exploration
upvoted
a
paper
3 months ago
A Survey of Reinforcement Learning for Large Reasoning Models
View all activity
Organizations
None yet
spaces
1
Sleeping
ML Patch
๐
Submit data for inference and view results
models
0
None public yet
datasets
0
None public yet