Themis
A collection of preference datasets used for training and evaluation of code reward models.
A collection of strong code reward models trained on a diverse collection of code preferences.
A collection of preference model pretraining checkpoints, trained on general preference datasets and intended as precursors for code reward models.