Sunny Sanyal

Sunny111
https://sites.google.com/view/sunnysanyal/home
  • SunnySanyal9
  • sanyalsunny111

AI & ML interests

Efficient Training Recipes of Large Models (mostly LLMs)

Recent Activity

posted an update about 2 hours ago
Are you familiar with reverse residual connections or looping in language models? Excited to share my Looped-GPT blog post and codebase 🚀 https://github.com/sanyalsunny111/Looped-GPT TL;DR: looping during pre-training improves generalization. Plot shows GPT2 LMs pre-trained with 15.73B OWT tokens. P.S. This is my first post here; I have ~4 followers and zero expectations for reach 😄
upvoted a paper 29 days ago
Pre-training Small Base LMs with Fewer Tokens
liked a model about 1 month ago
GuminiResearch/Gumini-1.5B-Base

Organizations

  • University of Texas at Austin
  • ML Foundations
  • Institute for Foundations of Machine Learning

Sunny111's datasets

None public yet