18 36 64

PenutChen

penut85420

penut85420

AI & ML interests

LLM, Quantization

Recent Activity

liked a model 2 months ago

microsoft/VibeVoice-AcousticTokenizer

upvoted an article 9 months ago

From Zero to GPU: A Guide to Building and Scaling Production-Ready CUDA Kernels

upvoted a paper 10 months ago

Unifying Demonstration Selection and Compression for In-Context Learning

View all activity

Organizations

liked a model 2 months ago

microsoft/VibeVoice-AcousticTokenizer

Feature Extraction • Updated Feb 6 • 716 • 14

upvoted an article 9 months ago

Article

From Zero to GPU: A Guide to Building and Scaling Production-Ready CUDA Kernels

drbh, danieldk

•

Aug 18, 2025

• 100

upvoted a paper 10 months ago

Unifying Demonstration Selection and Compression for In-Context Learning

Paper • 2405.17062 • Published May 27, 2024 • 1

liked a model 11 months ago

chandar-lab/NeoBERT

Feature Extraction • 0.2B • Updated Mar 25, 2025 • 19k • 197

upvoted a paper 11 months ago

TASTE: Text-Aligned Speech Tokenization and Embedding for Spoken Language Modeling

Paper • 2504.07053 • Published Apr 9, 2025 • 6

liked a dataset 11 months ago

institutional/institutional-books-1.0

Viewer • Updated 30 days ago • 983k • 6.75k • 277

updated a Space 12 months ago

HelloGradio

🏢

Test your Japanese kana knowledge quiz-style

published a Space 12 months ago

HelloGradio

🏢

Test your Japanese kana knowledge quiz-style

updated a Space 12 months ago

Test

📚

Take a Japanese kana quiz to test your knowledge 🚀

published a Space 12 months ago

Test

📚

Take a Japanese kana quiz to test your knowledge 🚀

updated a Space 12 months ago

JpVocab

✏

Take a Japanese vocabulary quiz

commented on 🐯 Liger GRPO meets TRL 12 months ago

Sounds perfect!

upvoted an article 12 months ago

Article

🐯 Liger GRPO meets TRL

shisahni, kashif, smohammadi, ShirinYamani, m0m0chen, liberty4321

•

May 25, 2025

• 53

commented on 🐯 Liger GRPO meets TRL 12 months ago

Does Liger Kernel affect training speed at all? Is it faster, slower, or no difference compared to regular GRPO?

updated a Space about 1 year ago

KanaQuiz

📝

Take a kana quiz to practice Japanese hiragana and katakana

upvoted a paper about 1 year ago

BLIP3-o: A Family of Fully Open Unified Multimodal Models-Architecture, Training and Dataset

Paper • 2505.09568 • Published May 14, 2025 • 99

liked a Space about 1 year ago

Computer Agent

🖥

984

Interact with an AI agent to perform web tasks

liked a model about 1 year ago

JetBrains/Mellum-4b-base

Text Generation • 4B • Updated May 7, 2025 • 3.13k • • 443

upvoted a collection about 1 year ago

Tiny Series

Collection

Tiny datasets that empower the foundation of Small Language Model! • 14 items • Updated 11 days ago • 44

liked a model about 1 year ago

Qwen/Qwen2.5-Omni-7B

Any-to-Any • 11B • Updated Apr 30, 2025 • 885k • 1.9k

PenutChen

AI & ML interests

Recent Activity

Organizations

penut85420's activity

From Zero to GPU: A Guide to Building and Scaling Production-Ready CUDA Kernels

HelloGradio

HelloGradio

Test

Test

JpVocab

🐯 Liger GRPO meets TRL

KanaQuiz

Computer Agent