pplx-embed Collection Diffusion-Pretrained Dense and Contextual Embeddings • 7 items • Updated Feb 26 • 96
view article Article GGML and llama.cpp join HF to ensure the long-term progress of Local AI +4 ggerganov, ngxson, allozaur, lysandre, victor, julien-c • Feb 20 • 505
Nemotron Speech Collection Open, state-of-the-art, production‑ready enterprise speech models from the NVIDIA Speech research team for ASR, TTS, Speaker Diarization and S2S • 12 items • Updated 4 days ago • 51
view article Article We Got Claude to Fine-Tune an Open Source LLM burtenshaw, evalstate • Dec 4, 2025 • 624
view article Article Tricks from OpenAI gpt-oss YOU 🫵 can use with transformers +5 ariG23498, sergiopaniego, reach-vb, pcuenq, ArthurZ, SaylorTwift, cyrilvallez • Sep 11, 2025 • 187
JSON Mode Reasoning Collection A collection of structured outputs reasoning dataset • 3 items • Updated Jul 23, 2025 • 3
Tool Use Reasoning Collection A collection of tool use reasoning dataset in Hermes format • 5 items • Updated Jul 23, 2025 • 9
NeMo Curator - Classifier Models Collection Classifier models that can be used in NeMo Curator for labelling/filtering datasets. • 13 items • Updated about 16 hours ago • 27
Finance Commons Collection A large collection of multimodal financial documents in open data. • 7 items • Updated Jul 17, 2024 • 14
BitNet: Scaling 1-bit Transformers for Large Language Models Paper • 2310.11453 • Published Oct 17, 2023 • 107
LLM Speculative Decoding Experiments Collection Tiny language models meant to serve as draft models for speculative decoding. • 6 items • Updated Aug 1, 2025 • 3
The Big Benchmarks Collection Collection Gathering benchmark spaces on the hub (beyond the Open LLM Leaderboard) • 13 items • Updated Nov 18, 2024 • 265
Zephyr 7B Collection Models, datasets, and demos associated with Zephyr 7B. For code to train the models, see: https://github.com/huggingface/alignment-handbook • 8 items • Updated Mar 2 • 153
Tiny Series Collection Tiny datasets that empower the foundation of Small Language Model! • 14 items • Updated 1 day ago • 44
ELECTRA release Collection This collection regroups the ELECTRA models released by the Google team. • 6 items • Updated Mar 12 • 13
CommonGen: A Constrained Text Generation Challenge for Generative Commonsense Reasoning Paper • 1911.03705 • Published Nov 9, 2019 • 1