GGML and llama.cpp join HF to ensure the long-term progress of Local AI Feb 20 • 490
I Let a Lobster Run My Jetson: What OpenClaw Taught Me About the Future of Computing Feb 19 • 15
LateOn-Code & ColGrep: LightOn unveils state-of-the-art code retrieval models and code search tooling Feb 12 • 51
Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM Mar 12, 2025 • 492
From Chunks to Blocks: Accelerating Uploads and Downloads on the Hub Feb 12, 2025 • 80
Introducing multi-backends (TRT-LLM, vLLM) support for Text Generation Inference Jan 16, 2025 • 76
Train 400x faster Static Embedding Models with Sentence Transformers Jan 15, 2025 • 228
Cosmos Collection ⚠️ This collection is archived. 👉 https://huggingface.co/collections/nvidia/nvidia-cosmos-2 • 14 items • Updated about 18 hours ago • 300
Releasing Outlines-core 0.1.0: structured generation in Rust and Python Oct 22, 2024 • 44