view post Post 8897 Gemma 4 12B can now run locally on just 8GB RAM via Dynamic GGUFs.Google's new model, Gemma 4 12B Unified supports image, audio and 256K context.You can run and train the model via Unsloth Studio.GGUF: unsloth/gemma-4-12b-it-GGUFGuide: https://unsloth.ai/docs/models/gemma-4 See translation 5 replies · 🔥 43 43 👍 13 13 🤗 2 2 + Reply
nvidia/NVIDIA-Nemotron-3-Ultra-550B-A55B-NVFP4 Text Generation • 335B • Updated 3 days ago • 153k • • 172
nvidia/NVIDIA-Nemotron-3-Ultra-550B-A55B-BF16 Text Generation • 561B • Updated 3 days ago • 67.2k • 203
nvidia/nemotron-3.5-asr-streaming-0.6b Automatic Speech Recognition • Updated 7 days ago • 3.55k • • 394
view article Article How to Fine-Tune Nemotron 3.5 ASR for Your Language, Domain, or Accent nvidia • 9 days ago • 53
Running Featured 109 Cohere Transcribe WebGPU ⚡ 109 Run Cohere Transcribe locally in your browser on WebGPU.