Llama Nemotron Collection Open, Production-ready Enterprise Models • 12 items • Updated about 20 hours ago • 75
nvidia/Llama-3_3-Nemotron-Super-49B-v1_5-NVFP4 Text Generation • 26B • Updated 27 days ago • 2.4k • 13
nvidia/Llama-3_3-Nemotron-Super-49B-v1_5-NVFP4 Text Generation • 26B • Updated 27 days ago • 2.4k • 13
FFN Fusion: Rethinking Sequential Computation in Large Language Models Paper • 2503.18908 • Published Mar 24 • 19
FFN Fusion: Rethinking Sequential Computation in Large Language Models Paper • 2503.18908 • Published Mar 24 • 19
Puzzle: Distillation-Based NAS for Inference-Optimized LLMs Paper • 2411.19146 • Published Nov 28, 2024 • 17