Itay Levy

itlevy

AI & ML interests

None yet

Recent Activity

new activity 27 days ago

nvidia/Llama-3_3-Nemotron-Super-49B-v1_5-NVFP4:Llama-3_3-Nemotron-Super-49B-v1_5-NVFP4/llama_nemotron_toolcall_parser_no_streaming.py missing

new activity 27 days ago

nvidia/Llama-3_3-Nemotron-Super-49B-v1_5-NVFP4:Update README and toolcall_parser

updated a collection about 1 month ago

New activity in nvidia/Llama-3_3-Nemotron-Super-49B-v1_5-NVFP4 27 days ago

Llama-3_3-Nemotron-Super-49B-v1_5-NVFP4/llama_nemotron_toolcall_parser_no_streaming.py missing

#1 opened about 1 month ago by

Update README and toolcall_parser

#5 opened 27 days ago by

updated a collection about 1 month ago

Llama Nemotron

Open, Production-ready Enterprise Models • 12 items • Updated about 20 hours ago • 75

published a model about 1 month ago

nvidia/Llama-3_3-Nemotron-Super-49B-v1_5-NVFP4

Text Generation • 26B • Updated 27 days ago • 2.4k • 13

updated a model about 1 month ago

nvidia/Llama-3_3-Nemotron-Super-49B-v1_5-NVFP4

Text Generation • 26B • Updated 27 days ago • 2.4k • 13

New activity in nvidia/Llama-3_1-Nemotron-Ultra-253B-v1 6 months ago

_prepare_generation_config bugfix (failed due to version update in transformers)

#14 opened 6 months ago by

New activity in nvidia/Llama-3_1-Nemotron-51B-Instruct 6 months ago

_prepare_generation_config bugfix (failed due to version update in transformers)

#25 opened 6 months ago by

New activity in nvidia/Llama-3_1-Nemotron-Ultra-253B-CPT-v1 6 months ago

_prepare_generation_config bugfix (failed due to version update in transformers)

#2 opened 6 months ago by

published a model 9 months ago

nvidia/Llama-3_1-Nemotron-Ultra-253B-CPT-v1

Text Generation • 253B • Updated Jul 6 • 77 • 6

updated a model 9 months ago

nvidia/Llama-3_1-Nemotron-51B-Instruct

Text Generation • 52B • Updated Jul 6 • 1.17k • 209

New activity in nvidia/Llama-3_3-Nemotron-Super-49B-v1 9 months ago

Nemotron 253B?

#10 opened 9 months ago by

New activity in nvidia/Llama-3_1-Nemotron-Ultra-253B-v1 9 months ago

How come this pruned model has 162 layers

#3 opened 9 months ago by

updated a model 9 months ago

nvidia/Llama-3_1-Nemotron-Ultra-253B-CPT-v1

Text Generation • 253B • Updated Jul 6 • 77 • 6

New activity in nvidia/Llama-3_1-Nemotron-Ultra-253B-CPT-v1 9 months ago

add model card

#1 opened 9 months ago by

authored 2 papers 9 months ago

FFN Fusion: Rethinking Sequential Computation in Large Language Models

Paper • 2503.18908 • Published Mar 24 • 19

authored a paper about 1 year ago

Puzzle: Distillation-Based NAS for Inference-Optimized LLMs

Paper • 2411.19146 • Published Nov 28, 2024 • 17

New activity in nvidia/Llama-3_1-Nemotron-51B-Instruct about 1 year ago

Patching hf bug that creates wrong cache length if only inputs_embeds are passed to the model

#19 opened about 1 year ago by

DeciLMForCausalLM(DeciLMPreTrainedModel, GenerationMixin) for v4.50

#16 opened about 1 year ago by

add batch_size attribute to VariableCache

#15 opened about 1 year ago by