In a Training Loop 🔄

Lewis Tunstall PRO

lewtun

huggingface

https://lewtun.github.io/blog/

AI & ML interests

LLMs, LLMs, LLMs

Recent Activity

upvoted an article 3 days ago

Reachy Mini goes fully local

upvoted an article 3 days ago

Shipping a Trillion Parameters With a Hub Bucket: Delta Weight Sync in TRL

published an article 4 days ago

Shipping a Trillion Parameters With a Hub Bucket: Delta Weight Sync in TRL

lewtun's models 317

lewtun/Llama-3.1-8B-SFT-LoRA-packing-no-lm-head

Updated Sep 30, 2024

lewtun/Llama-3.1-8B-SFT-LoRA-no-packing

Updated Sep 30, 2024

lewtun/Llama-3.1-8B-SFT-QLoRA-packing

Updated Sep 30, 2024

lewtun/Llama-3.1-8B-SFT-LoRA-packing-no-saved-modules

Updated Sep 30, 2024

lewtun/Llama-3.1-8B-SFT-LoRA-packing

Updated Sep 30, 2024

lewtun/Llama-3.1-8B-SFT-LoRA-packing-pad-token-eos

Updated Sep 30, 2024

lewtun/Llama-3.1-8B-SFT-QLoRA-packing-pad-token-eos

Updated Sep 30, 2024

lewtun/Llama-3.1-8B-SFT-full-packing

Text Generation • 8B • Updated Sep 30, 2024 • 2

lewtun/Llama-3.1-8B-SFT-LoRA

Updated Sep 27, 2024

lewtun/Qwen2-0.5B-Reward

Text Classification • 0.5B • Updated Sep 23, 2024 • 3

lewtun/gemma-2-2b-it-gkd-9b

Updated Sep 14, 2024

lewtun/gemma-2-2b-it-gkd-27b

Updated Sep 14, 2024

lewtun/gemma-2-2b-it-gkd

Updated Sep 14, 2024

lewtun/gemma-2-2b-gkd

Updated Sep 14, 2024

lewtun/tmp-dpo

Text Generation • 1.03M • Updated Sep 11, 2024 • 1

lewtun/dpo-model

Updated Sep 9, 2024

lewtun/dpo-model-lora

Updated Sep 9, 2024 • 2

lewtun/sft_openassistant-guanaco

Updated Sep 9, 2024

lewtun/reward-model

Text Classification • 0.5B • Updated Sep 5, 2024 • 1

lewtun/pythia-6.9b-deduped-tldr-online-dpo

7B • Updated Aug 28, 2024 • 1

lewtun/qwen2-1.5B-ultrafeedback-online-dpo

2B • Updated Aug 28, 2024 • 2

lewtun/qwen2-0.5B-ultrafeedback-online-dpo

0.6B • Updated Aug 28, 2024 • 2

lewtun/pythia-2.8b-deduped-tldr-online-dpo

3B • Updated Aug 27, 2024 • 2

lewtun/qwen2-7B-ultrafeedback-online-dpo-bs-1

Updated Aug 27, 2024

lewtun/qwen2-7B-ultrafeedback-online-dpo-bs-2

Updated Aug 27, 2024

lewtun/qwen2-7B-ultrafeedback-online-dpo

Updated Aug 27, 2024

lewtun/pythia-1b-deduped-tldr-online-dpo

1B • Updated Aug 27, 2024 • 2

lewtun/pythia-1b-tldr-online-dpo

Updated Aug 27, 2024

lewtun/qwen2-0.5B-lr-5e-7

Updated Aug 27, 2024

lewtun/qwen2-7B-lr-3e-6

Updated Aug 26, 2024