Lewis Tunstall's picture

In a Training Loop 🔄

Lewis Tunstall PRO

lewtun

huggingface

·

https://lewtun.github.io/blog/

AI & ML interests

LLMs, LLMs, LLMs

Recent Activity

upvoted an article 3 days ago

Reachy Mini goes fully local

upvoted an article 4 days ago

Shipping a Trillion Parameters With a Hub Bucket: Delta Weight Sync in TRL

published an article 4 days ago

Shipping a Trillion Parameters With a Hub Bucket: Delta Weight Sync in TRL

View all activity

Organizations

lewtun 's models 317

lewtun/gemma-7b-dpo-full-mix1-beta-0.6

Text Generation • 9B • Updated Feb 29, 2024 • 3

lewtun/gemma-7b-dpo-full-mix1-beta-0.4

Text Generation • 9B • Updated Feb 29, 2024 • 3

lewtun/gemma-7b-dpo-full-mix1-beta-0.2

Text Generation • 9B • Updated Feb 29, 2024 • 3

lewtun/gemma-7b-dpo-full-mix1-beta-0.1

Text Generation • 9B • Updated Feb 29, 2024 • 3

lewtun/gemma-7b-dpo-full-ultrafeedback-v0

Text Generation • Updated Feb 29, 2024 • 2

lewtun/gemma-7b-dpo-full-mix-beta-0.1

Updated Feb 29, 2024

lewtun/gemma-7b-dpo-full-orca-v0

Text Generation • 9B • Updated Feb 29, 2024 • 4

lewtun/gemma-7b-sft-full-deita-10k-v0

Text Generation • 9B • Updated Feb 29, 2024 • 5

lewtun/gemma-7b-sft-full-ultrachat-v0

Text Generation • 9B • Updated Feb 29, 2024 • 8 • 1

lewtun/gemma-7b-sft-full-longest-1k-v1

Text Generation • 9B • Updated Feb 29, 2024 • 3

lewtun/gemma-7b-sft-full-longest-1k-v0

Text Generation • 9B • Updated Feb 29, 2024 • 1

lewtun/gemma-7b-sft-full-dolly-v3

Text Generation • 9B • Updated Feb 29, 2024 • 1

lewtun/gemma-7b-sft-full-dolly-v2

Text Generation • 9B • Updated Feb 29, 2024 • 1

lewtun/gemma-7b-sft-full-dolly-v1

Text Generation • 9B • Updated Feb 29, 2024 • 1

lewtun/gemma-7b-sft-full-dolly-v0

Text Generation • 9B • Updated Feb 29, 2024 • 2

lewtun/dummy-model

Text Generation • 0.5B • Updated Feb 21, 2024 • 5

lewtun/zephyr-7b-dpo-qlora-fix

Updated Feb 2, 2024 • 3

lewtun/zephyr-7b-dpo-qlora-8e0975a

Updated Jan 10, 2024 • 5

lewtun/zephyr-7b-dpo-qlora

Updated Jan 9, 2024 • 7

lewtun/handbook-sft-qlora-test

Updated Jan 9, 2024 • 6

lewtun/handbook-sft-test

Text Generation • 7B • Updated Jan 9, 2024 • 4

lewtun/zephyr-7b-dpo-full

Text Generation • 7B • Updated Jan 5, 2024 • 5

lewtun/zephyr-7b-sft-qlora

Updated Jan 4, 2024 • 10

lewtun/kato-dummy

Text Classification • 0.4B • Updated Dec 22, 2023 • 4

lewtun/test-upload

Updated Dec 10, 2023

lewtun/zephyr-7b-sft

Text Generation • 7B • Updated Nov 9, 2023 • 4

lewtun/zephyr-7b-dpo

Text Generation • 7B • Updated Nov 8, 2023 • 4

lewtun/mistral-7b-sft-ultrachat-arithmo-25

Text Generation • Updated Oct 21, 2023 • 8 •

lewtun/mistral-7b-sft-ultrachat-arithmo-50

Text Generation • Updated Oct 21, 2023 • 10 • • 1

lewtun/mistral-7b-sft-ultrachat-arithmo-full

Text Generation • Updated Oct 21, 2023 • 10 • • 1