In a Training Loop 🔄
lewtun
AI & ML interests
LLMs, LLMs, LLMs
lewtun/gemma-7b-dpo-full-mix1-beta-0.6
Text Generation
• 9B • Updated • 3
lewtun/gemma-7b-dpo-full-mix1-beta-0.4
Text Generation
• 9B • Updated • 3
lewtun/gemma-7b-dpo-full-mix1-beta-0.2
Text Generation
• 9B • Updated • 3
lewtun/gemma-7b-dpo-full-mix1-beta-0.1
Text Generation
• 9B • Updated • 3
lewtun/gemma-7b-dpo-full-ultrafeedback-v0
Text Generation
• Updated • 2
lewtun/gemma-7b-dpo-full-mix-beta-0.1
Updated
lewtun/gemma-7b-dpo-full-orca-v0
Text Generation
• 9B • Updated • 4
lewtun/gemma-7b-sft-full-deita-10k-v0
Text Generation
• 9B • Updated • 5
lewtun/gemma-7b-sft-full-ultrachat-v0
Text Generation
• 9B • Updated • 8 • 1
lewtun/gemma-7b-sft-full-longest-1k-v1
Text Generation
• 9B • Updated • 3
lewtun/gemma-7b-sft-full-longest-1k-v0
Text Generation
• 9B • Updated • 1
lewtun/gemma-7b-sft-full-dolly-v3
Text Generation
• 9B • Updated • 1
lewtun/gemma-7b-sft-full-dolly-v2
Text Generation
• 9B • Updated • 1
lewtun/gemma-7b-sft-full-dolly-v1
Text Generation
• 9B • Updated • 1
lewtun/gemma-7b-sft-full-dolly-v0
Text Generation
• 9B • Updated • 2
Text Generation
• 0.5B • Updated • 5
lewtun/zephyr-7b-dpo-qlora-fix
lewtun/zephyr-7b-dpo-qlora-8e0975a
lewtun/zephyr-7b-dpo-qlora
lewtun/handbook-sft-qlora-test
Text Generation
• 7B • Updated • 4
lewtun/zephyr-7b-dpo-full
Text Generation
• 7B • Updated • 5
lewtun/zephyr-7b-sft-qlora
Text Classification
• 0.4B • Updated • 4
Text Generation
• 7B • Updated • 4
Text Generation
• 7B • Updated • 4
lewtun/mistral-7b-sft-ultrachat-arithmo-25
Text Generation
• Updated • 8
lewtun/mistral-7b-sft-ultrachat-arithmo-50
Text Generation
• Updated • 10 • 1
lewtun/mistral-7b-sft-ultrachat-arithmo-full
Text Generation
• Updated • 10 • 1