2 7 25

Frank Denis PRO

jedisct1

https://00f.net

AI & ML interests

Cryptography.

Recent Activity

updated a Space about 8 hours ago

jedisct1/swival-playground

updated a dataset about 13 hours ago

jedisct1/agent-traces-swival

reacted to qgallouedec's post with 🔥 1 day ago

Shipped hf-sandbox! 🥡 🧪 Running an eval that executes model-generated C on a few thousand prompts? You probably don't want any of that on your laptop. Just shipped hf-sandbox, a Modal-style sandbox API on top of Hugging Face Jobs. Spin up an isolated, ephemeral container, run untrusted code, get the result back. No Docker on your laptop, no infra to manage. Just pip install hf-sandbox. Early days (v0.1); feedback and issues very welcome: 👉 https://github.com/huggingface/hf-sandbox

View all activity

Organizations

updated a Space about 8 hours ago

Swival

🔁

Swival AI coding agent

updated a dataset about 13 hours ago

jedisct1/agent-traces-swival

Traces • Updated about 2 hours ago • 33.5k • 5.45k • 5

reacted to qgallouedec's post with 🔥 1 day ago

Post

9459

Shipped hf-sandbox! 🥡

🧪 Running an eval that executes model-generated C on a few thousand prompts? You probably don't want any of that on your laptop.
Just shipped hf-sandbox, a Modal-style sandbox API on top of Hugging Face Jobs. Spin up an isolated, ephemeral container, run untrusted code, get the result back. No Docker on your laptop, no infra to manage.

Just pip install hf-sandbox.

Early days (v0.1); feedback and issues very welcome:
👉 https://github.com/huggingface/hf-sandbox

1 reply

liked a model 7 days ago

Youssofal/Qwen3.6-27B-MTPLX-Optimized-Speed

Text Generation • 5B • Updated 8 days ago • 7.26k • 13

liked a model 8 days ago

HuggingFaceTB/nanowhale-100m

Text Generation • 0.1B • Updated 8 days ago • 2.19k • 51

reacted to Crownelius's post with 🔥 12 days ago

Post

5898

My Huggingface journey has been a trip!
I wanted to take the time to thank each and every one of you for using my dataset and getting it to go as far as it did. Believe it or not, some neanderthal was and maybe still is trending on huggingface.

Not only did my dataset reach number one, my fine-tuned qwen3.5 model did as well. Top 10. Honestly, ain't much left to do here.

Y'all have given me the desire, no... the craving for more. I am absolutely obsessed with AI now. I want to tweak it... I want to take it apart, just to see what makes everything tick. I want to put it together like Frankenstein and his monster.

The only thing that's stopping this guy is compute. I don't mind spending every penny I have on this. I desperately want to drive AI forward, even just a little bit.

I never knew the clanker hater from a year ago would be saying this.

Thank you all from the bottom of my heart.

Looking forward to showing you what I'm cooking up next. @CompactAI is your only hint!

3 replies

updated a model 13 days ago

jedisct1/MiMo-V2.5-MLX-first224-moe-gup3-down2-attn4-qhead

Text Generation • 271B • Updated 13 days ago • 1.45k

published a model 14 days ago

jedisct1/MiMo-V2.5-MLX-first224-moe-gup3-down2-attn4-qhead

Text Generation • 271B • Updated 13 days ago • 1.45k

updated a model 14 days ago

jedisct1/MiMo-V2.5-MLX-4bit-first130experts-qhead

Text Generation • 160B • Updated 14 days ago • 1.16k

published a model 14 days ago

jedisct1/MiMo-V2.5-MLX-4bit-first130experts-qhead

Text Generation • 160B • Updated 14 days ago • 1.16k

liked a model 15 days ago

XiaomiMiMo/MiMo-V2.5

311B • Updated 4 days ago • 78.2k • 235

liked a model 16 days ago

mlx-community/DeepSeek-V4-Flash-2bit-DQ

Text Generation • 284B • Updated 16 days ago • 29.3k • 41

reacted to anakin87's post with 🔥❤️ 17 days ago

Post

3289

A small model that struggled against a random opponent now beats GPT-5-mini at tic-tac-toe

I took LiquidAI/LFM2-2.6B and trained it through play.

🧑‍🍳 Here's how:

1️⃣ Build a solid RL env with Verifiers (Prime Intellect)
2️⃣ Generate synthetic data: <200 games sampled from GPT-5-mini playing in the env
3️⃣ SFT warm-up to teach format
4️⃣ Group-based RL (CISPO) against opponents making 20-70% random moves
5️⃣ RL again with stronger opponents (0-25% random moves) + 1.25 temperature to push exploration and shake off suboptimal strategies

Done! Beats GPT-5-mini 🏆

---

🎮 Play against the model: anakin87/LFM2-2.6B-mr-tictactoe

🤗 Model: anakin87/LFM2-2.6B-mr-tictactoe

📚 Walkthrough/course: https://github.com/anakin87/llm-rl-environments-lil-course

🤗 Dataset and checkpoints: https://huggingface.co/collections/anakin87/lfm2-26b-mr-tic-tac-toe