Zixi "Oz" Li PRO

OzTianlu

https://github.com/lizixi-0x2F

lizixi-0x2F

AI & ML interests

My research focuses on deep reasoning with small language models, Transformer architecture innovation, and knowledge distillation for efficient alignment and transfer.

Recent Activity

reacted to danielhanchen's post with 🔥 7 days ago

We collaborated with NVIDIA to teach you about Reinforcement Learning and RL environments. 💚 Learn: • Why RL environments matter + how to build them • When RL is better than SFT • GRPO and RL best practices • How verifiable rewards and RLVR work Blog: https://unsloth.ai/blog/rl-environments

repliedto their post 7 days ago

Arcade-3B — SmolReasoner https://huggingface.co/NoesisLab/Arcade-3B Arcade-3B is a 3B instruction-following and reasoning model built on SmolLM3-3B. It is the public release from the ARCADE project at NoesisLab, which investigates the State–Constraint Orthogonality Hypothesis: standard Transformer hidden states conflate factual content and reasoning structure in the same subspace, and explicitly decoupling them improves generalization.

updated a model 7 days ago

NoesisLab/Arcade-3B

View all activity

Organizations

authored a paper 2 months ago

Reasoning: From Reflection to Solution

Paper • 2511.11712 • Published Nov 12, 2025 • 2

Zixi "Oz" Li PRO

AI & ML interests

Recent Activity

Organizations

OzTianlu's activity