Granite 4.1 Language Models Collection Efficient language models for multilingual generation, coding, RAG, and AI assistant workflows. • 6 items • Updated 3 days ago • 30
Laguna XS.2 Collection Designed for agentic coding and long-horizon work on a local machine. Apache 2.0. • 4 items • Updated 4 days ago • 14
FIPO: Eliciting Deep Reasoning with Future-KL Influenced Policy Optimization Paper • 2603.19835 • Published Mar 20 • 349
CapRL: Stimulating Dense Image Caption Capabilities via Reinforcement Learning Paper • 2509.22647 • Published Sep 26, 2025 • 36
HauhauCS Safetensor Benchmarks Collection Benchmarks and safetensor formats from any analysis comparing abliterated models • 8 items • Updated 1 day ago • 4
Introspective Diffusion Language Models (I-DLM) Collection Model checkpoints for I-DLM. Paper: https://arxiv.org/abs/2604.11035 • 3 items • Updated 18 days ago • 10
InCoder-32B-Thinking: Industrial Code World Model for Thinking Paper • 2604.03144 • Published 30 days ago • 233
MegaTrain: Full Precision Training of 100B+ Parameter Large Language Models on a Single GPU Paper • 2604.05091 • Published 27 days ago • 45
EXAONE 4.5 Collection LG's First Open-Weight Vision-Language Model for Industrial Intelligence • 5 items • Updated 10 days ago • 42
DFlash Collection Block Diffusion for Flash Speculative Decoding • 15 items • Updated 7 days ago • 94
Why Does Self-Distillation (Sometimes) Degrade the Reasoning Capability of LLMs? Paper • 2603.24472 • Published Mar 25 • 54