LLaDA2.0-Uni: Unifying Multimodal Understanding and Generation with Diffusion Large Language Model Paper • 2604.20796 • Published 14 days ago • 239
EXAONE 4.5 Collection LG's First Open-Weight Vision-Language Model for Industrial Intelligence • 5 items • Updated 13 days ago • 42
💧 LFM2.5 Collection Collection of post-trained and base LFM2.5 models. • 30 items • Updated 27 days ago • 135
AR-Lightx2v Collection Efficient autoregressive video generation (i.e., the Self-Forcing family) checkpoints. • 2 items • Updated Mar 2 • 3
Granite Vision Collection Multimodal models built for visual document analysis and image understanding. • 7 items • Updated 6 days ago • 40
Unsloth Diffusion GGUFs Collection Find GGUFs and other variants of diffusion based models like Qwen-Image and FLUX. • 20 items • Updated 13 days ago • 82
pplx-embed Collection Diffusion-Pretrained Dense and Contextual Embeddings • 7 items • Updated Feb 26 • 96
Granite 4.0 Language Models Collection Efficient language models for multilingual generation, coding, RAG, and AI assistant workflows. • 11 items • Updated 6 days ago • 220
Transformers.js V4 demos Collection A collection of demos built with Transformers.js V4 • 24 items • Updated 19 days ago • 58
view article Article Make your ZeroGPU Spaces go brrr with ahead-of-time compilation +2 Sep 2, 2025 • 77
Cosmos-Reason2 Collection Cosmos Reason 2 is an open, customizable, reasoning vision language model (VLM) for physical AI and robotics • 8 items • Updated 5 days ago • 24
SmolDocling: An ultra-compact vision-language model for end-to-end multi-modal document conversion Paper • 2503.11576 • Published Mar 14, 2025 • 157
Agent READMEs: An Empirical Study of Context Files for Agentic Coding Paper • 2511.12884 • Published Nov 17, 2025 • 28