THE ORB - a galois77 Collection

galois77 's Collections

Thousand brains theory

THE ORB

energy based models

OCR

Poetry

Agentic

Videos

ahan

Image generation

Training optimization

RL

Benchmarks and challenges

THE ORB

updated 16 days ago

UniVA: Universal Video Agent towards Open-Source Next-Generation Video Generalist

Paper • 2511.08521 • Published Nov 11 • 37
Black-Box On-Policy Distillation of Large Language Models

Paper • 2511.10643 • Published Nov 13 • 47
Depth Anything 3: Recovering the Visual Space from Any Views

Paper • 2511.10647 • Published Nov 13 • 94
VGGT: Visual Geometry Grounded Transformer

Paper • 2503.11651 • Published Mar 14 • 34
Music Flamingo: Scaling Music Understanding in Audio Language Models

Paper • 2511.10289 • Published Nov 13 • 10
Canvas-to-Image: Compositional Image Generation with Multimodal Controls

Paper • 2511.21691 • Published 20 days ago • 33