Sangmin Bae

raymin0223

AI & ML interests

None yet

Recent Activity

liked a model about 2 months ago

upstage/Solar-Open-100B

Organizations

None yet

upvoted a paper 27 days ago

Generative Visual Code Mobile World Models

Paper • 2602.01576 • Published 29 days ago • 41

upvoted 2 papers 3 months ago

TiDAR: Think in Diffusion, Talk in Autoregression

Paper • 2511.08923 • Published Nov 12, 2025 • 128

Tiny Model, Big Logic: Diversity-Driven Optimization Elicits Large-Model Reasoning Ability in VibeThinker-1.5B

Paper • 2511.06221 • Published Nov 9, 2025 • 132

upvoted a paper 4 months ago

Supervised Reinforcement Learning: From Expert Trajectories to Step-wise Reasoning

Paper • 2510.25992 • Published Oct 29, 2025 • 48

upvoted an article 4 months ago

Why Did MiniMax M2 End Up as a Full Attention Model?

Article • Oct 30, 2025

upvoted 2 papers 4 months ago

KLASS: KL-Guided Fast Inference in Masked Diffusion Models

Paper • 2511.05664 • Published Nov 7, 2025 • 37

Scaling Latent Reasoning via Looped Language Models

Paper • 2510.25741 • Published Oct 29, 2025 • 229

upvoted 11 papers 5 months ago

Agentic Context Engineering: Evolving Contexts for Self-Improving Language Models

Paper • 2510.04618 • Published Oct 6, 2025 • 129

MemMamba: Rethinking Memory Patterns in State Space Model

Paper • 2510.03279 • Published Sep 28, 2025 • 73

Apriel-1.5-15b-Thinker

Paper • 2510.01141 • Published Oct 1, 2025 • 121

SwiReasoning: Switch-Thinking in Latent and Explicit for Pareto-Superior Reasoning LLMs

Paper • 2510.05069 • Published Oct 6, 2025 • 13

LayerSkip: Enabling Early Exit Inference and Self-Speculative Decoding

Paper • 2404.16710 • Published Apr 25, 2024 • 80

Paper2Video: Automatic Video Generation from Scientific Papers

Paper • 2510.05096 • Published Oct 6, 2025 • 119

Hybrid Architectures for Language Models: Systematic Analysis and Design Insights

Paper • 2510.04800 • Published Oct 6, 2025 • 37

upvoted a paper 7 months ago

HunyuanWorld 1.0: Generating Immersive, Explorable, and Interactive 3D Worlds from Words or Pixels

Paper • 2507.21809 • Published Jul 29, 2025 • 140

upvoted a paper 8 months ago

Vision-Language-Vision Auto-Encoder: Scalable Knowledge Distillation from Diffusion Models

Paper • 2507.07104 • Published Jul 9, 2025 • 46
