view article Article Arcade-3B: SLM Optimization via Orthogonal Decoupling of Latent State Spaces 8 days ago • 1
view article Article Exploring New Frontiers of LLMs: Adaptive Dual-Search Distillation (ADS) and the 30B Model Open Beta 22 days ago • 2
view article Article Shattering the Memory Wall: O(1) Inference and Causal Monoid State Compression in Spartacus-1B 26 days ago • 2