- DECO: Sparse Mixture-of-Experts with Dense-Comparable Performance on End-Side Devices (arXiv:2605.10933, published 5 days ago)
- PipeLLM: Fast and Confidential Large Language Model Services with Speculative Pipelined Encryption (arXiv:2411.03357, published Nov 4, 2024)
- SmallThinker: A Family of Efficient Large Language Models Natively Trained for Local Deployment (arXiv:2507.20984, published Jul 28, 2025)
- ConPET: Continual Parameter-Efficient Tuning for Large Language Models (arXiv:2309.14763, published Sep 26, 2023)
- ReLU$^2$ Wins: Discovering Efficient Activation Functions for Sparse LLMs (arXiv:2402.03804, published Feb 6, 2024)
- ProSparse: Introducing and Enhancing Intrinsic Activation Sparsity within Large Language Models (arXiv:2402.13516, published Feb 21, 2024)