Center for Language and Speech Processing @ JHU

university

https://www.clsp.jhu.edu/

jhuclsp

JHU-CLSP

Recent Activity

TaiMingLu authored a paper 18 days ago

Stronger Normalization-Free Transformers

orionweller new activity 19 days ago

jhu-clsp/mmBERT-decay-data: Update README: Fix TiQuAD's language name to Tigrinya

TaiMingLu authored a paper about 1 month ago

World-in-World: World Models in a Closed-Loop World

Papers

Genomic Next-Token Predictors are In-Context Learners

Controlled Generation for Private Synthetic Text

Collections 3

Spaces 1

Science Hierarchography

Explore academic paper hierarchies and details

Models 53

jhu-clsp/mmBERT-small

Fill-Mask • Updated Oct 17 • 11.7k • 57

jhu-clsp/mmBERT-base

Fill-Mask • Updated Oct 7 • 303k • 172

jhu-clsp/mmBERT-checkpoints

Updated Sep 9 • 3

jhu-clsp/ettin-decoder-1b

Fill-Mask • Updated Jul 21 • 263 • 4

jhu-clsp/ettin-decoder-32m

Text Generation • Updated Jul 18 • 194

jhu-clsp/ettin-encoder-1b

Feature Extraction • Updated Jul 18 • 405 • 21

jhu-clsp/ettin-encoder-68m

Fill-Mask • Updated Jul 18 • 134 • 3

jhu-clsp/ettin-dec-from-enc-32m

Text Generation • Updated Jul 18 • 12

jhu-clsp/ettin-encoder-150m

Fill-Mask • Updated Jul 18 • 11.5k • 8

jhu-clsp/ettin-decoder-400m

Text Generation • Updated Jul 18 • 199 • 2

Datasets 38

jhu-clsp/mmBERT-decay-data

Updated 19 days ago • 9.72k • 3

jhu-clsp/mmBERT-midtraining-data

Updated Oct 13 • 31.1k • 1

jhu-clsp/megawika-2

Updated Sep 3 • 9.19k • 2

jhu-clsp/ettin-pretraining-data

Updated Jul 18 • 21.8k • 8

jhu-clsp/ettin-decay-data

Updated Jul 18 • 3.56k • 1

jhu-clsp/astro-llms-benchmark-dataset

Viewer • Updated Jul 16 • 40 • 32

jhu-clsp/astro-llms-full-query-data

Viewer • Updated Jul 16 • 368 • 25

jhu-clsp/ettin-extension-data

Updated Jul 16 • 2.26k

jhu-clsp/ettin-data-order

Viewer • Updated Jul 16 • 3B • 7 • 1

jhu-clsp/rank1-R1-MSMARCO

Viewer • Updated Feb 26 • 635k • 39 • 2
