mmBERT is trained on 3T tokens from over 1,800 languages and shows state-of-the-art benchmark scores and exceptional low-resource performance.
Papers
Genomic Next-Token Predictors are In-Context Learners
Controlled Generation for Private Synthetic Text
models (53)
jhu-clsp/mmBERT-small • Fill-Mask • 11.7k • 57
jhu-clsp/mmBERT-base • Fill-Mask • 303k • 172
jhu-clsp/mmBERT-checkpoints • 3
jhu-clsp/ettin-decoder-1b • Fill-Mask • 263 • 4
jhu-clsp/ettin-decoder-32m • Text Generation • 194
jhu-clsp/ettin-encoder-1b • Feature Extraction • 405 • 21
jhu-clsp/ettin-encoder-68m • Fill-Mask • 134 • 3
jhu-clsp/ettin-dec-from-enc-32m • Text Generation • 12
jhu-clsp/ettin-encoder-150m • Fill-Mask • 11.5k • 8
jhu-clsp/ettin-decoder-400m • Text Generation • 199 • 2
datasets (38)
jhu-clsp/mmBERT-decay-data • 9.72k • 3
jhu-clsp/mmBERT-midtraining-data • 31.1k • 1
jhu-clsp/megawika-2 • 9.19k • 2
jhu-clsp/ettin-pretraining-data • 21.8k • 8
jhu-clsp/ettin-decay-data • 3.56k • 1
jhu-clsp/astro-llms-benchmark-dataset • Viewer • 40 • 32
jhu-clsp/astro-llms-full-query-data • Viewer • 368 • 25
jhu-clsp/ettin-extension-data • 2.26k
jhu-clsp/ettin-data-order • Viewer • 3B • 7 • 1
jhu-clsp/rank1-R1-MSMARCO • Viewer • 635k • 39 • 2