togethercomputer/m2-bert-80M-8k-retrieval
Tags: Sentence Similarity, Transformers, PyTorch, English, m2_bert, text-classification, custom_code
arXiv: 2310.12109
License: apache-2.0
Revision: refs/pr/3
m2-bert-80M-8k-retrieval (352 MB, 2 contributors, history: 18 commits)
Latest commit: Tom Aarsen, "Remove AutoTokenizer from the config" (7d7932e), almost 2 years ago
File                             Size          Last commit message                                        Updated
.gitattributes                   1.52 kB       initial commit                                             about 2 years ago
README.md                        3.47 kB       Update README.md                                           almost 2 years ago
bert_layers.py                   40.7 kB       Automodel support                                          almost 2 years ago
bert_padding.py                  5.97 kB       Automodel support                                          almost 2 years ago
blockdiag_linear.py              3.16 kB       Automodel support                                          almost 2 years ago
blockdiag_multiply.py            2.87 kB       Automodel support                                          almost 2 years ago
config.json                      1.42 kB       Remove AutoTokenizer from the config                       almost 2 years ago
config.yaml                      922 Bytes     8k retrieval model                                         about 2 years ago
configuration_bert.py            2.41 kB       Automodel support                                          almost 2 years ago
generation_config.json           130 Bytes     Automodel support                                          almost 2 years ago
hyena_utils.py                   8.35 kB       Automodel support                                          almost 2 years ago
monarch_mixer_sequence_mixer.py  5.14 kB       Automodel support                                          almost 2 years ago
pytorch_model.bin                351 MB (xet)  Model                                                      almost 2 years ago
special_tokens_map.json          125 Bytes     Add bert-base-cased tokenizer with model_max_length: 8192  almost 2 years ago
structured_linear.py             2.58 kB       Automodel support                                          almost 2 years ago
tokenizer.json                   669 kB        Add bert-base-cased tokenizer with model_max_length: 8192  almost 2 years ago
tokenizer_config.json            316 Bytes     Add bert-base-cased tokenizer with model_max_length: 8192  almost 2 years ago
version.txt                      2 Bytes       8k retrieval model                                         about 2 years ago
vocab.txt                        213 kB        Add bert-base-cased tokenizer with model_max_length: 8192  almost 2 years ago