ace-1/mgpt2-tokenizer
Branch: main
1 contributor
History: 8 commits
Latest commit: 68abf7e (verified) by ace-1, 6 days ago: Update tokenizer artifact (verified corpus retrain) + eval metrics
Name | Size | Last commit | Updated
tokenizer (directory) | - | Fix transformers v5 auto_map + HF init | about 1 month ago
.gitattributes | 1.52 kB | initial commit | about 2 months ago
README.md | 1.8 kB | Publish mgpt2 tokenizer (GPT-2 exact merges) + eval metrics | about 1 month ago
added_tokens.json | 29 Bytes | Upload mgpt2 tokenizer | about 1 month ago
evaluation.json | 7.57 kB | Update tokenizer artifact (verified corpus retrain) + eval metrics | 6 days ago
special_tokens_map.json | 35 Bytes | Upload mgpt2 tokenizer | about 2 months ago
tokenization_mgpt2.py | 80 Bytes | Publish mgpt2 tokenizer (GPT-2 exact merges) + eval metrics | about 1 month ago
tokenizer.model | 464 kB | Update tokenizer artifact (verified corpus retrain) + eval metrics | 6 days ago
tokenizer.vocab | 1.76 MB | Update tokenizer artifact (verified corpus retrain) + eval metrics | 6 days ago
tokenizer_config.json | 530 Bytes | Fix transformers v5 auto_map + HF init | about 1 month ago