Hugging Face
spuliz/Ahilab
Tags: Text Generation, Transformers, Safetensors, deepseek_v3, conversational, custom_code, text-generation-inference, fp8
arxiv: 2501.12948
License: mit
Branch: main
Ahilab: 2.82 GB, 1 contributor
History: 7 commits
Latest commit: "fp8" (8555523) by spuliz, 11 months ago
Name                               Size           Last commit                                  Updated
figures                            -              Reinitialized repo with Deepseek R1 changes  11 months ago
.gitattributes                     1.52 kB        Reinitialized repo with Deepseek R1 changes  11 months ago
LICENSE                            1.06 kB        Reinitialized repo with Deepseek R1 changes  11 months ago
README.md                          16 kB          Reinitialized repo with Deepseek R1 changes  11 months ago
config.json                        1.73 kB        fp8                                          11 months ago
configuration_deepseek.py          10.6 kB        Reinitialized repo with Deepseek R1 changes  11 months ago
generation_config.json             171 Bytes      Reinitialized repo with Deepseek R1 changes  11 months ago
model-00039-of-000163.safetensors  2.8 GB (xet)   Reinitialized repo with Deepseek R1 changes  11 months ago
model.safetensors.index.json       8.9 MB         Reinitialized repo with Deepseek R1 changes  11 months ago
modeling_deepseek.py               77.8 kB        fix                                          11 months ago
tokenizer.json                     7.85 MB        Reinitialized repo with Deepseek R1 changes  11 months ago
tokenizer_config.json              3.59 kB        Reinitialized repo with Deepseek R1 changes  11 months ago