llm-jp/optimal-sparsity-math-d1024-E32-k2-3.5B-A470M
Text Generation
•
3B
•
Updated
•
4
llm-jp/optimal-sparsity-math-d1024-E16-k2-1.9B-A470M
Text Generation
•
2B
•
Updated
•
2
llm-jp/optimal-sparsity-math-d1024-E8-k2-1.1B-A470M
Text Generation
•
1B
•
Updated
•
2
llm-jp/optimal-sparsity-math-d512-E256-k2-6.6B-A170M
Text Generation
•
7B
•
Updated
•
2
llm-jp/optimal-sparsity-math-d512-E128-k2-3.3B-A170M
Text Generation
•
3B
•
Updated
•
2
llm-jp/optimal-sparsity-math-d512-E64-k2-1.7B-A170M
Text Generation
•
2B
•
Updated
•
2
llm-jp/optimal-sparsity-math-d512-E32-k2-920M-A170M
Text Generation
•
0.9B
•
Updated
•
4
llm-jp/optimal-sparsity-math-d512-E16-k2-520M-A170M
Text Generation
•
0.5B
•
Updated
•
3
llm-jp/optimal-sparsity-math-d512-E8-k2-320M-A170M
Text Generation
•
0.3B
•
Updated
•
1
llm-jp/optimal-sparsity-code-d2048-E128-k16-52.2B-A7.1B
Text Generation
•
52B
•
Updated
•
2
llm-jp/optimal-sparsity-code-d2048-E64-k16-26.4B-A7.1B
Text Generation
•
26B
•
Updated
•
1
llm-jp/optimal-sparsity-code-d2048-E32-k16-13.6B-A7.1B
Text Generation
•
14B
•
Updated
•
5
llm-jp/optimal-sparsity-code-d2048-E16-k16-7.1B-A7.1B
Text Generation
•
7B
•
Updated
•
3
llm-jp/optimal-sparsity-code-d1024-E256-k16-26.0B-A1.9B
Text Generation
•
26B
•
Updated
•
5
llm-jp/optimal-sparsity-code-d1024-E128-k16-13.2B-A1.9B
Text Generation
•
13B
•
Updated
•
2
llm-jp/optimal-sparsity-code-d1024-E64-k16-6.7B-A1.9B
Text Generation
•
7B
•
Updated
•
4
llm-jp/optimal-sparsity-code-d1024-E32-k16-3.5B-A1.9B
Text Generation
•
3B
•
Updated
•
2
llm-jp/optimal-sparsity-code-d1024-E16-k16-1.9B-A1.9B
Text Generation
•
2B
•
Updated
•
3
llm-jp/optimal-sparsity-code-d512-E256-k16-6.6B-A520M
Text Generation
•
7B
•
Updated
•
3
llm-jp/optimal-sparsity-code-d512-E128-k16-3.3B-A520M
Text Generation
•
3B
•
Updated
•
2
llm-jp/optimal-sparsity-code-d512-E64-k16-1.7B-A520M
Text Generation
•
2B
•
Updated
•
2
llm-jp/optimal-sparsity-code-d512-E32-k16-920M-A520M
Text Generation
•
0.9B
•
Updated
•
3
llm-jp/optimal-sparsity-code-d512-E16-k16-520M-A520M
Text Generation
•
0.5B
•
Updated
•
3
llm-jp/optimal-sparsity-code-d2048-E128-k8-52.2B-A3.9B
Text Generation
•
52B
•
Updated
•
3
llm-jp/optimal-sparsity-code-d2048-E64-k8-26.4B-A3.9B
Text Generation
•
26B
•
Updated
•
4
llm-jp/optimal-sparsity-code-d2048-E32-k8-13.6B-A3.9B
Text Generation
•
14B
•
Updated
•
4
llm-jp/optimal-sparsity-code-d2048-E16-k8-7.1B-A3.9B
Text Generation
•
7B
•
Updated
•
3
llm-jp/optimal-sparsity-code-d2048-E8-k8-3.9B-A3.9B
Text Generation
•
4B
•
Updated
•
3
llm-jp/optimal-sparsity-code-d1024-E256-k8-26.0B-A1.1B
Text Generation
•
26B
•
Updated
•
1
llm-jp/optimal-sparsity-code-d1024-E128-k8-13.2B-A1.1B
Text Generation
•
13B
•
Updated
•
4