# mlx-community/starcoder2-7b-4bit
This model was converted to MLX format from [bigcode/starcoder2-7b](https://huggingface.co/bigcode/starcoder2-7b).
Refer to the [original model card](https://huggingface.co/bigcode/starcoder2-7b) for more details on the model.
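
For reference, conversions like this one are typically produced with mlx-lm's convert utility. The following is a minimal sketch, assuming the `mlx_lm.convert` API (argument names have shifted between releases, so check your installed version):

```python
from mlx_lm import convert

# Download the original weights from the Hub, quantize to 4 bits,
# and write an MLX-format copy to a local directory.
# q_bits=4 is assumed here to match this repo's 4-bit quantization.
convert(
    "bigcode/starcoder2-7b",
    mlx_path="starcoder2-7b-4bit",
    quantize=True,
    q_bits=4,
)
```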
## Use with mlx
```bash
pip install mlx-lm
```
```python
from mlx_lm import load, generate

model, tokenizer = load("mlx-community/starcoder2-7b-4bit")
response = generate(model, tokenizer, prompt="hello", verbose=True)
```
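
StarCoder2 is a base code model, not an instruction-tuned one, so it works best when given a code prefix to continue rather than a natural-language request. A hedged example, assuming `generate` accepts a `max_tokens` keyword as in current mlx-lm releases:

```python
from mlx_lm import load, generate

model, tokenizer = load("mlx-community/starcoder2-7b-4bit")

# A raw code prefix; the base model completes it rather than answering it.
prompt = "def fibonacci(n):\n"
completion = generate(model, tokenizer, prompt=prompt, max_tokens=128, verbose=True)
```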
## Evaluation results

All scores are self-reported for the original bigcode/starcoder2-7b model.

| Benchmark      | Metric          | Score |
|----------------|-----------------|-------|
| CruxEval-I     | pass@1          | 34.6  |
| DS-1000        | pass@1          | 27.8  |
| GSM8K (PAL)    | accuracy        | 40.4  |
| HumanEval+     | pass@1          | 29.9  |
| HumanEval      | pass@1          | 35.4  |
| RepoBench-v1.1 | edit-similarity | 72.07 |
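
For context, pass@1 is the probability that a single sampled completion passes the benchmark's unit tests. When n completions are sampled per problem and c of them pass, the standard unbiased estimator from the HumanEval/Codex paper is pass@k = E[1 - C(n-c, k) / C(n, k)]. A small sketch of that estimator (not code from this repo):

```python
import math

def pass_at_k(n: int, c: int, k: int) -> float:
    """Unbiased pass@k estimator: 1 - C(n-c, k) / C(n, k).

    n: total completions sampled for a problem
    c: completions that passed the tests
    k: budget of attempts being scored
    """
    if n - c < k:
        # Fewer than k failures exist, so any k draws must include a pass.
        return 1.0
    return 1.0 - math.comb(n - c, k) / math.comb(n, k)

# Example: 200 samples with 70 passing gives pass@1 = 0.35
print(pass_at_k(200, 70, 1))
```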