This model cannot be converted using llama.cpp quantization

by jab0688 - opened Jan 13

Jan 13

This model cannot be converted using llama.cpp quantization, as it reports that the corresponding tokenizer.model file cannot be found.

python.exe convert-hf-to-gguf.py ./mymodel --outfile ll33.gguf --outtype f16
Loading model: mymodel
gguf: This GGUF file is for Little Endian only
Set model parameters
gguf: context length = 8192
gguf: embedding length = 4096
gguf: feed forward length = 14336
gguf: head count = 32
gguf: key-value head count = 8
gguf: rope theta = 500000.0
gguf: rms norm epsilon = 1e-05
gguf: file type = 1
Set model tokenizer
Error: Missing mymodel\tokenizer.model

inflatebot

Allura (Forge) org Jan 14

https://huggingface.co/bartowski/allura-forge_Llama-3.3-8B-Instruct-GGUF

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment