This model cannot be converted using llama.cpp quantization

#8
by jab0688 - opened

This model cannot be converted using llama.cpp quantization, as it reports that the corresponding tokenizer.model file cannot be found.

python.exe convert-hf-to-gguf.py ./mymodel --outfile ll33.gguf --outtype f16
Loading model: mymodel
gguf: This GGUF file is for Little Endian only
Set model parameters
gguf: context length = 8192
gguf: embedding length = 4096
gguf: feed forward length = 14336
gguf: head count = 32
gguf: key-value head count = 8
gguf: rope theta = 500000.0
gguf: rms norm epsilon = 1e-05
gguf: file type = 1
Set model tokenizer
Error: Missing mymodel\tokenizer.model

Sign up or log in to comment