Expecting New NVFP4 for GLM-5.1 ASAP, Thanks!

#6
by ghostplant - opened

Expecting New NVFP4 for GLM-5.1 ASAP, Thanks!

Hi, We are working on it and plan to share asap.

Is it also supported by vLLM?

Hi @tstarkey-nvidia , which specific NVFP4 qformat is used for this nvidia/GLM-5-NVFP4?

We see a GLM-5.1 NVFP4 model shared from third-party: lukealonso/GLM-5.1-NVFP4, which seems to work pretty well:

https://github.com/microsoft/Tutel?tab=readme-ov-file#steps-for-glm-551-claude-code-mode

Sign up or log in to comment