Gerald Stanje
Gerald001
·
AI & ML interests
None yet
Organizations
Any plans for gpt oss 20b?
2
#1 opened 8 months ago
by
andhakanoon
Upload TRT model for Nvidia H100
#5 opened 3 months ago
by
robustdev
Upload TRT model for Nvidia H100
#4 opened 3 months ago
by
robustdev
Upload TRT model for Nvidia H200
#3 opened 3 months ago
by
robustdev
Upload TRT model for Nvidia L40S
#2 opened 3 months ago
by
robustdev
Upload ONNX model for Nvidia L40S
#1 opened 3 months ago
by
robustdev
Eagle3 for 20b
🚀 1
3
#4 opened 8 months ago
by
cr-boostrun
how to disable the reasoning mode?
👍 13
11
#50 opened 11 months ago
by
szzzzz
GGUF is very slow for some reason
2
#12 opened 10 months ago
by
ineersa
Model conversion info
11
#9 opened 4 months ago
by
Gerald001
NVIDIA L40S GPU's for MXFP4 quantization
6
#100 opened 11 months ago
by
lordim
How to turn off thinking mode
👍🔥 7
15
#86 opened 11 months ago
by
Gierry
REASONING SETTING GUIDE 📚
😔👍 4
28
#28 opened 11 months ago
by
xbruce22
Update chat_template.jinja
3
#229 opened 5 months ago
by
mohsin17444
question: setting reasoning effort
6
#66 opened 11 months ago
by
TheBigBlockPC
Unable to load gpt-oss-20b on dual L40 (48GB) GPUs with vLLM
12
#136 opened 10 months ago
by
yjban
assistantfinal, analysis keyword is contained in the huggingface gpt-oss-120 output. Is this intended?
👍 4
4
#130 opened 10 months ago
by
ml345
deploying finetuned model on triton using vllm backend
1
#157 opened 8 months ago
by
Prabhjot410
Guidance Needed: GPT-OSS 20B Fine-Tuning with Unsloth → GGUF → Ollama → Triton (vLLM / TensorRT-LLM)
3
#225 opened 5 months ago
by
GauravEA