1 12 3

namgyu-youn

namgyu-youn

AI & ML interests

None yet

Recent Activity

upvoted a paper 20 days ago

GPTVQ: The Blessing of Dimensionality for LLM Quantization

updated a model 20 days ago

namgyu-youn/Qwen3-8B-INT4

updated a collection 20 days ago

TorchAO: Model Release

View all activity

Organizations

None yet

upvoted a paper 20 days ago

GPTVQ: The Blessing of Dimensionality for LLM Quantization

Paper • 2402.15319 • Published Feb 23, 2024 • 22

updated a model 20 days ago

namgyu-youn/Qwen3-8B-INT4

Updated 20 days ago • 114

updated a collection 20 days ago

TorchAO: Model Release

Collection

4 items • Updated 20 days ago

updated a model 20 days ago

namgyu-youn/Qwen3-8B-W8A8-INT

Text Generation • Updated 20 days ago • 70

published a model 20 days ago

namgyu-youn/Qwen3-8B-W8A8-INT

Text Generation • Updated 20 days ago • 70

updated 3 models 20 days ago

published 2 models 20 days ago

namgyu-youn/Qwen3-8B-W8A16-INT

Text Generation • Updated 20 days ago • 22

namgyu-youn/Qwen3-8B-W4A16-INT

Text Generation • Updated 20 days ago • 67

published a model 21 days ago

namgyu-youn/Qwen3-8B-W8A8-FP

Text Generation • Updated 20 days ago • 53

New activity in namgyu-youn/Qwen3-8B-INT4 22 days ago

Adding `safetensors` variant of this model

#1 opened 22 days ago by

SFconvertbot

published a model 23 days ago

namgyu-youn/Qwen3-8B-INT4

Updated 20 days ago • 114

updated a model about 1 month ago

namgyu-youn/Qwen3-0.6B-INT8-INT4-SINQ

Updated Dec 2, 2025 • 2

upvoted 2 papers about 1 month ago

FlashInfer: Efficient and Customizable Attention Engine for LLM Inference Serving

Paper • 2501.01005 • Published Jan 2, 2025 • 2

LFM2 Technical Report

Paper • 2511.23404 • Published Nov 28, 2025 • 42

updated 2 models about 1 month ago

namgyu-youn/Qwen3-30B-A3B-Thinking-2507-INT8-INT4-SINQ

Updated Nov 30, 2025 • 3

namgyu-youn/Qwen3-30B-A3B-Thinking-2507-INT8-INT4-HQQ

Updated Nov 27, 2025 • 2

namgyu-youn

AI & ML interests

Recent Activity

Organizations

namgyu-youn's activity

Adding `safetensors` variant of this model