AI & ML interests
Energy-aware Computing, Low Power Design, EDA, Dark Silicon, Efficient Deep Learning
Papers
UniQL: Unified Quantization and Low-rank Compression for Adaptive Edge LLMs
Quamba2: A Robust and Scalable Post-training Quantization Framework for Selective State Space Models
Models 37
ut-enyac/quamba2-8b-converted-w4aX • Text Generation • 17
ut-enyac/quamba-chat-w4a8 • Text Generation • 19
ut-enyac/quamba2-2.7b-w4a8 • Text Generation • 21
ut-enyac/quamba2-8b-converted-w4a8 • Text Generation • 19 • 1
ut-enyac/quamba-chat-w8a8 • 14
ut-enyac/quamba-chat-w4a16 • 13
ut-enyac/quamba2-8b-converted-w4a16 • 7
ut-enyac/quamba-790m-w8a8 • 8
ut-enyac/quamba-790m-w4a8 • 6
ut-enyac/quamba-790m-w4a16 • 8
Datasets 0
None public yet