Base Model for TransMLA
mengfanxu
fxmeng
AI & ML interests
None yet
Recent Activity
upvoted a paper about 17 hours ago
Generative Refinement Networks for Visual Synthesis upvoted a paper 15 days ago
HISA: Efficient Hierarchical Indexing for Fine-Grained Sparse Attention authored a paper 17 days ago
LIFT: Improving Long Context Understanding of Large Language Models
through Long Input Fine-TuningOrganizations
None yet