
Hasan Arif

hasanar1f

AI & ML interests

Efficient training and inference

Recent Activity

liked a dataset about 2 months ago

OpenAssistant/oasst1

liked a dataset about 2 months ago

allenai/WildChat-1M

upvoted a paper 3 months ago

QeRL: Beyond Efficiency -- Quantization-enhanced Reinforcement Learning for LLMs


Organizations

upvoted 2 papers 3 months ago

QeRL: Beyond Efficiency -- Quantization-enhanced Reinforcement Learning for LLMs

Paper • 2510.11696 • Published Oct 13, 2025 • 178

StreamingVLM: Real-Time Understanding for Infinite Video Streams

Paper • 2510.09608 • Published Oct 10, 2025 • 50

upvoted a paper 7 months ago

Round Attention: A Novel Round-Level Attention Mechanism to Accelerate LLM Inference

Paper • 2502.15294 • Published Feb 21, 2025 • 1

upvoted a paper 8 months ago

From Token to Action: State Machine Reasoning to Mitigate Overthinking in Information Retrieval

Paper • 2505.23059 • Published May 29, 2025 • 13

upvoted 5 papers 10 months ago

upvoted a collection 10 months ago

ML Optimization Papers

Collection

19 items • Updated Apr 4, 2025 • 1

upvoted 6 papers 12 months ago

iFormer: Integrating ConvNet and Transformer for Mobile Application

Paper • 2501.15369 • Published Jan 26, 2025 • 13

Temporal Preference Optimization for Long-Form Video Understanding

Paper • 2501.13919 • Published Jan 23, 2025 • 23

Humanity's Last Exam

Paper • 2501.14249 • Published Jan 24, 2025 • 77

Fixing Imbalanced Attention to Mitigate In-Context Hallucination of Large Vision-Language Model

Paper • 2501.12206 • Published Jan 21, 2025 • 4

ReMoE: Fully Differentiable Mixture-of-Experts with ReLU Routing

Paper • 2412.14711 • Published Dec 19, 2024 • 16

MapQaTor: A System for Efficient Annotation of Map Query Datasets

Paper • 2412.21015 • Published Dec 30, 2024 • 9

upvoted 2 papers about 1 year ago

TokenFormer: Rethinking Transformer Scaling with Tokenized Model Parameters

Paper • 2410.23168 • Published Oct 30, 2024 • 24

NeuZip: Memory-Efficient Training and Inference with Dynamic Compression of Neural Networks

Paper • 2410.20650 • Published Oct 28, 2024 • 17

upvoted a paper over 1 year ago

HiRED: Attention-Guided Token Dropping for Efficient Inference of High-Resolution Vision-Language Models in Resource-Constrained Environments

Paper • 2408.10945 • Published Aug 20, 2024 • 10

upvoted a collection almost 2 years ago

LLaVa-NeXT

Collection

LLaVa-NeXT (also known as LLaVa-1.6) improves upon the 1.5 series by incorporating higher image resolutions and more reasoning/OCR datasets. • 8 items • Updated Jul 19, 2024 • 32
