qinqi's picture

qinqi

Dakerqi

·

AI & ML interests

None yet

Recent Activity

new activity 1 day ago

inclusionAI/LLaDA2.0-Uni:ComfyUI support

authored a paper 6 days ago

UniPercept: Towards Unified Perceptual-Level Image Understanding across Aesthetics, Quality, Structure, and Texture

authored a paper 6 days ago

Accelerating Masked Image Generation by Learning Latent Controlled Dynamics

View all activity

Organizations

authored 8 papers 6 days ago

UniPercept: Towards Unified Perceptual-Level Image Understanding across Aesthetics, Quality, Structure, and Texture

Paper • 2512.21675 • Published Dec 25, 2025 • 26

Accelerating Masked Image Generation by Learning Latent Controlled Dynamics

Paper • 2602.23996 • Published Feb 27 • 8

InternVL-U: Democratizing Unified Multimodal Models for Understanding, Reasoning, Generation and Editing

Paper • 2603.09877 • Published Mar 10 • 48

Project Imaging-X: A Survey of 1000+ Open-Access Medical Imaging Datasets for Foundation Model Development

Paper • 2603.27460 • Published Mar 29 • 68

Training-Free Acceleration for Document Parsing Vision-Language Model with Hierarchical Speculative Decoding

Paper • 2602.12957 • Published Feb 13

Project Imaging-X: A Survey of 1000+ Open-Access Medical Imaging Datasets for Foundation Model Development

Paper • 2603.27460 • Published Mar 29 • 68

Project Imaging-X: A Survey of 1000+ Open-Access Medical Imaging Datasets for Foundation Model Development

Paper • 2603.27460 • Published Mar 29 • 68

LLaDA2.0-Uni: Unifying Multimodal Understanding and Generation with Diffusion Large Language Model

Paper • 2604.20796 • Published 7 days ago • 235

authored 3 papers 4 months ago

Unimedvl: Unifying Medical Multimodal Understanding And Generation Through Observation-Knowledge-Analysis

Paper • 2510.15710 • Published Oct 17, 2025 • 8

Parameter-Efficient Fine-Tuning for Pre-Trained Vision Models: A Survey and Benchmark

Paper • 2402.02242 • Published Feb 3, 2024

dMLLM-TTS: Self-Verified and Efficient Test-Time Scaling for Diffusion Multi-Modal Large Language Models

Paper • 2512.19433 • Published Dec 22, 2025 • 3

authored 3 papers 7 months ago

Lumina-mGPT 2.0: Stand-Alone AutoRegressive Image Modeling

Paper • 2507.17801 • Published Jul 23, 2025 • 1

Resurrect Mask AutoRegressive Modeling for Efficient and Scalable Image Generation

Paper • 2507.13032 • Published Jul 17, 2025

Lumina-DiMOO: An Omni Diffusion Large Language Model for Multi-Modal Generation and Understanding

Paper • 2510.06308 • Published Oct 7, 2025 • 55

authored 4 papers about 1 year ago

OmniCaptioner: One Captioner to Rule Them All

Paper • 2504.07089 • Published Apr 9, 2025 • 20

LeX-Art: Rethinking Text Generation via Scalable High-Quality Data Synthesis

Paper • 2503.21749 • Published Mar 27, 2025 • 26

Lumina-Video: Efficient and Flexible Video Generation with Multi-scale Next-DiT

Paper • 2502.06782 • Published Feb 10, 2025 • 15

Lumina-Image 2.0: A Unified and Efficient Image Generative Framework

Paper • 2503.21758 • Published Mar 27, 2025 • 22