The Prism Hypothesis: Harmonizing Semantic and Pixel Representations via Unified Autoencoding Paper • 2512.19693 • Published 18 days ago • 62
Unified Multimodal Model Collection A curated list for Multimodal Model Generation papers. • 18 items • Updated Nov 27, 2025 • 4
From Pixels to Words -- Towards Native Vision-Language Primitives at Scale Paper • 2510.14979 • Published Oct 16, 2025 • 66
Thinking with Camera: A Unified Multimodal Model for Camera-Centric Understanding and Generation Paper • 2510.08673 • Published Oct 9, 2025 • 125
Denoising as Adaptation: Noise-Space Domain Adaptation for Image Restoration Paper • 2406.18516 • Published Jun 26, 2024 • 4
Reconstructing 4D Spatial Intelligence: A Survey Paper • 2507.21045 • Published Jul 28, 2025 • 37