AdaCM^2: On Understanding Extremely Long-Term Video with Adaptive Cross-Modality Memory Reduction Paper • 2411.12593 • Published Nov 19, 2024 • 1
OmniVinci: Enhancing Architecture and Data for Omni-Modal Understanding LLM Paper • 2510.15870 • Published Oct 17 • 89