4 15

Jiahao Meng

marinero4972

AI & ML interests

None yet

Recent Activity

updated a dataset 6 days ago

marinero4972/sampled_videos

published a dataset 9 days ago

marinero4972/sampled_videos

upvoted a paper about 2 months ago

Lumine: An Open Recipe for Building Generalist Agents in 3D Open Worlds

View all activity

Organizations

updated a dataset 6 days ago

marinero4972/sampled_videos

Viewer • Updated 6 days ago • 100 • 40

published a dataset 9 days ago

marinero4972/sampled_videos

Viewer • Updated 6 days ago • 100 • 40

upvoted a paper about 2 months ago

Lumine: An Open Recipe for Building Generalist Agents in 3D Open Worlds

Paper • 2511.08892 • Published Nov 12, 2025 • 200

updated a dataset about 2 months ago

marinero4972/Open-o3-Video

Preview • Updated Nov 11, 2025 • 154 • 6

authored 2 papers 2 months ago

DenseWorld-1M: Towards Detailed Dense Grounded Caption in the Real World

Paper • 2506.24102 • Published Jun 30, 2025

Open-o3 Video: Grounded Video Reasoning with Explicit Spatio-Temporal Evidence

Paper • 2510.20579 • Published Oct 23, 2025 • 55

commented a paper 2 months ago

Open-o3 Video: Grounded Video Reasoning with Explicit Spatio-Temporal Evidence

Paper • 2510.20579 • Published Oct 23, 2025 • 55 •

upvoted a paper 2 months ago

Open-o3 Video: Grounded Video Reasoning with Explicit Spatio-Temporal Evidence

Paper • 2510.20579 • Published Oct 23, 2025 • 55

published a dataset 2 months ago

marinero4972/Open-o3-Video

Preview • Updated Nov 11, 2025 • 154 • 6

published a model 2 months ago

marinero4972/Open-o3-Video

8B • Updated Oct 23, 2025 • 56 • 4

updated a model 2 months ago

marinero4972/Open-o3-Video

8B • Updated Oct 23, 2025 • 56 • 4

authored a paper 2 months ago

Grasp Any Region: Towards Precise, Contextual Pixel Understanding for Multimodal LLMs

Paper • 2510.18876 • Published Oct 21, 2025 • 36

upvoted a paper 2 months ago

Grasp Any Region: Towards Precise, Contextual Pixel Understanding for Multimodal LLMs

Paper • 2510.18876 • Published Oct 21, 2025 • 36

upvoted a collection 3 months ago

Qwen3-VL

Collection

37 items • Updated 4 days ago • 555

upvoted 2 papers 6 months ago

Traceable Evidence Enhanced Visual Grounded Reasoning: Evaluation and Methodology

Paper • 2507.07999 • Published Jul 10, 2025 • 49

VMoBA: Mixture-of-Block Attention for Video Diffusion Models

Paper • 2506.23858 • Published Jun 30, 2025 • 31

authored a paper 7 months ago

CyberV: Cybernetics for Test-time Scaling in Video Understanding

Paper • 2506.07971 • Published Jun 9, 2025 • 5

upvoted a paper 7 months ago

CyberV: Cybernetics for Test-time Scaling in Video Understanding

Paper • 2506.07971 • Published Jun 9, 2025 • 5

commented a paper 7 months ago

CyberV: Cybernetics for Test-time Scaling in Video Understanding

Paper • 2506.07971 • Published Jun 9, 2025 • 5 •

published a dataset 7 months ago

marinero4972/CyberV_ASR

Viewer • Updated May 23, 2025 • 10.6k • 499

Jiahao Meng

AI & ML interests

Recent Activity

Organizations

marinero4972's activity