zehuan-huang PRO

huanngzh

AI & ML interests

AIGC and 3D vision

Recent Activity

submitted a paper 19 days ago

Stereo World Model: Camera-Guided Stereo Video Generation

upvoted an article 15 days ago

NEO-unify: Building Native Multimodal Unified Models End to End

Article • Mar 5 • 107

upvoted a paper 19 days ago

Stereo World Model: Camera-Guided Stereo Video Generation

Paper • 2603.17375 • Published 20 days ago • 11

upvoted a paper 20 days ago

SegviGen: Repurposing 3D Generative Model for Part Segmentation

Paper • 2603.16869 • Published 21 days ago • 18

upvoted a paper about 2 months ago

SAGE: Scalable Agentic 3D Scene Generation for Embodied AI

Paper • 2602.10116 • Published Feb 10 • 13

upvoted 3 papers 4 months ago

RoboTracer: Mastering Spatial Trace with Reasoning in Vision-Language Models for Robotics

Paper • 2512.13660 • Published Dec 15, 2025 • 37

Captain Safari: A World Engine

Paper • 2511.22815 • Published Nov 28, 2025 • 12

Geometrically-Constrained Agent for Spatial Reasoning

Paper • 2511.22659 • Published Nov 27, 2025 • 41

upvoted a paper 5 months ago

SAM 3D: 3Dfy Anything in Images

Paper • 2511.16624 • Published Nov 20, 2025 • 114

upvoted a paper 6 months ago

AToken: A Unified Tokenizer for Vision

Paper • 2509.14476 • Published Sep 17, 2025 • 37

upvoted a paper 7 months ago

VoxHammer: Training-Free Precise and Coherent 3D Editing in Native 3D Space

Paper • 2508.19247 • Published Aug 26, 2025 • 43

upvoted a paper 8 months ago

Genie Envisioner: A Unified World Foundation Platform for Robotic Manipulation

Paper • 2508.05635 • Published Aug 7, 2025 • 73

upvoted 3 papers 9 months ago

π^3: Scalable Permutation-Equivariant Visual Geometry Learning

Paper • 2507.13347 • Published Jul 17, 2025 • 67

GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning

Paper • 2507.01006 • Published Jul 1, 2025 • 252

Use Property-Based Testing to Bridge LLM Code Generation and Validation

Paper • 2506.18315 • Published Jun 23, 2025 • 11

upvoted 3 papers 10 months ago

AnimaX: Animating the Inanimate in 3D with Joint Video-Pose Diffusion Models

Paper • 2506.19851 • Published Jun 24, 2025 • 60

RoboRefer: Towards Spatial Referring with Reasoning in Vision-Language Models for Robotics

Paper • 2506.04308 • Published Jun 4, 2025 • 43

UniGeo: Taming Video Diffusion for Unified Consistent Geometry Estimation

Paper • 2505.24521 • Published May 30, 2025 • 15

upvoted a collection 11 months ago

Objaverse-Render

Collection • 2 items • Updated May 20, 2025 • 1

upvoted a collection about 1 year ago

MV-Adapter Spaces

Collection • 6 items • Updated Dec 3, 2025 • 10

upvoted a paper about 1 year ago

Personalize Anything for Free with Diffusion Transformer

Paper • 2503.12590 • Published Mar 16, 2025 • 44
