Yuseung "Phillip" Lee

phillipinseoul

https://phillipinseoul.github.io/

phillipinseoul

AI & ML interests

Computer Vision

Recent Activity

upvoted a paper about 1 hour ago

Attention Sink in Transformers: A Survey on Utilization, Interpretation, and Mitigation

upvoted a paper 1 day ago

SkillClaw: Let Skills Evolve Collectively with Agentic Evolver

upvoted a paper 1 day ago

VisionFoundry: Teaching VLMs Visual Perception with Synthetic Images

View all activity

Organizations

upvoted a paper about 1 hour ago

Attention Sink in Transformers: A Survey on Utilization, Interpretation, and Mitigation

Paper • 2604.10098 • Published 3 days ago • 44

upvoted 3 papers 1 day ago

upvoted 3 papers 4 days ago

OpenSpatial: A Principled Data Engine for Empowering Spatial Intelligence

Paper • 2604.07296 • Published 6 days ago • 34

Rethinking Generalization in Reasoning SFT: A Conditional Analysis on Optimization, Data, and Model Capability

Paper • 2604.06628 • Published 6 days ago • 306

HY-Embodied-0.5: Embodied Foundation Models for Real-World Agents

Paper • 2604.07430 • Published 6 days ago • 160

upvoted a paper 5 days ago

RAGEN-2: Reasoning Collapse in Agentic RL

Paper • 2604.06268 • Published 7 days ago • 60

upvoted 3 papers 6 days ago

Action Images: End-to-End Policy Learning via Multiview Video Generation

Paper • 2604.06168 • Published 7 days ago • 13

How Well Do Agentic Skills Work in the Wild: Benchmarking LLM Skill Usage in Realistic Settings

Paper • 2604.04323 • Published 8 days ago • 38

Vanast: Virtual Try-On with Human Image Animation via Synthetic Triplet Supervision

Paper • 2604.04934 • Published 8 days ago • 42

liked a Space 7 days ago

StyleRenderer

🎨

Generate stylized video from game G‑buffer inputs

upvoted a paper 8 days ago

Token Warping Helps MLLMs Look from Nearby Viewpoints

Paper • 2604.02870 • Published 11 days ago • 33

submitted a paper to Daily Papers 8 days ago

Token Warping Helps MLLMs Look from Nearby Viewpoints

Paper • 2604.02870 • Published 11 days ago • 33

upvoted 2 papers 8 days ago

A Simple Baseline for Streaming Video Understanding

Paper • 2604.02317 • Published 12 days ago • 72

The Latent Space: Foundation, Evolution, Mechanism, Ability, and Outlook

Paper • 2604.02029 • Published 12 days ago • 138

liked a dataset 10 days ago

ellisbrown/SIMS-VSI

Viewer • Updated Nov 7, 2025 • 242k • 175 • 7

liked 2 datasets 11 days ago

rbler/MMSI-Video-Bench

Updated Feb 10 • 107 • 5

bigai/SceneVersepp

Updated 11 days ago • 600 • 3

upvoted a paper 12 days ago

Think, Act, Build: An Agentic Framework with Vision Language Models for Zero-Shot 3D Visual Grounding

Paper • 2604.00528 • Published 13 days ago • 12

Yuseung "Phillip" Lee

AI & ML interests

Recent Activity

Organizations

phillipinseoul's activity

StyleRenderer