1 7 4

Yang Penghui

ygyjrc

AI & ML interests

None yet

Recent Activity

upvoted a paper 2 days ago

Beyond the Current Observation: Evaluating Multimodal Large Language Models in Controllable Non-Markov Games

upvoted a paper 16 days ago

OVO-S-Bench: A Hierarchical Benchmark for Streaming Spatial Intelligence in Multimodal LLMs

upvoted a paper 19 days ago

COLLEAGUE.SKILL: Automated AI Skill Generation via Expert Knowledge Distillation

View all activity

Organizations

None yet

upvoted a paper 2 days ago

Beyond the Current Observation: Evaluating Multimodal Large Language Models in Controllable Non-Markov Games

Paper • 2606.19338 • Published 3 days ago • 43

upvoted a paper 16 days ago

OVO-S-Bench: A Hierarchical Benchmark for Streaming Spatial Intelligence in Multimodal LLMs

Paper • 2606.03890 • Published 18 days ago • 31

upvoted a paper 19 days ago

COLLEAGUE.SKILL: Automated AI Skill Generation via Expert Knowledge Distillation

Paper • 2605.31264 • Published 22 days ago • 115

liked a dataset 22 days ago

internlm/CapRL-Video-QA-20K

Viewer • Updated 10 days ago • 20k • 262 • 6

upvoted a paper 24 days ago

SetCon: Towards Open-Ended Referring Segmentation via Set-Level Concept Prediction

Paper • 2605.20110 • Published May 19 • 4

liked a dataset 25 days ago

internlm/CapRL-Video-178K

Viewer • Updated 10 days ago • 170k • 248 • 8

upvoted a paper 26 days ago

ETCHR: Editing To Clarify and Harness Reasoning

Paper • 2605.23897 • Published 29 days ago • 13

liked a model 29 days ago

internlm/CapRL-Video-4B

5B • Updated 11 days ago • 250 • 10

New activity in NemoStation/Marlin-2B about 1 month ago

Question about the evaluation metrics for captioning benchmarks

#3 opened about 1 month ago by

ygyjrc

upvoted a paper about 1 month ago

WildClawBench: A Benchmark for Real-World, Long-Horizon Agent Evaluation

Paper • 2605.10912 • Published May 11 • 46

upvoted a paper 3 months ago

Intern-S1-Pro: Scientific Multimodal Foundation Model at Trillion Scale

Paper • 2603.25040 • Published Mar 26 • 133

liked a dataset 3 months ago

internlm/WildClawBench

Benchmark • Updated May 15 • 11.5k • 62

Yang Penghui

AI & ML interests

Recent Activity

Organizations

ygyjrc's activity

Question about the evaluation metrics for captioning benchmarks