Yue Zhongqi's picture

3

Yue Zhongqi

nickyue

https://yue-zhongqi.github.io/

yue-zhongqi

AI & ML interests

post-training, multimodal large language models, generalization

Organizations

None yet

upvoted a paper 2 months ago

Expanding the Action Space of LLMs to Reason Beyond Language

Paper • 2510.07581 • Published Oct 8 • 7

upvoted an article 7 months ago

Article

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

Feb 7

•

263

upvoted a paper over 1 year ago

Exploring Diffusion Time-steps for Unsupervised Representation Learning

Paper • 2401.11430 • Published Jan 21, 2024 • 1