weixun

AI & ML interests

None yet

Organizations

None yet

upvoted a paper about 3 hours ago

Let It Flow: Agentic Crafting on Rock and Roll, Building the ROME Model within an Open Agentic Learning Ecosystem

Paper • 2512.24873 • Published 1 day ago • 28

upvoted a paper 9 days ago

Reasoning Palette: Modulating Reasoning via Latent Contextualization for Controllable Exploration for (V)LMs

Paper • 2512.17206 • Published 13 days ago • 19

upvoted 3 papers 3 months ago

Attention Illuminates LLM Reasoning: The Preplan-and-Anchor Rhythm Enables Fine-Grained Policy Optimization

Paper • 2510.13554 • Published Oct 15, 2025 • 57

Part II: ROLL Flash -- Accelerating RLVR and Agentic Training with Asynchrony

Paper • 2510.11345 • Published Oct 13, 2025 • 15

GEM: A Gym for Agentic LLMs

Paper • 2510.01051 • Published Oct 1, 2025 • 89

upvoted a paper 4 months ago

Understanding Tool-Integrated Reasoning

Paper • 2508.19201 • Published Aug 26, 2025 • 32

upvoted a paper 7 months ago

Reinforcement Learning Optimization for Large-Scale Learning: An Efficient and User-Friendly Scaling Library

Paper • 2506.06122 • Published Jun 6, 2025 • 7

upvoted a paper 10 months ago

Can Large Language Models Detect Errors in Long Chain-of-Thought Reasoning?

Paper • 2502.19361 • Published Feb 26, 2025 • 28

upvoted a paper 12 months ago

ProgCo: Program Helps Self-Correction of Large Language Models

Paper • 2501.01264 • Published Jan 2, 2025 • 26

upvoted a paper about 1 year ago

Chinese SimpleQA: A Chinese Factuality Evaluation for Large Language Models

Paper • 2411.07140 • Published Nov 11, 2024 • 35

upvoted a paper over 1 year ago

OpenRLHF: An Easy-to-use, Scalable and High-performance RLHF Framework

Paper • 2405.11143 • Published May 20, 2024 • 40