AllenRL2's picture

3

AllenRL2

AllenRL2

AI & ML interests

LLM

Organizations

None yet

upvoted a paper 3 months ago

ASPO: Asymmetric Importance Sampling Policy Optimization

Paper • 2510.06062 • Published Oct 7 • 13

upvoted a paper 4 months ago

A Survey of Reinforcement Learning for Large Reasoning Models

Paper • 2509.08827 • Published Sep 10 • 190

upvoted a paper 11 months ago

Can 1B LLM Surpass 405B LLM? Rethinking Compute-Optimal Test-Time Scaling

Paper • 2502.06703 • Published Feb 10 • 153