meituan-longcat/AMO-Bench
LongCat-Flash-Prover: Advancing Native Formal Reasoning via Agentic Tool-Integrated Reinforcement Learning
$V_{0.5}$: Generalist Value Model as a Prior for Sparse RL Rollouts