meituan-longcat/LongCat-Flash-Prover
Text Generation • 561B • Updated • 15 • 18
$V_{0.5}$: Generalist Value Model as a Prior for Sparse RL Rollouts
ScaleEnv: Scaling Environment Synthesis from Scratch for Generalist Interactive Tool-Use Agent Training