Chasing the Public Score: User Pressure and Evaluation Exploitation in Coding Agent Workflows Paper • 2604.20200 • Published 7 days ago • 5
VLAA-GUI: Knowing When to Stop, Recover, and Search, A Modular Framework for GUI Automation Paper • 2604.21375 • Published 6 days ago • 17
LateOn-Code 💻 Collection State-of-the-art late interaction code retrieval models • 6 items • Updated 22 days ago • 18