arxiv:2604.08865
TIANYI
BIMU233
AI & ML interests
None yet
Recent Activity
upvoted a paper 10 days ago
Bridging the Agent-World Gap: Text World Models for LLM-based Agents upvoted a paper 16 days ago
MIRA: Mid-training Rubric Anchoring for Source-Aware Data Selection upvoted a paper about 1 month ago
Conditional Equivalence of DPO and RLHF: Implicit Assumption, Failure Modes, and Provable AlignmentOrganizations
None yet