VILLAIN at AVerImaTeC: Verifying Image-Text Claims via Multi-Agent Collaboration Paper • 2602.04587 • Published Feb 4
CostNav: A Navigation Benchmark for Real-World Economic-Cost Evaluation of Physical AI Agents Paper • 2511.20216 • Published Nov 25, 2025
Team HUMANE at AVeriTeC 2025: HerO 2 for Efficient Fact Verification Paper • 2507.11004 • Published Jul 15, 2025 • 1
Visual Funnel: Resolving Contextual Blindness in Multimodal Large Language Models Paper • 2512.10362 • Published Dec 11, 2025 • 1
D2E: Scaling Vision-Action Pretraining on Desktop Data for Transfer to Embodied AI Paper • 2510.05684 • Published Oct 7, 2025 • 143
Exploring Fine-Tuning of Large Audio Language Models for Spoken Language Understanding under Limited Speech Data Paper • 2509.15389 • Published Sep 18, 2025 • 3
i18n Agent - Contribute in Just 5 Minutes 🤗 Translate Hugging Face docs into multiple languages