4 1376

Shaobai Jiang

shaobaij

AI & ML interests

None yet

Recent Activity

upvoted a paper about 23 hours ago

FinMCP-Bench: Benchmarking LLM Agents for Real-World Financial Tool Use under the Model Context Protocol

upvoted a paper about 23 hours ago

Why Does Self-Distillation (Sometimes) Degrade the Reasoning Capability of LLMs?

upvoted a paper about 23 hours ago

GameplayQA: A Benchmarking Framework for Decision-Dense POV-Synced Multi-Video Understanding of 3D Virtual Agents

View all activity

Organizations

None yet

upvoted 8 papers about 23 hours ago

FinMCP-Bench: Benchmarking LLM Agents for Real-World Financial Tool Use under the Model Context Protocol

Paper • 2603.24943 • Published 5 days ago • 8

Why Does Self-Distillation (Sometimes) Degrade the Reasoning Capability of LLMs?

Paper • 2603.24472 • Published 5 days ago • 41

GameplayQA: A Benchmarking Framework for Decision-Dense POV-Synced Multi-Video Understanding of 3D Virtual Agents

Paper • 2603.24329 • Published 5 days ago • 19

EVA: Efficient Reinforcement Learning for End-to-End Video Agent

Paper • 2603.22918 • Published 6 days ago • 39

SlopCodeBench: Benchmarking How Coding Agents Degrade Over Long-Horizon Iterative Tasks

Paper • 2603.24755 • Published 5 days ago • 24

upvoted 12 papers 1 day ago

FASA: Frequency-aware Sparse Attention

Paper • 2602.03152 • Published Feb 3 • 153

OmniSIFT: Modality-Asymmetric Token Compression for Efficient Omni-modal Large Language Models

Paper • 2602.04804 • Published Feb 4 • 49

HySparse: A Hybrid Sparse Attention Architecture with Oracle Token Selection and KV Cache Sharing

Paper • 2602.03560 • Published Feb 3 • 49

OmniMoE: An Efficient MoE by Orchestrating Atomic Experts at Scale

Paper • 2602.05711 • Published Feb 5 • 12

Horizon-LM: A RAM-Centric Architecture for LLM Training

Paper • 2602.04816 • Published Feb 4 • 19

Training Data Efficiency in Multimodal Process Reward Models

Paper • 2602.04145 • Published Feb 4 • 79

Generative Visual Code Mobile World Models

Paper • 2602.01576 • Published Feb 2 • 42

POINTS-GUI-G: GUI-Grounding Journey

Paper • 2602.06391 • Published Feb 6 • 18

Research on World Models Is Not Merely Injecting World Knowledge into Specific Tasks

Paper • 2602.01630 • Published Feb 2 • 50

No Global Plan in Chain-of-Thought: Uncover the Latent Planning Horizon of LLMs

Paper • 2602.02103 • Published Feb 2 • 73

Self-Improving World Modelling with Latent Actions

Paper • 2602.06130 • Published Feb 5 • 32

Closing the Loop: Universal Repository Representation with RPG-Encoder

Paper • 2602.02084 • Published Feb 2 • 86

Shaobai Jiang

AI & ML interests

Recent Activity

Organizations

shaobaij's activity