webiraiz

webiraiz

·

AI & ML interests

None yet

Recent Activity

liked a Space about 16 hours ago

AimeeBingmouQu/ProtectBirds

upvoted a paper 8 days ago

Looped World Models

upvoted a paper 10 days ago

Duration Aware Scheduling for ASR Serving Under Workload Drift

View all activity

Organizations

None yet

upvoted a paper 8 days ago

Looped World Models

Paper • 2606.18208 • Published 14 days ago • 469

upvoted a paper 10 days ago

Duration Aware Scheduling for ASR Serving Under Workload Drift

Paper • 2603.11273 • Published Mar 11 • 3

upvoted a paper 17 days ago

ABot-Earth 0.5: Generative 3D Earth Model

Paper • 2606.09967 • Published 22 days ago • 485

upvoted a paper 26 days ago

Task-Focused Memorization for Multimodal Agents

Paper • 2605.31075 • Published May 29 • 40

upvoted a paper 28 days ago

Crafter: A Multi-Agent Harness for Editable Scientific Figure Generation from Diverse Inputs

Paper • 2605.30611 • Published May 28 • 250

upvoted 3 papers about 1 month ago

Conditional Equivalence of DPO and RLHF: Implicit Assumption, Failure Modes, and Provable Alignment

Paper • 2605.20834 • Published May 20 • 5

On the limits and opportunities of AI reviewers: Reviewing the reviews of Nature-family papers with 45 expert scientists

Paper • 2605.20668 • Published May 20 • 12

Adaptive Teacher Exposure for Self-Distillation in LLM Reasoning

Paper • 2605.11458 • Published May 12 • 7

upvoted 2 papers about 2 months ago

DecodingTrust-Agent Platform (DTap): A Controllable and Interactive Red-Teaming Platform for AI Agents

Paper • 2605.04808 • Published May 6 • 20

Mean Mode Screaming: Mean--Variance Split Residuals for 1000-Layer Diffusion Transformers

Paper • 2605.06169 • Published May 7 • 237

upvoted 10 papers 3 months ago

Graph of Skills: Dependency-Aware Structural Retrieval for Massive Agent Skills

Paper • 2604.05333 • Published Apr 7 • 23

Adam's Law: Textual Frequency Law on Large Language Models

Paper • 2604.02176 • Published Apr 2 • 509

When Numbers Speak: Aligning Textual Numerals and Visual Instances in Text-to-Video Diffusion Models

Paper • 2604.08546 • Published Apr 9 • 116

Lingshu-Cell: A generative cellular world model for transcriptome modeling toward virtual cells

Paper • 2603.25240 • Published Mar 26 • 78

CARLA-Air: Fly Drones Inside a CARLA World -- A Unified Infrastructure for Air-Ground Embodied Intelligence

Paper • 2603.28032 • Published Mar 30 • 344

A Comparative Study in Surgical AI: Datasets, Foundation Models, and Barriers to Med-AGI

Paper • 2603.27341 • Published Mar 28 • 7

FIPO: Eliciting Deep Reasoning with Future-KL Influenced Policy Optimization

Paper • 2603.19835 • Published Mar 20 • 352

Out of Sight but Not Out of Mind: Hybrid Memory for Dynamic Video World Models

Paper • 2603.25716 • Published Mar 26 • 157

ShotStream: Streaming Multi-Shot Video Generation for Interactive Storytelling

Paper • 2603.25746 • Published Mar 26 • 155

MinerU-Diffusion: Rethinking Document OCR as Inverse Rendering via Diffusion Decoding

Paper • 2603.22458 • Published Mar 23 • 138