16 28 57

random

fakerbaby

fakerbaby

AI & ML interests

NLP, RL, VLM

Recent Activity

liked a dataset 17 days ago

AlienKevin/SWE-ZERO-12M-trajectories

upvoted a paper 2 months ago

MiroThinker-1.7 & H1: Towards Heavy-Duty Research Agents via Verification

upvoted a collection 2 months ago

Qwen3.5

View all activity

Organizations

liked a dataset 17 days ago

AlienKevin/SWE-ZERO-12M-trajectories

Viewer • Updated 16 days ago • 12.3M • 15.8k • 114

upvoted a paper 2 months ago

MiroThinker-1.7 & H1: Towards Heavy-Duty Research Agents via Verification

Paper • 2603.15726 • Published Mar 16 • 187

upvoted a collection 2 months ago

Qwen3.5

Collection

21 items • Updated Mar 9 • 1.66k

liked a dataset 2 months ago

Agent-Ark/Toucan-1.5M

Viewer • Updated Oct 4, 2025 • 1.65M • 5.44k • 213

liked 2 datasets 3 months ago

yatin-superintelligence/Creative-Professionals-Agentic-Tasks-1M

Viewer • Updated Mar 13 • 1.07M • 532 • 25

ronantakizawa/github-codereview

Viewer • Updated Mar 10 • 356k • 531 • 54

upvoted a paper 3 months ago

SkyReels-V4: Multi-modal Video-Audio Generation, Inpainting and Editing model

Paper • 2602.21818 • Published Feb 25 • 55

upvoted an article 6 months ago

Article

We Got Claude to Fine-Tune an Open Source LLM

burtenshaw, evalstate

•

Dec 4, 2025

• 627

upvoted a paper 6 months ago

Skywork-R1V4: Toward Agentic Multimodal Intelligence through Interleaved Thinking with Images and DeepResearch

Paper • 2512.02395 • Published Dec 2, 2025 • 52

upvoted 2 papers 7 months ago

QeRL: Beyond Efficiency -- Quantization-enhanced Reinforcement Learning for LLMs

Paper • 2510.11696 • Published Oct 13, 2025 • 182

BAPO: Stabilizing Off-Policy Reinforcement Learning for LLMs via Balanced Policy Optimization with Adaptive Clipping

Paper • 2510.18927 • Published Oct 21, 2025 • 85

liked a model 8 months ago

zai-org/GLM-4.6

Text Generation • 357B • Updated Sep 30, 2025 • 40.4k • • 1.22k

liked a dataset 8 months ago

HuggingFaceM4/DoclingMatix

Viewer • Updated Jul 31, 2025 • 1.27M • 1.28k • 51

liked a Space 9 months ago

FineVision: Open Data is All You Need

📝

224

A new open-source dataset for training VLMs

liked 2 datasets 9 months ago

IPF/AIME25-CoT-CN

Viewer • Updated Feb 28 • 30 • 353 • 9

UniParser/RxnBench

Viewer • Updated 13 days ago • 3.05k • 315 • 13

upvoted a paper 9 months ago

A Survey of Reinforcement Learning for Large Reasoning Models

Paper • 2509.08827 • Published Sep 10, 2025 • 193

liked a dataset 9 months ago

HuggingFaceM4/FineVision

Viewer • Updated Oct 21, 2025 • 24.2M • 155k • 486

liked 2 models 10 months ago

zai-org/GLM-4.5V

Image-Text-to-Text • 108B • Updated Oct 25, 2025 • 180k • • 718

Skywork/Matrix-Game-2.0

Image-to-Video • Updated Apr 13 • 119 • 293