Amélie Dubois

page-watcher

AI & ML interests

None yet

Organizations

None yet

Recent Activity

liked a model 4 days ago

mradermacher/GRaPE-2-Nano-i1-GGUF

0.8B • Updated 3 days ago • 3.03k • 1

upvoted a paper 4 days ago

Leveraging Verifier-Based Reinforcement Learning in Image Editing

Paper • 2604.27505 • Published 11 days ago • 57

liked a dataset 10 days ago

liutaopi/rarc-net-news

Updated 10 days ago • 26 • 1

upvoted 2 papers 17 days ago

WebGen-R1: Incentivizing Large Language Models to Generate Functional and Aesthetic Websites with Reinforcement Learning

Paper • 2604.20398 • Published 19 days ago • 3

DiPO: Disentangled Perplexity Policy Optimization for Fine-grained Exploration-Exploitation Trade-Off

Paper • 2604.13902 • Published 26 days ago • 62

upvoted a paper 27 days ago

Small Vision-Language Models are Smart Compressors for Long Video Understanding

Paper • 2604.08120 • Published Apr 9 • 20

liked a dataset 28 days ago

pjpjq/bybit-oi-ws-data

Updated 12 days ago • 4.42k • 6

liked a model 29 days ago

Oleksandrerfve/jghbhb

Updated 29 days ago

upvoted a paper 29 days ago

ClawBench: Can AI Agents Complete Everyday Online Tasks?

Paper • 2604.08523 • Published Apr 9 • 262

upvoted a paper about 1 month ago

DataFlex: A Unified Framework for Data-Centric Dynamic Training of Large Language Models

Paper • 2603.26164 • Published Mar 27 • 364

liked a dataset about 1 month ago

DJLougen/wittgensite

Viewer • Updated about 1 month ago • 100 • 162 • 3

liked 3 models about 1 month ago

upvoted a paper about 1 month ago

FIPO: Eliciting Deep Reasoning with Future-KL Influenced Policy Optimization

Paper • 2603.19835 • Published Mar 20 • 350

upvoted 2 papers about 2 months ago

Demystifing Video Reasoning

Paper • 2603.16870 • Published Mar 17 • 371

Bootstrapping Exploration with Group-Level Natural Language Feedback in Reinforcement Learning

Paper • 2603.04597 • Published Mar 4 • 210

upvoted a paper 2 months ago

Believe Your Model: Distribution-Guided Confidence Calibration

Paper • 2603.03872 • Published Mar 4 • 40

liked 2 models 2 months ago

Nanbeige/Nanbeige4.1-3B

Text Generation • 4B • Updated Mar 25 • 232k • 1.1k

MiniMaxAI/MiniMax-M2.5

Text Generation • 229B • Updated Mar 10 • 920k • 1.47k
