SynthLabs

company

Verified

https://www.SynthLabs.ai

AI & ML interests

Scaling up good synthetic reasoning. Post-training and synthetic data research lab.

authored 3 papers 6 months ago

Personalized Preference Fine-tuning of Diffusion Models

Paper • 2501.06655 • Published Jan 11, 2025 • 1

Big-Math: A Large-Scale, High-Quality Math Dataset for Reinforcement Learning in Language Models

Paper • 2502.17387 • Published Feb 24, 2025 • 7

RLAD: Training LLMs to Discover Abstractions for Solving Reasoning Problems

Paper • 2510.02263 • Published Oct 2, 2025 • 9

updated a Space 8 months ago

README

published 2 models 10 months ago

SynthLabsAI/ALP_R1_Qwen1.5B

Reinforcement Learning • 2B • Updated Jun 24, 2025 • 9

SynthLabsAI/ALP_DeepScaleR_1.5B_C16K

Reinforcement Learning • 2B • Updated Jun 24, 2025 • 3 • 3

updated a collection 10 months ago

Adaptive Length Penalty

Teaching language models to think efficiently with Adaptive Length Penalty (ALP) • 3 items • Updated Jun 24, 2025 • 1

authored a paper 10 months ago

Just Enough Thinking: Efficient Reasoning with Adaptive Length Penalties Reinforcement Learning

Paper • 2506.05256 • Published Jun 5, 2025 • 2

authored 2 papers 10 months ago

OpenThoughts: Data Recipes for Reasoning Models

Paper • 2506.04178 • Published Jun 4, 2025 • 54

The Common Pile v0.1: An 8TB Dataset of Public Domain and Openly Licensed Text

Paper • 2506.05209 • Published Jun 5, 2025 • 60

in SynthLabsAI/Big-Math-RL-Verified 12 months ago

Release of data without filtering for solve rate

#3 opened about 1 year ago by

Solution of the Problems

#6 opened about 1 year ago by

authored a paper 12 months ago

Learning Adaptive Parallel Reasoning with Language Models

Paper • 2504.15466 • Published Apr 21, 2025 • 44

updated a collection 12 months ago

Big-Math

This collection contains assets associated with the Big-Math dataset, a high-quality collection of over 250,000 math questions with verifiable answers • 4 items • Updated Apr 16, 2025 • 7

in SynthLabsAI/Big-Math-RL-Verified about 1 year ago

Adding an indicator of whether response requires LLM judge

#4 opened about 1 year ago by

updated a dataset about 1 year ago

SynthLabsAI/Big-Math-RL-Verified

Viewer • Updated Mar 25, 2025 • 251k • 5.04k • 226