Chenyan Xiong Research Group at CMU

university
https://www.cs.cmu.edu/~cx/
Recent Activity

zhongshsh  updated a dataset 10 days ago
cx-cmu/Researchy-GEO
zhongshsh  updated a dataset 10 days ago
cx-cmu/GEO-Bench
zhongshsh  updated a dataset 10 days ago
cx-cmu/E-commerce

Papers

RePro: Training Language Models to Faithfully Recycle the Web for Pretraining

Members: Chenyan Xiong, Cassandra Cohen, Shanshan Zhong, Zichun Yu, Jingyuan He, Mahima Jagadeesh Patel, zhihan zhang, yujiang wu, Kira Jones

cx-cmu's collections (1)

RePro
Space for RePro: Training Language Models to Faithfully Recycle the Web for Pretraining
  • cx-cmu/repro-rephraser-4B

    Text Generation • 196k • Updated Oct 18 • 25 • 2
  • cx-cmu/repro-rl-data

    Viewer • Updated Oct 18 • 41k • 25
  • RePro: Training Language Models to Faithfully Recycle the Web for Pretraining

    Paper • 2510.10681 • Published Oct 12 • 5
  • cx-cmu/repro-rephrased-data-72B

    Viewer • Updated Oct 18 • 39M • 945