TongZheng PRO

TongZheng1999

https://kidzheng.github.io/

AI & ML interests

Natural Language Processing

Recent Activity

updated a model 8 days ago

TongZheng1999/HS_Model_4B_17k

published a model 8 days ago

TongZheng1999/HS_Model_4B_17k

updated a model 9 days ago

TongZheng1999/HS_Model_4B

commented a paper 4 months ago

CDE: Curiosity-Driven Exploration for Efficient Reinforcement Learning in Large Language Models

Paper • 2509.09675 • Published Sep 11, 2025 • 28

commented a paper 7 months ago

R1-RE: Cross-Domain Relationship Extraction with RLVR

Paper • 2507.04642 • Published Jul 7, 2025 • 7

commented 3 papers 8 months ago

Learning to Reason via Mixture-of-Thought for Logical Reasoning

Paper • 2505.15817 • Published May 21, 2025 • 18

commented a paper 11 months ago

Towards Optimal Multi-draft Speculative Decoding

Paper • 2502.18779 • Published Feb 26, 2025 • 5