TongZheng's picture

4 36 3

TongZheng PRO

TongZheng1999

·

https://kidzheng.github.io/

AI & ML interests

Natural Language Processing

Recent Activity

updated a model 23 days ago

TongZheng1999/test_reasoning_1

published a model 23 days ago

TongZheng1999/test_reasoning_1

updated a model 23 days ago

TongZheng1999/test_reasoning

View all activity

Organizations

upvoted a paper 27 days ago

Native Parallel Reasoner: Reasoning in Parallelism via Self-Distilled Reinforcement Learning

Paper • 2512.07461 • Published 29 days ago • 75

upvoted a paper about 1 month ago

Multi-Crit: Benchmarking Multimodal Judges on Pluralistic Criteria-Following

Paper • 2511.21662 • Published Nov 26, 2025 • 11

upvoted 3 papers about 2 months ago

First Frame Is the Place to Go for Video Content Customization

Paper • 2511.15700 • Published Nov 19, 2025 • 52

VisPlay: Self-Evolving Vision-Language Models from Images

Paper • 2511.15661 • Published Nov 19, 2025 • 42

Beyond English: Toward Inclusive and Scalable Multilingual Machine Translation with LLMs

Paper • 2511.07003 • Published Nov 10, 2025 • 33

upvoted a paper 2 months ago

The Era of Agentic Organization: Learning to Organize with Language Models

Paper • 2510.26658 • Published Oct 30, 2025 • 27

upvoted 7 papers 3 months ago

StatEval: A Comprehensive Benchmark for Large Language Models in Statistics

Paper • 2510.09517 • Published Oct 10, 2025 • 6

NewtonBench: Benchmarking Generalizable Scientific Law Discovery in LLM Agents

Paper • 2510.07172 • Published Oct 8, 2025 • 28

DeepPrune: Parallel Scaling without Inter-trace Redundancy

Paper • 2510.08483 • Published Oct 9, 2025 • 24

Flash-Searcher: Fast and Effective Web Agents via DAG-Based Parallel Execution

Paper • 2509.25301 • Published Sep 29, 2025 • 19

VOGUE: Guiding Exploration with Visual Uncertainty Improves Multimodal Reasoning

Paper • 2510.01444 • Published Oct 1, 2025 • 19

CLUE: Non-parametric Verification from Experience via Hidden-State Clustering

Paper • 2510.01591 • Published Oct 2, 2025 • 27

TruthRL: Incentivizing Truthful LLMs via Reinforcement Learning

Paper • 2509.25760 • Published Sep 30, 2025 • 55

upvoted 3 papers 4 months ago

FlowRL: Matching Reward Distributions for LLM Reasoning

Paper • 2509.15207 • Published Sep 18, 2025 • 114

Evolving Language Models without Labels: Majority Drives Selection, Novelty Promotes Variation

Paper • 2509.15194 • Published Sep 18, 2025 • 33

Look Again, Think Slowly: Enhancing Visual Reflection in Vision-Language Models

Paper • 2509.12132 • Published Sep 15, 2025 • 6

upvoted an article 4 months ago

Article

mem-agent: Persistent, Human Readable Memory Agent Trained with Online RL

Sep 11, 2025

•

25

upvoted 3 papers 4 months ago

EchoX: Towards Mitigating Acoustic-Semantic Gap via Echo Training for Speech-to-Speech LLMs

Paper • 2509.09174 • Published Sep 11, 2025 • 61

CDE: Curiosity-Driven Exploration for Efficient Reinforcement Learning in Large Language Models

Paper • 2509.09675 • Published Sep 11, 2025 • 28

A Survey of Reinforcement Learning for Large Reasoning Models

Paper • 2509.08827 • Published Sep 10, 2025 • 190