Shirley-Chen
AI & ML interests
Reasoning / multi-modal LLM | CoT | RL | SFT
CoT Compression / Efficiency
- Reasoning with OmniThought: A Large CoT Dataset with Verbosity and Cognitive Difficulty Annotations
  Paper • 2505.10937 • Published • 1
- QFFT, Question-Free Fine-Tuning for Adaptive Reasoning
  Paper • 2506.12860 • Published • 18
- Sketch-of-Thought: Efficient LLM Reasoning with Adaptive Cognitive-Inspired Sketching
  Paper • 2503.05179 • Published • 46
- The Quest for Efficient Reasoning: A Data-Centric Benchmark to CoT Distillation
  Paper • 2505.18759 • Published • 14
Continual Learning - VLA
- VLA-Adapter: An Effective Paradigm for Tiny-Scale Vision-Language-Action Model
  Paper • 2509.09372 • Published • 244
- VLA-R1: Enhancing Reasoning in Vision-Language-Action Models
  Paper • 2510.01623 • Published • 10
- The Landscape of Agentic Reinforcement Learning for LLMs: A Survey
  Paper • 2509.02547 • Published • 228
- WMPO: World Model-based Policy Optimization for Vision-Language-Action Models
  Paper • 2511.09515 • Published • 18