
Paipile

AI & ML interests

None yet

Recent Activity

upvoted a paper 8 days ago
Towards Scalable Pre-training of Visual Tokenizers for Generation
updated a collection 3 months ago
RFT
upvoted a collection 3 months ago
VisionLM

Organizations

None yet

Collections 1

RFT
  • Group Sequence Policy Optimization

    Paper • 2507.18071 • Published Jul 24 • 316
  • LAPO: Internalizing Reasoning Efficiency via Length-Adaptive Policy Optimization

    Paper • 2507.15758 • Published Jul 21 • 35
  • Hierarchical Budget Policy Optimization for Adaptive Reasoning

    Paper • 2507.15844 • Published Jul 21 • 16
  • Semi-off-Policy Reinforcement Learning for Vision-Language Slow-thinking Reasoning

    Paper • 2507.16814 • Published Jul 22 • 21

models 0

None public yet

datasets 0

None public yet