Onism Space
6 docs tagged with "RL"
Group Relative Policy Optimization
GRPO
pi0-A Vision-Language-Action Flow Model for General Robot Control
pi0
Proximal Policy Optimization (PPO)
PPO
SmolVLA-A Vision-Language-Action Model for Affordable and Efficient Robotics
Beating pi0 with a small parameter count
Twist-Teleoperated Whole-Body Imitation System
Whole-body teleoperation
VLFM-Vision-Language Frontier Maps for Zero-Shot Semantic Navigation
Navigation