📚 Humanoid Robot Learning Paper Notebooks

300 papers Update Log

⭐️ open-source code link in note

📍 推荐学习路线图

Click a paper node to open its note; click elsewhere on the diagram to zoom in.

No matching papers found.

📄 Foundational RL (15)

Foundational RL theory and classic algorithms — essential prerequisites for understanding subsequent research

Paper tags ordered by recommended learning path

Proximal Policy Optimization Algorithms (PPO) Jul 20, 2017 (arXiv) ⭐️
Advantage Weighted Regression (AWR) Sep 30, 2019 (arXiv) ⭐️
DeepMimic: Example-Guided Deep RL of Physics-Based Character Skills Apr 8, 2018 (arXiv), SIGGRAPH 2018 ⭐️
AMP: Adversarial Motion Priors for Stylized Physics-Based Character Control Apr 5, 2021 (arXiv), SIGGRAPH 2021 ⭐️
PHC: Perpetual Humanoid Control for Real-time Simulated Avatars May 10, 2023 (arXiv), ICCV 2023 ⭐️
ADD: Adversarial Disentanglement and Distillation May 8, 2025 (arXiv), SIGGRAPH Asia 2025 ⭐️
ASE: Adversarial Skill Embeddings for Large-Scale Motion Control May 4, 2022 (arXiv), SIGGRAPH 2022 (ACM TOG) ⭐️
CALM: Conditional Adversarial Latent Models for Directable Virtual Characters May 2, 2023 (arXiv), SIGGRAPH 2023 ⭐️
PULSE: Universal Humanoid Motion Representations for Physics-Based Control Oct 2023 (arXiv), May 2024 (ICLR) ⭐️
Diffusion Policy: Visuomotor Policy Learning via Action Diffusion Mar 2023 (arXiv), RSS 2023 ⭐️
BeyondMimic: From Motion Tracking to Versatile Humanoid Control via Guided Diffusion Aug 2025 (arXiv) ⭐️
Understanding Domain Randomization for Sim-to-real Transfer Oct 2021 (v1), Mar 2022 (v2) (arXiv)
Domain Randomization for Transferring Deep Neural Networks from Simulation to the Real World Mar 2017 (arXiv) ⭐️
Learning Smooth Humanoid Locomotion through Lipschitz-Constrained Policies (LCP) Oct 15, 2024 (arXiv) ⭐️
MimicKit: A Reinforcement Learning Framework for Motion Imitation and Control Oct 15, 2025 initial release; v4 Jan 18, 2026 (arXiv) ⭐️

🎯 Motion Retargeting (4)

Bridging human motion data and humanoid policies — geometric IK, learned retargeting, and the engineering choices that determine downstream policy quality

Paper tags ordered by arXiv date, oldest first

📄 High Impact Selection (23)

Curated high-impact and milestone papers in humanoid robot learning

Paper tags ordered by publication date, oldest first (same within subcategories)

Whole-Body Control Core (6)

Paper tags ordered by publication date, oldest first

Teleoperation & Imitation Learning (3)

Paper tags ordered by publication date, oldest first

Locomotion Classics (6)

Paper tags ordered by publication date, oldest first

Sim-to-Real & Foundation Model (4)

Paper tags ordered by publication date, oldest first

Simulation Platform & Tools (4)

Paper tags ordered by publication date, oldest first

🦾 Loco-Manipulation and WBC (63)

Paper tags ordered by arXiv date, newest first

🦾 Manipulation (47)

Paper tags ordered by arXiv date, newest first

🎮 Teleoperation (24)

Paper tags ordered by arXiv date, newest first

🦿 Locomotion (14)

Paper tags ordered by arXiv date, newest first

🧭 Navigation (19)

Paper tags ordered by arXiv date, newest first

📡 State Estimation (9)

Paper tags ordered by arXiv date, newest first

🔄 Sim-to-Real (12)

Core theory and algorithms are under 01 Foundational RL (Domain Randomization, LCP). This category will host Sim-to-Real-specific works (Real-to-Sim, ADR, Privileged Learning, etc.).

Paper tags ordered by arXiv date, newest first

🔧 Hardware Design (10)

Paper tags ordered by arXiv date, newest first

🖥️ Simulation Benchmark (22)

Paper tags ordered by arXiv date, newest first

🎬 Physics-Based Animation (9)

Paper tags ordered by arXiv date, newest first

🏃 Human Motion (29)

Paper tags ordered by arXiv date, newest first