Bipedal Locomotion

I worked on reinforcement-learning-based bipedal robot locomotion with the Bolt10 model. The main task is to learn a robust policy that transfers well to the real world. A state-estimator MLP is concurrently trained to perform adaptive locomotion on unfamiliar terrain. The framework is shown below.

Network structure

Simulation and training are conducted in IsaacGym (NVIDIA, CUDA 12.2) using the rsl-rl RL framework. Sim-to-sim verification is done in MuJoCo. Simultaneous training of the state estimator has shown improvements in sim-to-sim transfer. I intend to deploy the trained policy on a real Bolt6 robot.

Walking in IsaacGym

Walking in MuJoCo

Other motions developed during training

Crouched walking

Swing walking

Poster presentation at SNU GSCST (2024.08.29) DYROS poster

Minjun Chang