horseRL by angelayixuanli · Pull Request #34 · sheim/QGym

angelayixuanli · 2025-11-19T08:45:58Z

Modified mini_cheetah for to add height commands.
Added horse model for training and integrated oscillators code. Currently unable to stabilize in a standing pose, still needs further config tuning.
Added helpful logging and plotting scripts. When you run play.py, logs are saved to joint_logs.npz. Then run scripts/analyze_joint_scaling.py or 'scripts/plot_logs_by_joint.py`
Added sweep config for automated hyperparameter sweeps for horse_osc

Modified files:

gym/envs/init.py
- added horse files
gym/envs/base/legged_robot.py and other mini_cheetah_*.py
- added height as a command
gym/utils/interfaces/KeyboardInterface.py
- add height as a command (up/down key)
scripts/play.py
- added joint metrics logging

New files:

gym/envs/horse/horse.py
gym/envs/horse/horse_config.py
gym/envs/horse/horse_osc.py
gym/envs/horse/horse_osc_config.py
resources/robots/horse/urdf/horse.urdf
scripts/analyze_joint_scaling.py
- generate histogram graphs of error and stats
scripts/plot_logs_by_joint.py
- generate graphs of actual_pos and target_pos with limit markers, grouped by joint
scripts/plot_logs_by_leg.py
- generate graphs of joint pos, grouped by leg
scripts/sweep_configs/sweep_horse_osc_config.json
- sweep config for horse_osc

Horse tweaks

sheim

Overall looks good. I put in some comments on things that can be trimmed down from mini_cheetah_osc to horse_osc.

gym/envs/horse/horse_osc.py

sheim · 2025-11-19T16:22:35Z

gym/envs/horse/horse_osc.py

+
+        self.osc_offset = stop * self.cfg.osc.offset
+        self.osc_omega = (
+            stop * self.cfg.osc.omega_stop


here also, let's simplify and remove all the _var randomness

removed torch.randn_like(self.osc_omega/coupling) * self.cfg.osc.omega/coupling_var

gym/envs/horse/horse_osc.py

sheim · 2025-11-19T16:27:25Z

gym/envs/horse/horse_osc.py

+        self.oscillators_vel *= 2 * torch.pi
+        self.oscillators += (
+            self.oscillators_vel * dt
+        )  # torch.clamp(self.oscillators_vel * dt, min=0)


zap the comment, and it'll go onto one line

removed from both files

sheim · 2025-11-19T16:33:09Z

gym/envs/horse/horse_osc.py

+        prod = torch.prod(torch.clip(combined_rew, 0, 1), dim=1)
+        return prod - torch.ones_like(prod)
+
+    # this is removed in mini_cheetah_osc


what do you mean?
Oh it isn't removed, mini_cheetah_osc inherits from mini_cheetah, so whatever we don't overload is still there.

got it! cleaned it up, just wanted to make note of differences I saw for easier comparison while working

sheim · 2025-11-19T16:34:34Z

gym/envs/horse/horse_osc.py

+        diff = torch.abs(theta1 - theta2) % (2 * torch.pi)
+        return torch.min(diff, 2 * torch.pi - diff)
+
+    def _reward_trot(self):


all these gait-specific rewards are rubbish that wasn't refactored, zap'em

removed everything from _reward_trot to _reward_asymmetric

sheim · 2025-11-19T16:35:05Z

gym/envs/horse/horse_osc.py

+        similarity *= self._sqrdexp(angle - torch.pi, torch.pi)
+        return similarity
+
+    def _reward_asymettric(self):


zap until this one (included)

removed everything from _reward_trot to _reward_asymmetric

.gitignore

angelayixuanli and others added 30 commits September 5, 2025 20:09

add height training

68aac8e

mini_cheetah with horse mass

90f3271

scale inertia

b4ed45c

baseline - stable cheetah

1582c75

baseline - split body

afb939c

Merge branch 'main' of github.com:sheim/QGym into yl/horseRL

da4739a

mini_cheetah with horse values

fb34dd5

urdf-debugger fixed

f9fa9d3

stable horse but can't stand

af0534b

realistic effort values

a880139

better results

c456a0c

manual merge of non-logging part of yl/horse_RL

5001e40

Merge branch 'main' into horse_tweaks

03538e5

good SWE cleanup

12dc7a4

WIP: partial urdf changes

e1b3733

reformat mini_cheetah urdf file for readability

9aae283

fix left legs

f155bb4

mirror left and right

318d55e

remove inertial offsets along y for symmetry

b247418

rename urdf for symmetry ease, some minor tweaks

5285384

Merge branch 'yl/horseRL' into horse_tweaks

ed90880

merge horse tweaks

aa7e536

Horse tweaks

plots separated by leg or joint

71985f6

log obs scaling

795cec0

update scaling

5d65cc9

split off horse on its own

4414757

fix play script

f674162

Merge pull request #33 from sheim/horse_tweaks

21a6913

Horse tweaks

add horse_osc

c7eb06f

error stats logging

dc21012

angelayixuanli added 2 commits November 19, 2025 00:20

wandb sweep config

2d37c69

cleanup

c0411ff

angelayixuanli requested a review from sheim November 19, 2025 08:47

sheim reviewed Nov 19, 2025

View reviewed changes

angelayixuanli added 8 commits November 20, 2025 22:39

more cleanup

ca10c7e

whoops still need randomize_osc_params

b9401ac

horse_tweaks branch + tuning

80f05d0

expand joint limits for lay down motion

91f2e7c

logs_by_joint new limits

1f547c4

add tendon constraints

935b4d0

attempt at smoothing descent rewards

70cfb20

plots and fix height pos and command

1957e3f

Conversation

angelayixuanli commented Nov 19, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

sheim left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

angelayixuanli Nov 21, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

angelayixuanli commented Nov 19, 2025 •

edited

Loading

angelayixuanli Nov 21, 2025 •

edited

Loading