Question: About Action Clipping vs Joint Limits in locomotion Example #1285

Total-Bots-Lab · 2025-06-15T22:27:47Z

Total-Bots-Lab
Jun 15, 2025

Hi all,

In the genesis/examples/locomotion environment, I noticed a potentially confusing discrepancy between action clipping and joint limits.

In the step() function, actions are clipped like this:

self.actions = torch.clip(actions, -self.env_cfg["clip_actions"], self.env_cfg["clip_actions"])

And in the config defined in the train file, clip_actions is set to 100.0:

"clip_actions": 100.0

Then the target positions are applied as:

target_dof_pos = exec_actions * self.env_cfg["action_scale"] + self.default_dof_pos
self.robot.control_dofs_position(target_dof_pos, self.motors_dof_idx)

However, according to the XML file, the joint limits (in radians) are significantly tighter, e.g.:

Abduction joints: [-1.0472, 1.0472]

Front thigh: [-1.5708, 3.4907]

Rear thigh: [-0.5236, 4.5379]

Knees: [-2.7227, -0.83776]

This raises two questions:

1. Why is clip_actions set to such a large and seemingly arbitrary value (±100), which is clearly outside the feasible joint angle ranges?

2. Is there an internal mechanism (e.g., action_scale) that keeps target_dof_pos within joint limits despite this large clipping range?

Would appreciate any clarification on how these design choices are intended to work.

Thanks!
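
To make the second question concrete, here is a minimal sketch of the arithmetic behind target_dof_pos. The joint limits and clip_actions come from the post; ACTION_SCALE and DEFAULT_DOF_POS are assumed placeholder values (the post does not state them), so substitute the ones from your own config. The helper names target_range and within_limits are hypothetical.

```python
# Joint limits in radians, as quoted from the XML in the post above.
JOINT_LIMITS = {
    "abduction": (-1.0472, 1.0472),
    "front_thigh": (-1.5708, 3.4907),
    "rear_thigh": (-0.5236, 4.5379),
    "knee": (-2.7227, -0.83776),
}

CLIP_ACTIONS = 100.0  # from the post

# ASSUMED values, not stated in the post: check your own training config.
ACTION_SCALE = 0.25
DEFAULT_DOF_POS = {
    "abduction": 0.0,
    "front_thigh": 0.8,
    "rear_thigh": 1.0,
    "knee": -1.5,
}


def target_range(default_pos, clip=CLIP_ACTIONS, scale=ACTION_SCALE):
    """Range of target_dof_pos reachable when actions lie in [-clip, clip]."""
    return (default_pos - clip * scale, default_pos + clip * scale)


def within_limits(rng, limits):
    """True if the whole reachable range lies inside the joint limits."""
    return limits[0] <= rng[0] and rng[1] <= limits[1]


if __name__ == "__main__":
    for name, limits in JOINT_LIMITS.items():
        rng = target_range(DEFAULT_DOF_POS[name])
        print(
            f"{name}: targets in [{rng[0]:.2f}, {rng[1]:.2f}] rad, "
            f"limits {limits}, inside={within_limits(rng, limits)}"
        )
```

Under these assumed values, scaling alone does not keep the targets inside the limits when the clip is ±100: the reachable range is the default position ± 25 rad. With a clip of ±1 the same arithmetic gives ± 0.25 rad around the default, which does fit inside every limit listed above.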

Answered by Kashu7100

Jun 16, 2025


duburcqa · 2025-06-16T08:05:49Z

duburcqa
Jun 16, 2025
Collaborator

@Kashu7100 Any idea about this?

1 reply

Total-Bots-Lab Jun 16, 2025
Author

Thanks for the response.

Kashu7100 · 2025-06-16T10:13:11Z

Kashu7100
Jun 16, 2025
Collaborator

You can use tighter bounds if you like. This bound parameter originally comes from legged gym (if I remember correctly).
The policy used here is trained with PPO, whose outputs follow a Gaussian distribution (with learned mean and std). So it is very unlikely to ever sample actions near the clipping bounds (in this case +/-100); that's why the clipping doesn't really affect policy learning.
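
The Gaussian argument above can be checked numerically. The sketch below computes the probability that a sample from a normal distribution falls outside a symmetric bound. The mean of 0 and std of 1 are illustrative assumptions, not values taken from the trained policy, and prob_outside is a hypothetical helper.

```python
import math


def prob_outside(bound, mean=0.0, std=1.0):
    """P(|a| > bound) for a ~ Normal(mean, std), via the complementary error function."""
    upper = 0.5 * math.erfc((bound - mean) / (std * math.sqrt(2.0)))  # P(a > bound)
    lower = 0.5 * math.erfc((bound + mean) / (std * math.sqrt(2.0)))  # P(a < -bound)
    return upper + lower


if __name__ == "__main__":
    for bound in (1.0, 3.0, 100.0):
        print(f"P(|a| > {bound}) = {prob_outside(bound):.3e}")
```

With these assumed parameters, a bound of ±100 is about 100 standard deviations away from the mean, so the clip is effectively never triggered. A tighter bound such as ±3 would clip roughly 0.27% of samples. If the learned mean or std grows large during training, the picture changes, so this is a sanity check and not a guarantee.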

3 replies

Total-Bots-Lab Jun 16, 2025
Author

Thanks for your response.
Could you please double-check my understanding regarding control_dofs_position?
Am I correct in assuming that the input values are in radians?
Also, are the position limits (in radians) the same as those I found in the XML file?

Kashu7100 Jun 22, 2025
Collaborator

yes, that should be correct.
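
If you prefer the position targets to respect the XML limits explicitly, one option is to clamp them before calling control_dofs_position. This is a minimal sketch in plain Python, assuming the per-joint limits quoted earlier in this thread; clamp_targets is a hypothetical helper, and the thread does not say whether the simulator already enforces these limits on its own.

```python
# Per-joint limits in radians, taken from the values quoted earlier in the thread.
# Order here is illustrative: abduction, front thigh, rear thigh, knee.
LOWER = [-1.0472, -1.5708, -0.5236, -2.7227]
UPPER = [1.0472, 3.4907, 4.5379, -0.83776]


def clamp_targets(targets, lower=LOWER, upper=UPPER):
    """Clamp each target position (rad) into its own [lower, upper] interval."""
    return [min(max(t, lo), hi) for t, lo, hi in zip(targets, lower, upper)]


if __name__ == "__main__":
    raw = [2.0, 0.8, 5.0, -0.5]  # hypothetical unclamped targets in radians
    print(clamp_targets(raw))
```

With tensors, torch.clamp accepts tensor-valued bounds and performs the same element-wise operation on a whole batch of environments.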

Total-Bots-Lab Jun 27, 2025
Author

thanks
