Question: About Action Clipping vs Joint Limits in locomotion Example #1285

Answered by Kashu7100
Total-Bots-Lab asked this question in Q&A
You can use tighter bounds if you like. This bound parameter originally comes from legged gym (if I remember correctly).
The policy here is trained with PPO, whose outputs are sampled from a Gaussian distribution (with learned mean and std). It is therefore very unlikely to ever sample actions near the clipping bounds (in this case +/-100), which is why the clipping doesn't really affect policy learning.
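
To make the point above concrete, here is a minimal sketch (not code from the locomotion example) that estimates how often a Gaussian action sample would actually be clipped. The helper name `clipped_fraction`, the mean of 0.0, and the std of 1.0 are assumptions chosen for illustration; only the +/-100 bound comes from the discussion.

```python
import numpy as np


def clipped_fraction(mean, std, bound, n_samples=100_000, seed=0):
    """Estimate the fraction of Gaussian samples outside [-bound, bound].

    Hypothetical helper for illustration; it is not part of any library.
    """
    rng = np.random.default_rng(seed)
    # Sample actions the way a Gaussian policy head would: mean + std * noise.
    actions = rng.normal(loc=mean, scale=std, size=n_samples)
    # Apply the same kind of symmetric clipping used on policy actions.
    clipped = np.clip(actions, -bound, bound)
    # A sample counts as "clipped" if clipping changed its value.
    return float(np.mean(clipped != actions))


# Assumed policy statistics (mean=0, std=1), not taken from a trained policy.
frac_loose = clipped_fraction(mean=0.0, std=1.0, bound=100.0)
frac_tight = clipped_fraction(mean=0.0, std=1.0, bound=1.0)

print(f"fraction clipped at +/-100: {frac_loose}")
print(f"fraction clipped at +/-1:   {frac_tight}")
```

Under these assumed statistics, a bound of +/-100 sits 100 standard deviations from the mean, so no sample is touched by the clip, while a bound of +/-1 alters roughly a third of the samples. A tighter bound only starts to matter once it is within a few standard deviations of the policy's typical outputs.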

Answer selected by Kashu7100