Reinforcement learning with musculoskeletal models(osim-rl.stanford.edu) |
Reinforcement learning with musculoskeletal models(osim-rl.stanford.edu) |
http://www.chinedufn.com/dual-quaternion-shader-explained/
They are magic.
Are y'all using penalty methods for the collisions? Which model does it use?
I wonder if you could get a better result by including other factors in the reward function, like trying to maintain a slight forward lean.