I guess someone has to be the negative one: I can't help feeling it's route to t...

superfx · on July 30, 2018

It looks that way because they're moving rapidly from one face configuration to another. But there's no way that's happening at random. I would guess that even just holding the cube steady in a dynamic grip is quite difficult.

toxik · on July 30, 2018

Agreed, it looks really uncoordinated. A lot of reinforcement learning algorithms have this problem, in my experience.

dgreensp · on July 31, 2018

I agree it looks sloppy, but that doesn’t mean it isn’t reliable. All it has to do in any given moment is make progress towards the goal of having the cube in the proper orientation, on average. It may be that it can do that very reliably even with noisy inputs and outputs.
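
A toy way to see this, as a rough sketch rather than anything resembling Dactyl's actual controller: treat the orientation error as a single number, nudge it toward zero by a small fraction each step, and add zero-mean noise to every move. The function name, gain, noise level, and tolerance below are all made-up values for illustration.

```python
import random

def steps_to_goal(start_error=180.0, gain=0.1, noise=5.0, tol=10.0,
                  max_steps=10_000, rng=None):
    """Count steps until |error| <= tol, or return None if max_steps runs out.

    Hypothetical 1-D model: each step applies a correction proportional
    to the current error, corrupted by Gaussian noise.
    """
    rng = rng or random.Random()
    error = start_error
    for step in range(1, max_steps + 1):
        # Progress on average (-gain * error) plus a sloppy, noisy component.
        error += -gain * error + rng.gauss(0.0, noise)
        if abs(error) <= tol:
            return step
    return None

if __name__ == "__main__":
    runs = [steps_to_goal(rng=random.Random(seed)) for seed in range(200)]
    reached = [r for r in runs if r is not None]
    print(f"{len(reached)}/{len(runs)} runs reached the goal")
    print(f"mean steps: {sum(reached) / len(reached):.1f}")
```

Any individual move can go the wrong way, but as long as the average move reduces the error, the runs still end up at the target in this simplified setting.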

poppingtonic · on July 30, 2018

Maybe if they randomized over polyhedra with n <= 20 faces. I'd love to see Dactyl tackle a dodecahedron.