|
Can you point me to algorithms or code for inverse reinforcement learning? |
|
I just implemented the 'projection' version of Pieter Abbeel's 'Apprenticeship Learning via Inverse Reinforcement Learning' algorithm in Python. I'll be updating it with something more approximate in the near future. It's an exact method, so it must enumerate all Q-states at this point. |