Benchmarks are particularly useful for characterizing algorithms and determining their usefulness in different settings. Here I highlight the need for more standardization of performance evaluations in inverse reinforcement learning.
In recent years, significant progress has been made in solving challenging problems using deep learning. Reproducing existing work and accurately judging the improvements offered by novel methods is vital to sustaining this progress. Unfortunately, …
A brief discussion on some difficulties that students may encounter in reproducing modern policy gradient methods in continuous control tasks and best practices for writing papers on these methods.
A practical tutorial session on implementing and running policy gradient methods for continuous control tasks.