TDprop: Does Jacobi Preconditioning Help Temporal Difference Learning?

Authors
Joshua Romoff
Peter Henderson
David Kanaa
Emmanuel Bengio
Ahmed Touati
Pierre-Luc Bacon
Joelle Pineau
Publication
Theoretical Foundations of RL Workshop and Beyond First Order Methods in ML Systems Workshop (ICML), 2020; 20th International Conference on Autonomous Agents and Multiagent Systems (AAMAS)
Date