Author of the publication

Deterministic Policy Gradient Algorithms.

, , , , , and . ICML, volume 32 of JMLR Workshop and Conference Proceedings, page 387-395. JMLR.org, (2014)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Learning to Pass Expectation Propagation Messages., , and . NIPS, page 3219-3227. (2013)Bayes-Adaptive Simulation-based Search with Value Function Approximation., , , and . NIPS, page 451-459. (2014)Learning a Generative Model of Images by Factoring Appearance and Shape., , , and . Neural Computation, 23 (3): 593-650 (2011)Gradient Estimation Using Stochastic Computation Graphs., , , and . CoRR, (2015)Direct Policy Gradients: Direct Optimization of Policies in Discrete Action Spaces., , , , and . CoRR, (2019)Recurrent Models of Visual Attention., , , and . CoRR, (2014)Value constrained model-free continuous control., , , , , and . CoRR, (2019)FeUdal Networks for Hierarchical Reinforcement Learning., , , , , , and . ICML, volume 70 of Proceedings of Machine Learning Research, page 3540-3549. PMLR, (2017)The Termination Critic., , , , , and . AISTATS, volume 89 of Proceedings of Machine Learning Research, page 2231-2240. PMLR, (2019)Rigorous Agent Evaluation: An Adversarial Approach to Uncover Catastrophic Failures., , , , , , , , and . ICLR (Poster), OpenReview.net, (2019)