Author of the publication

Policy Search in a Space of Simple Closed-form Formulas: Towards Interpretability of Reinforcement Learning.

, , , and . Discovery Science, volume 7569 of Lecture Notes in Computer Science, page 37-51. Springer, (2012)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Imitative Learning for Online Planning in Microgrids., , , , and . DARE, volume 9518 of Lecture Notes in Computer Science, page 1-15. Springer, (2015)Lipschitz robust control from off-policy trajectories., , , and . CDC, page 4924-4931. IEEE, (2014)Imitative learning for real-time strategy games., , , and . CIG, page 424-429. IEEE, (2012)Inferring bounds on the performance of a control policy from a sample of trajectories., , , and . ADPRL, page 117-123. IEEE, (2009)Estimation Monte Carlo sans modèle de politiques de décision., , , and . Rev. d'Intelligence Artif., 25 (3): 321-343 (2011)Aggregating Optimistic Planning Trees for Solving Markov Decision Processes., , and . NIPS, page 2382-2390. (2013)Optimistic planning for belief-augmented Markov Decision Processes., , and . ADPRL, page 77-84. IEEE, (2013)Using approximate dynamic programming for estimating the revenues of a hydrogen-based high-capacity storage device., , and . ADPRL, page 1-8. IEEE, (2014)Batch mode reinforcement learning based on the synthesis of artificial trajectories., , , and . Annals OR, 208 (1): 383-416 (2013)Critical Time Windows for Renewable Resource Complementarity Assessment., , , , , , , , , and . CoRR, (2018)