Author of the publication

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Policy Search in a Space of Simple Closed-form Formulas: Towards Interpretability of Reinforcement Learning.

F. Maes, R. Fonteneau, L. Wehenkel, and D. Ernst. Discovery Science, volume 7569 of Lecture Notes in Computer Science, page 37-51. Springer, (2012)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

Raphael Vogler University of Stuttgart

Raphael Menges University of Stuttgart

Raphael Rohrmüller University of Stuttgart

Raphael Leiteritz University of Stuttgart

Raphael Nägele University of Stuttgart

Other publications of authors with the same name

Imitative Learning for Online Planning in Microgrids.S. Aittahar, V. François-Lavet, S. Lodeweyckx, D. Ernst, and R. Fonteneau. DARE, volume 9518 of Lecture Notes in Computer Science, page 1-15. Springer, (2015)Lipschitz robust control from off-policy trajectories.R. Fonteneau, D. Ernst, B. Boigelot, and Q. Louveaux. CDC, page 4924-4931. IEEE, (2014)Imitative learning for real-time strategy games.Q. Gemine, F. Safadi, R. Fonteneau, and D. Ernst. CIG, page 424-429. IEEE, (2012)Inferring bounds on the performance of a control policy from a sample of trajectories.R. Fonteneau, S. Murphy, L. Wehenkel, and D. Ernst. ADPRL, page 117-123. IEEE, (2009)Estimation Monte Carlo sans modèle de politiques de décision.R. Fonteneau, S. Murphy, L. Wehenkel, and D. Ernst. Rev. d'Intelligence Artif., 25 (3): 321-343 (2011)Aggregating Optimistic Planning Trees for Solving Markov Decision Processes.G. Kedenburg, R. Fonteneau, and R. Munos. NIPS, page 2382-2390. (2013)Optimistic planning for belief-augmented Markov Decision Processes.R. Fonteneau, L. Busoniu, and R. Munos. ADPRL, page 77-84. IEEE, (2013)Using approximate dynamic programming for estimating the revenues of a hydrogen-based high-capacity storage device.V. François-Lavet, R. Fonteneau, and D. Ernst. ADPRL, page 1-8. IEEE, (2014)Batch mode reinforcement learning based on the synthesis of artificial trajectories.R. Fonteneau, S. Murphy, L. Wehenkel, and D. Ernst. Annals OR, 208 (1): 383-416 (2013)Critical Time Windows for Renewable Resource Complementarity Assessment.M. Berger, R. David, R. Fonteneau, R. Henry, M. Glavic, X. Fettweis, M. Du, P. Panciatici, L. Balea, and D. Ernst. CoRR, (2018)