copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Hierarchical Reinforcement Learning Approach for Autonomous Cross-Country Soaring

S. Notter, F. Schimpf, G. Müller, and W. Fichter. Journal of Guidance, Control, and Dynamics, 46 (1): 114-126 (January 2023)
DOI: 10.2514/1.G006746

Abstract

Solving the decision-making problem between pursuing the objective of covering distance and exploiting thermal updrafts is the central challenge in cross-country soaring flight. The need for trading short-term rewarding actions against actions that pay off in the long term makes for a hard-to-solve problem. Policies resulting from reinforcement learning offer the potential to handle long-term correlations between actions taken and rewards received. The paper presents a reinforcement learning setup, which results in a control strategy for the autonomous soaring sample application of GPS Triangle racing. First, we frame the problem in terms of a Markov decision process. In particular, we present a straightforward model for the three-degrees-of-freedom system dynamics of a glider aircraft that does not make any simplifying assumptions regarding the wind field or the relative aircraft velocity. The competition task is decomposed into subtasks, then. Stochastic gradient ascent solves the associated hierarchical reinforcement learning problem without the designer employing any further, potentially deficient heuristics. We present an implementation of the overall policy alongside an updraft estimator on embedded hardware aboard an unpiloted glider aircraft. Flight-test results validate the successful transfer of the hierarchical control policy trained in simulation to real-world autonomous cross-country soaring.

@stefannotter's tags highlighted

Cite this publication

@article{doi:10.2514/1.G006746, abstract = {Solving the decision-making problem between pursuing the objective of covering distance and exploiting thermal updrafts is the central challenge in cross-country soaring flight. The need for trading short-term rewarding actions against actions that pay off in the long term makes for a hard-to-solve problem. Policies resulting from reinforcement learning offer the potential to handle long-term correlations between actions taken and rewards received. The paper presents a reinforcement learning setup, which results in a control strategy for the autonomous soaring sample application of GPS Triangle racing. First, we frame the problem in terms of a Markov decision process. In particular, we present a straightforward model for the three-degrees-of-freedom system dynamics of a glider aircraft that does not make any simplifying assumptions regarding the wind field or the relative aircraft velocity. The competition task is decomposed into subtasks, then. Stochastic gradient ascent solves the associated hierarchical reinforcement learning problem without the designer employing any further, potentially deficient heuristics. We present an implementation of the overall policy alongside an updraft estimator on embedded hardware aboard an unpiloted glider aircraft. Flight-test results validate the successful transfer of the hierarchical control policy trained in simulation to real-world autonomous cross-country soaring. }, added-at = {2022-10-25T09:44:56.000+0200}, author = {Notter, Stefan and Schimpf, Fabian and Müller, Gregor and Fichter, Walter}, biburl = {https://puma.ub.uni-stuttgart.de/bibtex/23d6f0e0ba7ca4c68c202d998cad36c63/stefannotter}, doi = {10.2514/1.G006746}, editor = {Lu, Ping}, eprint = {https://doi.org/10.2514/1.G006746}, interhash = {ef340c57dbf0ca4d38cb8929085c668e}, intrahash = {3d6f0e0ba7ca4c68c202d998cad36c63}, journal = {Journal of Guidance, Control, and Dynamics}, keywords = {ifr ifrsend:unibiblio myown}, month = jan, number = 1, pages = {114-126}, timestamp = {2022-12-27T10:36:59.000+0100}, title = {Hierarchical Reinforcement Learning Approach for Autonomous Cross-Country Soaring}, url = {https://doi.org/10.2514/1.G006746 }, volume = 46, year = 2023 }

PUMA

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Hierarchical Reinforcement Learning Approach for Autonomous Cross-Country Soaring

Abstract

Links and resources

Tags

community

Cite this publication

More citation styles

search on

Meta data

Comments and Reviews
(0)

PUMA

copydeleteadd this publication to your clipboardcommunity posthistory of this postURLDOIBibTeXEndNoteAPAChicagoDIN 1505HarvardMSOffice XML Hierarchical Reinforcement Learning Approach for Autonomous Cross-Country Soaring

Abstract

Links and resources

Tags

community

Cite this publication

More citation styles

search on

Meta data

Comments and Reviews (0)

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Hierarchical Reinforcement Learning Approach for Autonomous Cross-Country Soaring

Comments and Reviews
(0)