Emergence of Chemotactic Strategies with Multi-Agent Reinforcement Learning

S. Tovey, C. Lohrmann, and C. Holm.
(2024)
DOI: https://doi.org/10.48550/arXiv.2404.01999

Abstract

Reinforcement learning (RL) is a flexible and efficient method for programming micro-robots in complex environments. Here we investigate whether reinforcement learning can provide insights into biological systems when trained to perform chemotaxis. Namely, whether we can learn about how intelligent agents process given information in order to swim towards a target. We run simulations covering a range of agent shapes, sizes, and swim speeds to determine if the physical constraints on biological swimmers, namely Brownian motion, lead to regions where reinforcement learners’ training fails. We find that the RL agents can perform chemotaxis as soon as it is physically possible and, in some cases, even before the active swimming overpowers the stochastic environment. We study the efficiency of the emergent policy and identify convergence in agent size and swim speeds. Finally, we study the strategy adopted by the reinforcement learning algorithm to explain how the agents perform their tasks. To this end, we identify three emerging dominant strategies and several rare approaches taken. These strategies, whilst producing almost identical trajectories in simulation, are distinct and give insight into the possible mechanisms behind which biological agents explore their environment and respond to changing conditions.

BibTeX key: Tovey24a
entry type: misc
year: 2024
eprint: 2404.01999
archiveprefix: arXiv
primaryclass: physics.bio-ph
DOI: https://doi.org/10.48550/arXiv.2404.01999

Cite this publication

@misc{Tovey24a,
  abstract = {Reinforcement learning (RL) is a flexible and efficient method for programming micro-robots in complex environments. Here we investigate whether reinforcement learning can provide insights into biological systems when trained to perform chemotaxis. Namely, whether we can learn about how intelligent agents process given information in order to swim towards a target. We run simulations covering a range of agent shapes, sizes, and swim speeds to determine if the physical constraints on biological swimmers, namely Brownian motion, lead to regions where reinforcement learners’ training fails. We find that the RL agents can perform chemotaxis as soon as it is physically possible and, in some cases, even before the active swimming overpowers the stochastic environment. We study the efficiency of the emergent policy and identify convergence in agent size and swim speeds. Finally, we study the strategy adopted by the reinforcement learning algorithm to explain how the agents perform their tasks. To this end, we identify three emerging dominant strategies and several rare approaches taken. These strategies, whilst producing almost identical trajectories in simulation, are distinct and give insight into the possible mechanisms behind which biological agents explore their environment and respond to changing conditions.},
  added-at = {2024-04-16T10:16:03.000+0200},
  archiveprefix = {arXiv},
  author = {Tovey, Samuel and Lohrmann, Christoph and Holm, Christian},
  biburl = {https://puma.ub.uni-stuttgart.de/bibtex/2c92872df42e23b3a66ca77d23015795b/simtech},
  description = {[2404.01999] Emergence of Chemotactic Strategies with Multi-Agent Reinforcement Learning},
  doi = {https://doi.org/10.48550/arXiv.2404.01999},
  eprint = {2404.01999},
  interhash = {b53bbe44709b8e3cc82555c5cc46f607},
  intrahash = {c92872df42e23b3a66ca77d23015795b},
  keywords = {PN3 PN3A-1 EXC2075},
  primaryclass = {physics.bio-ph},
  timestamp = {2024-04-16T10:16:57.000+0200},
  title = {Emergence of Chemotactic Strategies with Multi-Agent Reinforcement Learning},
  year = 2024
}
