PUMA publications for /tag/Learning%20Reinforcement%20q-learning,%20learning,%20adaptive
https://puma.ub.uni-stuttgart.de/tag/Learning%20Reinforcement%20q-learning,%20learning,%20adaptive
2024-03-29T15:23:24+01:00

- Identification and Off-Policy Learning of Multiple Objectives Using Adaptive Clustering
  https://puma.ub.uni-stuttgart.de/bibtex/2c335185fe98e82121532705bd1db18db/malte.heckelen
  malte.heckelen, 2017-10-25T14:04:56+02:00
  Tags: Adaptive, Clustering, Learning, Multiobjective, Off-Policy, Q-learning, Reinforcement
  Thommen George Karimpanal and Erik Wilhelm. (2017) Identification and Off-Policy Learning of Multiple Objectives Using Adaptive Clustering. Neurocomputing, volume 263.