PUMA publications for /tag/Learning%20Reinforcement%20q-learning,%20learning,%20adaptive
https://puma.ub.uni-stuttgart.de/tag/Learning%20Reinforcement%20q-learning,%20learning,%20adaptive
2024-03-29T06:49:06+01:00
- Identification and Off-Policy Learning of Multiple Objectives Using Adaptive Clustering
https://puma.ub.uni-stuttgart.de/bibtex/2c335185fe98e82121532705bd1db18db/malte.heckelen
malte.heckelen
2017-10-25T14:04:56+02:00
Adaptive Clustering, Learning Learning, Multiobjective Off-Policy, Q-learning, Reinforcement
<span data-person-type="author" class="authorEditorList "><span><span itemtype="http://schema.org/Person" itemscope="itemscope" itemprop="author"><a title="Thommen George Karimpanal" itemprop="url" href="/person/14ed4ca40820ec271e6eebdc7e76fd60b/author/0"><span itemprop="name">T. Karimpanal</span></a></span>, </span> and <span><span itemtype="http://schema.org/Person" itemscope="itemscope" itemprop="author"><a title="Erik Wilhelm" itemprop="url" href="/person/14ed4ca40820ec271e6eebdc7e76fd60b/author/1"><span itemprop="name">E. Wilhelm</span></a></span></span>. </span><span class="additional-entrytype-information"><span itemtype="http://schema.org/PublicationIssue" itemscope="itemscope" itemprop="isPartOf"><em><span itemprop="journal">Neurocomputing</span>, </em> </span>(<em><span>2017<meta content="2017" itemprop="datePublished"/></span></em>)</span>