Publications

Thommen George Karimpanal, and Erik Wilhelm. Identification and Off-Policy Learning of Multiple Objectives Using Adaptive Clustering. Neurocomputing, (263)2017. [PUMA: Learning, Learning Off-Policy, Adaptive Reinforcement Clustering, Q-learning, Multiobjective]