2-s2.0-84991810824

State elimination in accelerated Multiagent Reinforcement Learning

Prihatmanto A.S.a, Adiprawita W.a, Sari S.C.a, Kuspriyantoa

a Department of Electrical Engineering, School of Electrical Engineering and Informatics, Bandung Institute of Technology, Bandung, 40132, Indonesia

Abstract

© 2016, School of Electrical Engineering and Informatics. All rights reserved.

This paper presents a novel Multiagent Reinforcement Learning algorithm, State Elimination in Accelerated Multiagent Reinforcement Learning (SEA-MRL), which achieves faster learning without incorporating internal knowledge or human intervention, such as reward shaping, transfer learning, parameter tuning, or heuristics, into the learning system. Because learning speed is determined, among other factors, by the size of the state space (the larger the state space, the slower learning may become), reducing the state space can lead to faster convergence. SEA-MRL distinguishes insignificant states from significant ones and eliminates them in the early learning episodes, which aggressively reduces the scale of the state space in the subsequent episodes. Applied to gridworld multi-robot navigation, SEA-MRL reaches learning convergence 1.62 times faster. The algorithm is generally applicable to other multiagent task challenges and to general multiagent learning with large state spaces, and it is directly applicable, without adjustment, to single-agent learning.

Author keywords

Indexed keywords

Accelerated Reinforcement Learning, Machine learning, Multiagent learning, Multiagent Reinforcement Learning, Reinforcement learning, State space reduction

Funding details

DOI

https://doi.org/10.15676/ijeei.2016.8.3.12
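The paper itself is not reproduced in this record; as a rough illustration of the state-elimination idea described in the abstract, the following is a minimal sketch, not the authors' algorithm, of tabular Q-learning in which states whose value estimates remain near zero after the early episodes are treated as insignificant and removed. The class name StateEliminationQLearner, the elimination_episode and significance_threshold parameters, and the near-zero-value criterion are hypothetical choices made for illustration only.

```python
import random
from collections import defaultdict

class StateEliminationQLearner:
    """Tabular Q-learning with a crude state-elimination step (illustrative sketch only)."""

    def __init__(self, actions, alpha=0.1, gamma=0.9, epsilon=0.1,
                 elimination_episode=50, significance_threshold=1e-3):
        # Q[state][action]; unseen entries default to 0.0
        self.q = defaultdict(lambda: defaultdict(float))
        self.actions = list(actions)
        self.alpha, self.gamma, self.epsilon = alpha, gamma, epsilon
        self.elimination_episode = elimination_episode        # hypothetical: episode at which elimination runs
        self.significance_threshold = significance_threshold  # hypothetical: |Q| below this marks a state insignificant
        self.eliminated = set()

    def choose_action(self, state):
        # epsilon-greedy action selection for a (non-eliminated) state
        if random.random() < self.epsilon:
            return random.choice(self.actions)
        return max(self.actions, key=lambda a: self.q[state][a])

    def update(self, state, action, reward, next_state):
        # standard one-step Q-learning update
        best_next = max(self.q[next_state][a] for a in self.actions)
        td_target = reward + self.gamma * best_next
        self.q[state][action] += self.alpha * (td_target - self.q[state][action])

    def eliminate_insignificant_states(self, episode):
        # After the early learning episodes, drop states whose value estimates
        # stayed near zero, so later episodes explore a smaller state space.
        if episode != self.elimination_episode:
            return
        for state in list(self.q):
            magnitudes = [abs(v) for v in self.q[state].values()] or [0.0]
            if max(magnitudes) < self.significance_threshold:
                self.eliminated.add(state)
                del self.q[state]
```

In a gridworld navigation setting like the one mentioned in the abstract, each agent would call update() at every step and eliminate_insignificant_states() at the end of each episode; eliminated states could then be masked out of the successor set so that later episodes never revisit them, which is the state-space reduction the abstract credits for the speed-up.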