[vc_empty_space][vc_empty_space]
Joint action optimation for robotic soccer multiagent using reinforcement learning method
Sari S.C.a, Kuspriyantob, Prihatmanto A.S.b
a Department of Electrical Engineering, Faculty of Engineering, General Achmad Yani University, Indonesia
b Department of Electrical Engineering, School of Electrical Engineering and Informatic, Bandung Institute of Technology, Indonesia
[vc_row][vc_column][vc_row_inner][vc_column_inner][vc_separator css=”.vc_custom_1624529070653{padding-top: 30px !important;padding-bottom: 30px !important;}”][/vc_column_inner][/vc_row_inner][vc_row_inner layout=”boxed”][vc_column_inner width=”3/4″ css=”.vc_custom_1624695412187{border-right-width: 1px !important;border-right-color: #dddddd !important;border-right-style: solid !important;border-radius: 1px !important;}”][vc_empty_space][megatron_heading title=”Abstract” size=”size-sm” text_align=”text-left”][vc_column_text]In order to fulfill some tasks to reach a certain common goal, agents need to make sequence of decisions they have to perform as agroup. The decision is taken based on a selection mechanism of available actions. Choosing arbitrary action will lead to time and energy waste, since not all actions are even optimum. Agents need to decide not only which individual action that will lead to optimum performance, but also their joint actions. Applying reinforcement learning in the multiagent’s learning process gives a sequence of optimum joint actions, which collaboration of agents based on this optimum joint actions guarantees the fastest time to reach their goal. © 2012 IEEE.[/vc_column_text][vc_empty_space][vc_separator css=”.vc_custom_1624528584150{padding-top: 25px !important;padding-bottom: 25px !important;}”][vc_empty_space][megatron_heading title=”Author keywords” size=”size-sm” text_align=”text-left”][vc_column_text]Energy wastes,Joint actions,Learning process,Markov Decision Processes,Multi-agent reinforcement learning,Optimum performance,Reinforcement learning method,Robotic soccer,Selection mechanism[/vc_column_text][vc_empty_space][vc_separator css=”.vc_custom_1624528584150{padding-top: 25px !important;padding-bottom: 25px !important;}”][vc_empty_space][megatron_heading title=”Indexed keywords” size=”size-sm” text_align=”text-left”][vc_column_text]decision-making,Multiagent Markov Decision Process,multiagent reinforcement learning,Robotic soccer agent[/vc_column_text][vc_empty_space][vc_separator css=”.vc_custom_1624528584150{padding-top: 25px !important;padding-bottom: 25px !important;}”][vc_empty_space][megatron_heading title=”Funding details” size=”size-sm” text_align=”text-left”][vc_column_text][/vc_column_text][vc_empty_space][vc_separator css=”.vc_custom_1624528584150{padding-top: 25px !important;padding-bottom: 25px !important;}”][vc_empty_space][megatron_heading title=”DOI” size=”size-sm” text_align=”text-left”][vc_column_text]https://doi.org/10.1109/ICSEngT.2012.6339298[/vc_column_text][/vc_column_inner][vc_column_inner width=”1/4″][vc_column_text]Widget Plumx[/vc_column_text][/vc_column_inner][/vc_row_inner][/vc_column][/vc_row][vc_row][vc_column][vc_separator css=”.vc_custom_1624528584150{padding-top: 25px !important;padding-bottom: 25px !important;}”][/vc_column][/vc_row]