2-s2.0-80054027538

Reinforcement learning with heuristic to solve POMDP problem in mobile robot path planning

Adiprawita W.^a, Ahmad A.S.^a, Sembiring J.^a, Trilaksono B.R.^a

^aSchool of Electrical Engineering and Informatics, Bandung Institute of Technology, Indonesia

Abstract

In this paper we propose a method of presenting a special case of the Value Function as a solution to the POMDP in mobile robot navigation. With this method, the Value Function becomes less complex and more intuitive. We also propose a new reinforcement learning method to solve the Value Function. This reinforcement learning method is based on the Bellman equation, augmented with an A*-like heuristic during the update iteration. The resulting Value Function is validated both in Matlab simulation and in physical experiments using a simple autonomous mobile robot built with Lego Mindstorms NXT with 3 ultrasonic sonars, with the RWTH Mindstorms NXT Toolbox for Matlab connecting the robot to Matlab. The simulation and experiment also incorporate particle filter localization from previous research. The simulation and experiment show that the Value Function can be utilized very well.
© 2011 IEEE.

Author keywords

Autonomous Mobile Robot, Lego mindstorm, POMDP, RWTH toolbox, Value functions, Value iteration

Indexed keywords

autonomous mobile robot, LEGO Mindstorm NXT, navigation, POMDP, RWTH toolbox, value function, value iteration

DOI

https://doi.org/10.1109/ICEEI.2011.6021734
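
The abstract does not give the update rule itself, so the following is only a rough illustration of the general idea, not the authors' method. It is a minimal Python sketch of Bellman value iteration on a small, fully observable grid, where the order of in-place updates follows an A*-like heuristic (Manhattan distance to the goal). The function name, the grid encoding, the reward values, and the discount factor are all assumptions made for this example, and the belief-state (POMDP) and particle filter parts are left out entirely.

```python
def heuristic_value_iteration(grid, goal, gamma=0.95, step_reward=-1.0,
                              goal_reward=100.0, tol=1e-6, max_sweeps=1000):
    """Value iteration on a grid of strings ('#' = obstacle, anything else = free).

    Hypothetical sketch: moves are deterministic, the goal is terminal, and
    states are updated in order of increasing Manhattan distance to the goal
    so that values propagate outward from the goal quickly.
    """
    rows, cols = len(grid), len(grid[0])
    free = [(r, c) for r in range(rows) for c in range(cols)
            if grid[r][c] != '#']
    # A*-like heuristic, used here only to order the Bellman updates.
    order = sorted(free, key=lambda s: abs(s[0] - goal[0]) + abs(s[1] - goal[1]))
    V = {s: 0.0 for s in free}
    moves = [(-1, 0), (1, 0), (0, -1), (0, 1)]

    sweeps = 0
    for _ in range(max_sweeps):
        sweeps += 1
        delta = 0.0
        for s in order:
            if s == goal:
                new_v = goal_reward  # terminal state keeps a fixed value
            else:
                new_v = float('-inf')
                for dr, dc in moves:
                    nxt = (s[0] + dr, s[1] + dc)
                    if nxt not in V:  # wall or obstacle: robot stays in place
                        nxt = s
                    # Bellman backup for a deterministic transition
                    new_v = max(new_v, step_reward + gamma * V[nxt])
            delta = max(delta, abs(new_v - V[s]))
            V[s] = new_v  # in-place update, visible to later states in this sweep
        if delta < tol:
            break
    return V, sweeps


if __name__ == "__main__":
    example_grid = ["....",
                    ".##.",
                    "...."]
    values, n_sweeps = heuristic_value_iteration(example_grid, goal=(0, 3))
    print("sweeps:", n_sweeps)
    print("value at start (2, 0):", round(values[(2, 0)], 3))
```

Ordering the updates by distance to the goal means each state usually sees an already-updated neighbour closer to the goal, which tends to reduce the number of sweeps compared with an arbitrary ordering. A robot would then follow the greedy policy, moving to the neighbouring cell with the highest value.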
