TY - JOUR
T1 - Extended QDSEGA for controlling real robots: acquisition of locomotion patterns for snake-like robot
AU - Ito, Kazuyuki
AU - Kamegawa, Tetsushi
AU - Matsuno, Fumitoshi
PY - 2003/12/9
Y1 - 2003/12/9
N2 - Reinforcement learning is very effective for robot learning because it does not need prior knowledge and has a high capability for reactive and adaptive behaviors. In our previous work, we proposed a new reinforcement learning algorithm: "Q-learning with Dynamic Structuring of Exploration Space Based on Genetic Algorithm (QDSEGA)". It is designed for complicated systems with a large action-state space, such as a robot with many redundant degrees of freedom. However, the application of QDSEGA is restricted to static systems. A snake-like robot has many redundant degrees of freedom, and the dynamics of the system are very important for completing the locomotion task, so applying conventional reinforcement learning is very difficult. In this paper, we extend the layered structure of QDSEGA so that it can be applied to real robots that have complexity and dynamics. We apply it to the acquisition of locomotion patterns of a snake-like robot and demonstrate the effectiveness and validity of QDSEGA with the extended layered structure by simulation and experiment.
AB - Reinforcement learning is very effective for robot learning because it does not need prior knowledge and has a high capability for reactive and adaptive behaviors. In our previous work, we proposed a new reinforcement learning algorithm: "Q-learning with Dynamic Structuring of Exploration Space Based on Genetic Algorithm (QDSEGA)". It is designed for complicated systems with a large action-state space, such as a robot with many redundant degrees of freedom. However, the application of QDSEGA is restricted to static systems. A snake-like robot has many redundant degrees of freedom, and the dynamics of the system are very important for completing the locomotion task, so applying conventional reinforcement learning is very difficult. In this paper, we extend the layered structure of QDSEGA so that it can be applied to real robots that have complexity and dynamics. We apply it to the acquisition of locomotion patterns of a snake-like robot and demonstrate the effectiveness and validity of QDSEGA with the extended layered structure by simulation and experiment.
UR - http://www.scopus.com/inward/record.url?scp=0344444756&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=0344444756&partnerID=8YFLogxK
M3 - Conference article
AN - SCOPUS:0344444756
SN - 1050-4729
VL - 1
SP - 791
EP - 796
JO - Proceedings - IEEE International Conference on Robotics and Automation
JF - Proceedings - IEEE International Conference on Robotics and Automation
T2 - 2003 IEEE International Conference on Robotics and Automation
Y2 - 14 September 2003 through 19 September 2003
ER -