Abstract
Reinforcement learning is considered as an important tool for robotic learning in unknown/uncertain environments. In this paper, we propose an evaluation function expressed in a vector form to realize multi-dimensional reinforcement learning. The novel feature of the proposed method is that learning one behavior induces parallel learning of other behaviors though the objectives of each behavior are different. In brief, all behaviors watch other behaviors from a critical point of view. Therefore, in the proposed method, there is cross-criticism and parallel learning that make the multi-dimensional learning process more efficient. By applying the proposed learning method, we carried out multi-dimensional evaluation (reward) and multi-dimensional learning simultaneously in one trial. A special neural network (Q-net), in which the weights and the output are represented by vectors, is proposed to realize a critic network for Q-learning. The proposed learning method is applied for behavior planning of mobile robots.
Original language | English |
---|---|
Pages (from-to) | 142-148 |
Number of pages | 7 |
Journal | International Journal of Control, Automation and Systems |
Volume | 1 |
Issue number | 1 |
Publication status | Published - Mar 2003 |
Externally published | Yes |
Keywords
- Intelligent robot
- Multi-dimensional evaluation
- Neural networks
- Q-learning
- Reinforcement learning
ASJC Scopus subject areas
- Control and Systems Engineering
- Computer Science Applications