TY - GEN
T1 - Proposal of device control method based on consensus building using reinforcement learning
AU - Oishi, Isato
AU - Tarutani, Yuya
AU - Fukushima, Yukinobu
AU - Yokohira, Tokumi
N1 - Publisher Copyright:
© 2021 IEEE.
PY - 2021/1/13
Y1 - 2021/1/13
N2 - Various information is collected from IoT devices through the network. As such device becomes more familiar to the user, services are required to consider the influence of user. However, it is difficult to set the parameters of actuators that build consensus among all users in an environment where people with various preferences coexist. The conventional method minimizes the power consumption under the constraints of the user stress. However, this method has a problem that the calculation overhead is increased as the number of devices and users is increased. In this study, we propose a device control method based on consensus building with reinforcement learning. In the proposed method, the state is reduced by applying reinforcement learning for reducing the calculation overhead. As a result of evaluation, we clarified that our method obtains the device parameters that improve the reward by 1.5 times compared with the conventional method. Moreover, we also clarified that a reward value of 98.6% can be achieved compared to the optimum value.
AB - Various information is collected from IoT devices through the network. As such device becomes more familiar to the user, services are required to consider the influence of user. However, it is difficult to set the parameters of actuators that build consensus among all users in an environment where people with various preferences coexist. The conventional method minimizes the power consumption under the constraints of the user stress. However, this method has a problem that the calculation overhead is increased as the number of devices and users is increased. In this study, we propose a device control method based on consensus building with reinforcement learning. In the proposed method, the state is reduced by applying reinforcement learning for reducing the calculation overhead. As a result of evaluation, we clarified that our method obtains the device parameters that improve the reward by 1.5 times compared with the conventional method. Moreover, we also clarified that a reward value of 98.6% can be achieved compared to the optimum value.
KW - Consensus builder
KW - Internet of Things
KW - Reinforcement learning
UR - http://www.scopus.com/inward/record.url?scp=85100735845&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85100735845&partnerID=8YFLogxK
U2 - 10.1109/ICOIN50884.2021.9333958
DO - 10.1109/ICOIN50884.2021.9333958
M3 - Conference contribution
AN - SCOPUS:85100735845
T3 - International Conference on Information Networking
SP - 451
EP - 456
BT - 35th International Conference on Information Networking, ICOIN 2021
PB - IEEE Computer Society
T2 - 35th International Conference on Information Networking, ICOIN 2021
Y2 - 13 January 2021 through 16 January 2021
ER -