TY - GEN
T1 - Statistical Learning Models for Japanese Essay Scoring Toward One-shot Learning
AU - Ejima, Chihiro
AU - Takeuchi, Koichi
N1 - Funding Information:
ACKNOWLEDGMENT A part of this research is supported by JSPS KAKENHI (Grant Number JP 22K00530).
Publisher Copyright:
© 2022 IEEE.
PY - 2022
Y1 - 2022
N2 - A lot of studies of automatic essay scoring are conducted using machine learning models. The previous studies show high performance for scoring large scale essays with machine learning models, however, more than hundreds of scored answers are required to train the neural network models. In this paper we discuss the possibility of one-shot learning, that is, using only one model essay as a training sample of a highest score. For this purpose, we apply regression models to estimate essay scores with different embedding models, that are, BERT and bag-of-words based encoding models. In preliminary experiments, feature analyses of one-shot learning with UMAP for the two embedding models reveal that the bag-of-words based model has more potential to score the test essays comparing to the BERT encoding model. Thus, to clarify the performance of the bag-of-words based encoding model, we conduct two experiments: firstly, we evaluate the performance of models to estimate the scores of test essays using 80% of score essays are used as training data; secondly, one-shot learning is applied to the models. The experimental results show that the proposed bag-of-words based encoding model is promising.
AB - A lot of studies of automatic essay scoring are conducted using machine learning models. The previous studies show high performance for scoring large scale essays with machine learning models, however, more than hundreds of scored answers are required to train the neural network models. In this paper we discuss the possibility of one-shot learning, that is, using only one model essay as a training sample of a highest score. For this purpose, we apply regression models to estimate essay scores with different embedding models, that are, BERT and bag-of-words based encoding models. In preliminary experiments, feature analyses of one-shot learning with UMAP for the two embedding models reveal that the bag-of-words based model has more potential to score the test essays comparing to the BERT encoding model. Thus, to clarify the performance of the bag-of-words based encoding model, we conduct two experiments: firstly, we evaluate the performance of models to estimate the scores of test essays using 80% of score essays are used as training data; secondly, one-shot learning is applied to the models. The experimental results show that the proposed bag-of-words based encoding model is promising.
KW - Automated essay scoring
KW - One-shot learning
KW - Support vector regression
UR - http://www.scopus.com/inward/record.url?scp=85139555706&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85139555706&partnerID=8YFLogxK
U2 - 10.1109/IIAIAAI55812.2022.00070
DO - 10.1109/IIAIAAI55812.2022.00070
M3 - Conference contribution
AN - SCOPUS:85139555706
T3 - Proceedings - 2022 12th International Congress on Advanced Applied Informatics, IIAI-AAI 2022
SP - 313
EP - 318
BT - Proceedings - 2022 12th International Congress on Advanced Applied Informatics, IIAI-AAI 2022
A2 - Matsuo, Tokuro
A2 - Takamatsu, Kunihiko
A2 - Ono, Yuichi
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 12th International Congress on Advanced Applied Informatics, IIAI-AAI 2022
Y2 - 2 July 2022 through 7 July 2022
ER -