TY - GEN
T1 - Twitter catches the flu
T2 - Conference on Empirical Methods in Natural Language Processing, EMNLP 2011
AU - Aramaki, Eiji
AU - Maskawa, Sachiko
AU - Morita, Mizuki
PY - 2011/10/3
Y1 - 2011/10/3
N2 - With the recent rise in popularity and scale of social media, a growing need exists for systems that can extract useful information from huge amounts of data. We address the issue of detecting influenza epidemics. First, the proposed system extracts influenza related tweets using Twitter API. Then, only tweets that mention actual influenza patients are extracted by the support vector machine (SVM) based classifier. The experiment results demonstrate the feasibility of the proposed approach (0.89 correlation to the gold standard). Especially at the outbreak and early spread (early epidemic stage), the proposed method shows high correlation (0.97 correlation), which outperforms the state-of-the-art methods. This paper describes that Twitter texts reflect the real world, and that NLP techniques can be applied to extract only tweets that contain useful information.
AB - With the recent rise in popularity and scale of social media, a growing need exists for systems that can extract useful information from huge amounts of data. We address the issue of detecting influenza epidemics. First, the proposed system extracts influenza related tweets using Twitter API. Then, only tweets that mention actual influenza patients are extracted by the support vector machine (SVM) based classifier. The experiment results demonstrate the feasibility of the proposed approach (0.89 correlation to the gold standard). Especially at the outbreak and early spread (early epidemic stage), the proposed method shows high correlation (0.97 correlation), which outperforms the state-of-the-art methods. This paper describes that Twitter texts reflect the real world, and that NLP techniques can be applied to extract only tweets that contain useful information.
UR - http://www.scopus.com/inward/record.url?scp=80053290412&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=80053290412&partnerID=8YFLogxK
M3 - Conference contribution
AN - SCOPUS:80053290412
SN - 1937284115
SN - 9781937284114
T3 - EMNLP 2011 - Conference on Empirical Methods in Natural Language Processing, Proceedings of the Conference
SP - 1568
EP - 1576
BT - EMNLP 2011 - Conference on Empirical Methods in Natural Language Processing, Proceedings of the Conference
Y2 - 27 July 2011 through 31 July 2011
ER -