An approach to estimating cited sentences in academic papers using Doc2vec

Shunsuke Tanabe, Atsuhiro Takasu, Manabu Ohta, Jun Adachi

Research output

Abstract

Most academic authors refer to the literature when introducing their proposed methods and the data used in their experiments. These references can be very helpful when trying to understand a paper; however, authors do not always state clearly which part of the referenced work they are pointing the reader to, and reading the whole document to identify the relevant information can be quite labor-intensive. In this paper, we propose a method for estimating the appropriate parts of a referenced work as the “cited parts,” with the aim of reducing this burden. We first extract the sentences in an academic paper that cite the literature as “citing sentences.” We then vectorize the citing sentences and all the sentences in the cited papers using doc2vec, and estimate the most appropriate cited part as the sentence whose feature vector is most similar to that of the citing sentence. To evaluate the proposed method, we conducted experiments using English-language papers and a questionnaire survey that asked subjects to evaluate the appropriateness of the cited parts estimated by the method. The experiments showed that the method’s success in estimating appropriate cited parts depended on the citation intention of the citing sentences.
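
The selection step described in the abstract can be illustrated with a minimal sketch. It assumes the sentence vectors have already been produced by a doc2vec model (for example, gensim's `Doc2Vec.infer_vector` is one way to obtain them) and that "most similar" means highest cosine similarity; the abstract does not name the similarity measure, so that choice is an assumption. The function names and the toy vectors are hypothetical and only stand in for real embeddings.

```python
import math
from typing import List, Sequence, Tuple

Vector = Sequence[float]


def cosine_similarity(a: Vector, b: Vector) -> float:
    """Cosine of the angle between two vectors; 0.0 if either has zero norm."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def estimate_cited_sentence(
    citing_vec: Vector, cited_vecs: List[Vector]
) -> Tuple[int, float]:
    """Return (index, similarity) of the cited-paper sentence whose vector
    is most similar to the citing sentence's vector."""
    if not cited_vecs:
        raise ValueError("the cited paper has no sentence vectors")
    scores = [cosine_similarity(citing_vec, v) for v in cited_vecs]
    best = max(range(len(scores)), key=scores.__getitem__)
    return best, scores[best]


# Toy vectors standing in for doc2vec output (assumption: 3 dimensions
# purely for readability; real models use far more).
citing = [0.9, 0.1, 0.0]
cited = [
    [0.0, 1.0, 0.0],  # sentence 0 of the cited paper
    [1.0, 0.0, 0.0],  # sentence 1
    [0.5, 0.5, 0.5],  # sentence 2
]
best_index, best_score = estimate_cited_sentence(citing, cited)
print(best_index, round(best_score, 3))
```

In this toy case the second sentence of the cited paper (index 1) would be reported as the estimated cited part, since its vector points in nearly the same direction as the citing sentence's vector.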

Original language: English
Title of host publication: MEDES 2018 - 10th International Conference on Management of Digital EcoSystems
Publisher: Association for Computing Machinery, Inc
Pages: 118-125
Number of pages: 8
ISBN (electronic): 9781450356220
DOI
Publication status: Published - Sep 25 2018
Event: 10th International Conference on Management of Digital EcoSystems, MEDES 2018 - Tokyo
Duration: Sep 25 2018 to Sep 28 2018

Publication series

Name: MEDES 2018 - 10th International Conference on Management of Digital EcoSystems

Other

Other: 10th International Conference on Management of Digital EcoSystems, MEDES 2018
Country/Territory: Japan
City: Tokyo
Period: 9/25/18 to 9/28/18

ASJC Scopus subject areas

  • Computer Graphics and Computer-Aided Design
  • Computer Networks and Communications
  • Environmental Engineering
