TY - GEN
T1 - Filtering Documents for Plagiarism Detection
AU - Baba, Kensuke
N1 - Publisher Copyright:
© 2018, Springer Nature Switzerland AG.
PY - 2018
Y1 - 2018
N2 - Efficient methods are required for plagiarism detection. This paper proposes a fast and scalable method for detecting “copy and paste”-type plagiarism in documents. Implementing detection methods for this type of plagiarism requires a long processing time or a large database for comprehensive matching of ordered word occurrences. The author improved the scalability of an existing fast method based on fast Fourier transform using the idea of the frequency domain filtering. He evaluated the effect of the improvement on accuracy of the plagiarism detection method, and achieved an effective trade-off between the accuracy and the required size of database.
AB - Efficient methods are required for plagiarism detection. This paper proposes a fast and scalable method for detecting “copy and paste”-type plagiarism in documents. Implementing detection methods for this type of plagiarism requires a long processing time or a large database for comprehensive matching of ordered word occurrences. The author improved the scalability of an existing fast method based on fast Fourier transform using the idea of the frequency domain filtering. He evaluated the effect of the improvement on accuracy of the plagiarism detection method, and achieved an effective trade-off between the accuracy and the required size of database.
KW - Fast Fourier transform
KW - Filtering
KW - Plagiarism detection
KW - Text processing
KW - Vector representation of words
UR - http://www.scopus.com/inward/record.url?scp=85055782564&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85055782564&partnerID=8YFLogxK
U2 - 10.1007/978-3-030-01771-2_23
DO - 10.1007/978-3-030-01771-2_23
M3 - Conference contribution
AN - SCOPUS:85055782564
SN - 9783030017705
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 361
EP - 372
BT - Discovery Science - 21st International Conference, DS 2018, Proceedings
A2 - Ceci, Michelangelo
A2 - Soldatova, Larisa
A2 - Vanschoren, Joaquin
A2 - Papadopoulos, George
PB - Springer Verlag
T2 - 21st International Conference on Discovery Science, DS 2018
Y2 - 29 October 2018 through 31 October 2018
ER -