ALGORITHMIC AND PROGRAM IMPLEMENTATION OF THE PLAGIARISM DEFINITION IN LEARNING MANAGEMENT SYSTEMS
https://doi.org/10.21122/2309-4923-2018-1-71-78
Abstract
The main advantage of information technologies in education, faster and simpler information exchange, is also a drawback, because it gives rise to the problem of plagiarism. The purpose of this paper is to develop software that checks texts for uniqueness in learning management systems. Achieving this goal requires solving a range of problems: choosing a method for detecting plagiarism, expressing it as an algorithm, and implementing it in software. The paper considers the shingle and super-shingle methods, signature methods, vector models of text representation, and cluster analysis of text information. The authors propose a modification of the vector model that improves the accuracy of detecting similar documents by building an N-list for each document separately. The documents are then compared pairwise, with the image of one document formed relative to the N-list of the other. Thus, the i-th row of the similarity matrix holds the similarity coefficients of all considered documents relative to the i-th document. The proposed modification also speeds up the calculation, since there is no need to search for terms common to all documents. To check a large number of students' works for plagiarism, the authors propose a cluster approach. Its application showed that finding duplicates for one document takes the same time as finding them for all documents in the sample. In that same time, all groups of matching student works can be obtained. Thus, using cluster analysis of text information to detect plagiarism significantly saves both the teacher's time and computing resources. The proposed algorithms are implemented as a web service in Java.
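The per-document N-list and the row-wise similarity matrix described above can be illustrated with a minimal Java sketch. It is not the authors' implementation: the abstract does not define the N-list or the similarity coefficient, so the sketch assumes the N-list is the N most frequent terms of a document and uses cosine similarity between raw term-frequency vectors. The class name `SimilarityMatrixSketch` and all method names are hypothetical.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

final class SimilarityMatrixSketch {

    // Count how often each lowercase word occurs in the text.
    static Map<String, Integer> termFrequencies(String text) {
        Map<String, Integer> tf = new HashMap<>();
        for (String token : text.toLowerCase().split("[^\\p{L}\\p{Nd}]+")) {
            if (!token.isEmpty()) {
                tf.merge(token, 1, Integer::sum);
            }
        }
        return tf;
    }

    // ASSUMPTION: the N-list is the n most frequent terms of one document
    // (ties broken alphabetically so the result is deterministic).
    static List<String> nList(Map<String, Integer> tf, int n) {
        List<String> terms = new ArrayList<>(tf.keySet());
        terms.sort((a, b) -> {
            int byFrequency = Integer.compare(tf.get(b), tf.get(a));
            return byFrequency != 0 ? byFrequency : a.compareTo(b);
        });
        return new ArrayList<>(terms.subList(0, Math.min(n, terms.size())));
    }

    // Image of a document relative to an N-list: its frequencies for those terms only.
    static double[] image(Map<String, Integer> tf, List<String> nList) {
        double[] vector = new double[nList.size()];
        for (int k = 0; k < vector.length; k++) {
            vector[k] = tf.getOrDefault(nList.get(k), 0);
        }
        return vector;
    }

    // ASSUMPTION: cosine similarity is used as the similarity coefficient.
    static double cosine(double[] a, double[] b) {
        double dot = 0, normA = 0, normB = 0;
        for (int k = 0; k < a.length; k++) {
            dot += a[k] * b[k];
            normA += a[k] * a[k];
            normB += b[k] * b[k];
        }
        return (normA == 0 || normB == 0) ? 0.0 : dot / Math.sqrt(normA * normB);
    }

    // Row i holds the similarity of every document relative to the N-list of document i.
    static double[][] similarityMatrix(List<String> documents, int n) {
        List<Map<String, Integer>> tfs = new ArrayList<>();
        for (String document : documents) {
            tfs.add(termFrequencies(document));
        }
        double[][] matrix = new double[documents.size()][documents.size()];
        for (int i = 0; i < documents.size(); i++) {
            List<String> list = nList(tfs.get(i), n);
            double[] reference = image(tfs.get(i), list);
            for (int j = 0; j < documents.size(); j++) {
                matrix[i][j] = cosine(reference, image(tfs.get(j), list));
            }
        }
        return matrix;
    }

    public static void main(String[] args) {
        List<String> documents = List.of(
                "the cat sat on the mat",
                "the cat sat on the mat",
                "quantum physics lecture notes");
        for (double[] row : similarityMatrix(documents, 3)) {
            System.out.println(java.util.Arrays.toString(row));
        }
    }
}
```

Because each row is computed against a different N-list, the matrix in this sketch is not necessarily symmetric, and no vocabulary shared by all documents is ever built, which is the source of the speed-up the abstract mentions.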
About the Authors
Y. B. Popova
Belarus
Yuliya B. Popova – PhD, Associate Professor of the Software Department
A. V. Goloburda
Belarus
Alexander Goloburda - Maste
For citations:
Popova Y.B., Goloburda A.V. ALGORITHMIC AND PROGRAM IMPLEMENTATION OF THE PLAGIARISM DEFINITION IN LEARNING MANAGEMENT SYSTEMS. «System analysis and applied information science». 2018;(1):71-78. (In Russ.) https://doi.org/10.21122/2309-4923-2018-1-71-78