Preview

REALIZATION OF TRAINING PROGRAMME ON THE BASIS OF LINGUISTIC DATABASE FOR AUTOMATIC TEXTS PROCESSING SYSTEM

Abstract

Due to the constant increasing of electronic textual information, modern society needs for the automatic processing of natural language (NL). The main purpose of NL automatic text processing systems is to analyze and create texts and represent their content. The purpose of the paper is the development of linguistic and software bases of an automatic system for processing English publicistic texts. This article discusses the examples of different approaches to the creation of linguistic databases for processing systems. The author gives a detailed description of basic building blocks for a new linguistic processor: lexical-semantic, syntactical and semantic-syntactical. The main advantage of the processor is using special semantic codes in the alphabetical dictionary. The semantic codes have been developed in accordance with a lexical-semantic classification. It helps to precisely define semantic functions of the keywords that are situated in parsing groups and allows the automatic system to avoid typical mistakes. The author also represents the realization of a developed linguistic database in the form of a training computer program.

About the Author

M. A. Makarych
Belarusian National Technical University
Belarus
Makarych Marina -  PhD in Applied and mathematical linguistics, Associate Professor of the 2nd English Department 


References

1. R. G. Piotrovsky, Automatic text analysis and synthesis methods. Minsk: Vyshejshaya shkola, 1985.– 222 p.

2. A. V. Vorontsov, Industrial implementation of a system for lexical and grammatical analysis of text documents. J. Vestn. MSLU, Vol. 1(26), pp. 189–203, 2007.

3. D. Jurafsky, Speech and Language Processing: An Introduction to Natural Language Processing, Computational Linguistics, and Speech Recognition. New Jersey: Prentice Hall, 2000. – 934 p.

4. J. Todhunter, I. Sovpel, D. Pastanohau, Semantic processor for recognition of whole-part relations in natural language documents: US Patent Appl. 20070156393; Intention Machine Corp. – Serial no. 686660; Series code 11; Filed 15.03.2007.

5. N. N. Leonteva, Automatic text interpretation: Systems, models, resources: Handbook for students of linguistic faculties. Moscow: Akademy, 2006–304 с.

6. I. V. Sovpel, Automatic recognition of basic knowledge in text. J. Artificial intelligence, Vol. 3, pp. 328–332, 2007.

7. A. V. Zubov, A semantic and syntactic language for text entry in computer memory. Functioning and development of language systems: Collection of scientific papers. Minsk: Vyshejshaya shkola, pp. 110–117,1985.

8. M. V. Makarych, Automatic system for creating a table abstract of texts. Germany: LAP LAMBERT Academic Publishing, 2012–145 p.


Review

For citations:


Makarych M.A. REALIZATION OF TRAINING PROGRAMME ON THE BASIS OF LINGUISTIC DATABASE FOR AUTOMATIC TEXTS PROCESSING SYSTEM. «System analysis and applied information science». 2016;(1):78-83. (In Russ.)

Views: 904


Creative Commons License
This work is licensed under a Creative Commons Attribution 4.0 License.


ISSN 2309-4923 (Print)
ISSN 2414-0481 (Online)