Automation of solving planimetry problems written in Ukrainian

O.P. Zhezherun, O.R. Smysh

Abstract


The article focuses on developing a software system that solves planimetry problems written in Ukrainian. We discuss current trends and available tools in Ukrainian natural language processing. We present a comprehensive analysis of the different ways a problem can be described, which reveals regularities in the formulation and structure of problem texts. We also demonstrate that problems are worded similarly not only in Ukrainian but also in Belarusian, English, and Russian. The final result of the paper is a system that uses a morphosyntactic analyzer to process a problem's text and provide the answer to it. Ukrainian natural language processing is growing rapidly and showing impressive results, and the recent development of the gold standard annotated corpus for the Ukrainian language opens up significant new possibilities. The created architecture is flexible: new geometric figures and their properties, as well as additional logic, can be added to the program. With minor adaptation, the developed system can be used with other natural languages, such as English, Belarusian, or Russian, because the text-processing algorithm is universal owing to the globally accepted conventions for presenting such mathematical problems. Therefore, further development of the system is possible.

Problems in programming 2020; 4: 71-80


Keywords


tokenization; lemmatization; part-of-speech tagging; text segmentation; information extraction; annotated corpus

References


Reynar J. C. A Maximum Entropy Approach to Identifying Sentence Boundaries. Reynar Jeffrey – Philadelphia, Pennsylvania, USA.

Sarawagi S. Information Extraction / Sarawagi Sunita – Mumbai. 2008.

ZNO z matematyky: osoblyvosti testu 2020 roku. (2019). Retrieved June 12, 2020, from https://osvita.ua/test/training/5017/

Kazakow V.U. Geometryja. Minsk: Narodnaja asveta. 2017.

Geometriya: 7–9-e klassy: uchebnik dlya obshcheobrazovatelnykh uchrezhdeniy. 2010.

Alexander D.C., Koeberlein G.M. Elementary Geometry for College Students. Belmont. (5).

Velykyi elektronnyi slovnyk ukrainskoi movy (VESUM). (2017, November 30). GitHub. Retrieved June 12, 2020, from https://github.com/brown-uk/dict_uk/blob/master/doc/announcement.md

Korobov M. Morphological analyzer and generator for Russian and Ukrainian languages. Communications in Computer and Information Science. 2015. P. 320–332.

Zolotyi morfosyntaksovyi standart. Laboratoriia ukrainskoi. Retrieved June 12, from https://mova.institute/золотий_стандарт.

Straka M. Tokenizing, POS Tagging, Lemmatizing and Parsing UD 2.0 with UDPipe / Straka Milan – Vancouver. 2017.




DOI: https://doi.org/10.15407/pp2020.04.071
