About one approach to automatic creation of formal queries to ontological knowledge bases
Abstract
The article develops an approach that includes the analysis of short natural language messages in Ukrainian and the automatic generation of queries in SPARQL and Cypher based on them. The Apache Jena Fuseki server is used as a SPARQL query processing tool, and the Neo4J graph database is used as a data warehouse or ontological knowledge base. The latter is the most common open source database, highperformance and well-scalable, i.e., capable of working with large amounts of data. In addition, approaches to building formal queries based on natural language queries for Cypher are little known and require further development. The approach is based on the fact that a user's natural language query is subjected to a series of sequential checks. Their results determine the set of semantic types expressed in the phrase (natural language query) and the corresponding concepts that define them. The result of these checks is a set of four values – the codes of the check results, as well as the subjects and predicates, if present. This information is enough to select a set of basic templates for formal queries. Based on the results of such basic checks, the main basic templates for generating the final request are created. The proposed approach has a basic query template aimed at obtaining information of a certain type in a given form, as well as additional modifier templates that optionally construct query strings in the corresponding blocks of the main query by introducing additional conditions. The article describes the process of automatic generation of SPARQL queries to a contextual ontology using the example of a knowledge base of medical articles from peer-reviewed open access journals. The peculiarity of the approach is that the formal query is automatically built from blocks of templates (main and auxiliary), which are customizable in accordance with certain semantic categories present in the analyzed text and the entities that specify them.
Prombles in programming 2024; 2-3: 326-333
Keywords
Full Text:
PDFReferences
Gomez-Perez A., Fernandez-Lopez M., Corcho O. Ontological Engineering. Advanced Information and Knowledge Processing. Springer-Verlag, London, 1 edition, 2004.
Studer R. Staab S., editor. Handbook on Ontologies. Springer Berlin Heidelberg, Berlin, Heidelberg, 2 editions, 2009.
OntoChatGPT information system: Ontologydriven structured prompts for ChatGPT metalearning / Palagin O., Kaverinskiy V., Litvin A., Malakhov K. International Journal of Computing, 22(2):170–183, July 2023.
Ford M. Rule of the Robots: How Artificial Intelligence Will Transform Everything. Basic Books, New York, first edition, 2021.
Malakhov, K., Petrenko, M., Cohn, E. (2023). Developing an ontology based system for semantic processing of scientific digital libraries. South African Computer Journal, 2023 35(1), 19–36.
Proceedings of the 13th International Scientific and Practical Programming Conference UkrPROG 2022. Kyiv, Ukraine, October 11–12, 2022.
M.G. Petrenko, O.V. Palagin, M.O. Boyko, S.M. Matveyshyn. Knowledge-Oriented Tool Complex for Developing Databases of Scientific Publications and Taking into account Semantic Web Technology. Control Systems and Computers, 2022, Issue 3 (299), pp. 11–28.
Goel A. Neo4J Cookbook. Birmingham: Pact Publishing Ltd. May 28, 2015., 1st Edition. 226p.
Bob DuCharme. Learning SPARQL. Querying and Updating with SPARQL 1.1 (Second edition), O’Reilly Media, All rights reserved, August 20 3. 367р.
Khan A. Knowledge graphs querying. ACM SIGMOD Record. 2023. Vol. 52, N 2. pp. 18–29.
A Dictionary of Computer Science. Oxford University Press. Seventh edition. Editors for this edition Andrew Butterfield BA, BAI, PhD, Gerard Ekembe Ngondi BEng, MSc. 1211 p.
A. Litvin, V. Velychko, and V. Kaverinsky. A new approach to automatic ontology creation from the untagged text on the natural language of inflective type, Proceedings of the International conference on software engineering "Soft Engine 2022", NAU, Kyiv Ukraine, 2022, pp. 37– 45.
A. Litvin, V. Velychko, and V. Kaverinsky. A New Approach to Automatic Ontology Generation from the Natural Language Texts with Complex Inflection Structures in the Dialogue Systems Development, CEUR Workshop Proceedings, 2023, Vol. 3501. pp. 172–185.
Kaverinsky, V., Malakhov, K. Natural Language-Driven Dialogue Systems for Support in Physical Medicine and Rehabilitation, South African Computer Journal, 2023, Vol. 35, No. 2, pp. 119 – 126.
A. Litvin, V. Velychko, and V. Kaverinsky. Method of information obtaining from ontology on the basis of a natural language phrase analysis, in: CEUR Workshop Proceedings, CEUR-WS, Kyiv, Ukraine, 2020: pp. 323–330.
O.V. Palagin, M.G. Petrenko, S. Yu. Svitla, V.YU. Velychko. About one approach to analyzing and understanding natural language objects. Computer tools, networks and systems. 2008, №7. pp. 128–137.
O. Palagin, V. Kaverinsky, A. Litvin, and K. Malakhov. Ontology driven development of dialogue systems, South African Computer Journal. – Vol. 35. No. 1. – 2023. – pp. 37 – 62.
Petrenko, N.G. Computer ontologies and ontology-driven architecture of information systems. Book "Information Models of Knowledge", ITHEA, Kiev, Ukraine – Sofia, Bulgaria, 2010, pp. 86–92.
Kurgaev, A. F., & Petrenko, M. G. (1995). Processor structure design. Cybernetics and Systems Analysis, 31(4), pp. 618–625.
Petrenko, M., & Sofiyuk, A. (2003). On one approach to the transfer of an information structures interpreter to PLD-implementation. Upravlyayushchie Sistemy i Mashiny, 188(6), pp. 48–57.
Petrenko, M., & Kurgaev, A. (2003). Distinguishing features of design of a modern circuitry type processor. Upravlyayushchie Sistemy Mashiny, 187(5), 16–19.
Alexander V. Palagin, Mykola N. Petrenko. Methodological Foundations for Development, Formation and IT-support of Transdisciplinary Research // Journal of Automation and Information Sciences, Volume 50, 2018, Issue 10, pp. 1–17.
Inefuku, H., Malakhov, K., Cohn, E. R., & Collister, L. B. (2023). Service Diversification, Connections, and Flexibility in Library Publishing: Rapid Publication of Research from Ukraine in Wartime. Case Studies in Library Publishing, 1(1).
Malakhov, K. S. (2023a). Insight into the Digital Health System of Ukraine (eHealth): Trends, Definitions, Standards, and Legislative Revisions. International Journal of Telerehabilitation, 15(2), 1–21.
Malakhov, K. S. (2023b). Letter to the Editor – Update from Ukraine: Development of the Cloud-based Platform for Patient-centered Telerehabilitation of Oncology Patients with Mathematical-related Modeling. International Journal of Telerehabilitation, 15(1), 1–3.
Palagin, O. V., Malakhov, K. S., Velychko, V. Yu., & Semykopna, T. V.(2022). Hybrid erehabilitation services: SMART-system for remote support of rehabilitation activities and services. International Journal of Telerehabilitation, Special Issue: Research Status Report – Ukraine, 1-16.
Malakhov, K. (2022). Letter to the Editor – Update from Ukraine: Rehabilitation and Research. International Journal of Telerehabilitation, 14(2), 1–2.
O. Palagin, V. Kaverinsky, M. Petrenko, and K. Malakhov, Digital Health Systems: Ontology-based Universal Dialog Service for Hybrid E rehabilitation Activities Support, The 12th IEEE International Conference on Intelligent Data Acquisition and Advanced Computing Systems: Technology and Applications. – Dortmund, Germany, 2023. – pp. 84 –89.
Refbacks
- There are currently no refbacks.