Problems of scaling semantic information resources with a complex structure

J. V. Rogushina, I. Yu. Grishanova

Abstract


We analyze scaling problems arising in modern intelligent information systems (IISs) and classify main reasons for their occurrence in their practical solutions. IISs integrate various elements of artificial intelligence (AI) for acquisition of knowledge relevant to actual user tasks. Important properties of these IISs are use of data with complex structure and orientation on semantic information resources (IRs). Therefore we analyze main features of the Data-Centric AI and opportunities for acquiring domain knowledge in various representations from Big Data. Knowledge organization systems (KOS) provide models and methods for effective store, retrieval and use of information processed by the Web-oriented IISs, and we consider existing approaches for their software platforms.We analyse the specifics of the scaling for systems focused on the semantic information processing and its differences from traditional data and Big Data scaling. This specifics is caused by complexity of data structure, number of various semantic relations between information objects into IR and complexity of semantic queries executed by KOS.

On example of e-VUE – the Wiki-portal of the Great Ukrainian Encyclopedia – we analyze various situations that arise in process of practical development of semantic information resources with large volume and complex structure. Various ways of semantic retrieval into this information resource that use possibilities of the Semantic MediaWiki plugin are considered from the point of view of scaling aspects (such as increase of information objects, their relations and complication of their structure and characteristics). On base of this analysis we generate a set of recommendations aimed at ensuring more efficient development of such resources and their efficient functioning for practical use.

Prombles in programming 2022; 3-4: 171-182


Keywords


semantic information resource; scaling; ontology; Wiki-technology; metadata; semantic markup

References


Data-Centric AI. (2021). The ultimate guide to the new AI paradigm. . Available from: https://resources.kili-technology.com/dcai- ebook-2022. [Accessed: 11.07 2022].

DEMCHENKO Y. & DE LAAT C. (2014) Membrey P. Defining architecture components of the Big Data Ecosystem. In 2014 International Conference on Collaboration Technologies and Systems (CTS), P. 104-112.

CHEN M. & MAO S. & LIU Y. Big data: A survey. Mobile networks and applications, 19(2), 2014, P.171-209.

ROGUSHINA J. (2016) Semantic Wiki resources and their use for the construction of personalized ontologies, CEUR Workshop Proceedings 1631, P.188-195. Available from: http://ceur-ws.org/Vol-1631/188-195.pdf. [Accessed: 11.07 2022]. (in Ukrainian)

SOERGEL D. (2009). Knowledge organization systems: overview. Available from: www.dsoergel.com/UBLIS514DS-08.2a- 1Reading4SoergelKOSOverview.pdf. [Accessed: 07 2015].

HJORLAND B. (2008). What is knowledge organization (KO)? KO Knowledge Organization, 35(2-3), P.86-101. Available from: https://www.researchgate.net/profile/Birger-Hjorland/publication/277803483_What_is_Knowledge_Organization_KO/links/ 55d8232608aed6a199a6afce/What-is-Knowledge-Organization-KO.pdf. [Accessed: 15.07 2022].

HENDLER J. A. & GOLBECK J. (2008). Metcalfe’s law, Web 2.0, and the Semantic Web. Web Sem., 6 (1), P. 14-20.

WAGNER C. (2004). Wiki: A technology for conversational knowledge management and group collaboration The Communications of the Association for Information Systems, 13(1), P.264-289.

VÖLKEL M. & KRÖTZSCH M. & VRANDECIC D. et al. (2006). Semantic wikipedia. Proc.e of the 15th international conference on World Wide Web, P.585-594.

ROGUSHYNA J. (2022) Use of knowledge organization systems based on ontologies in wiki-resources. Problems on Programming , 1, P.23-33. doi.org/10.15407/pp2022.01.23. (in Ukrainian)

DUNNING T. & FRIEDMAN E. AI and Analytics at Scale. Lessons from Real-World Production Systems. 2021. O’Reilly Media. Available from: https://www.oreilly.com/library/view/ai-and-analytics/9781492094388/. [Accessed: 02.07 2022].

BENLACHMI Y. & HSNAOUI M.L. Current State and Challenges of Big Data, 2020, DOI: 10.1007/978-3-030-33103-0.

ANDON P.I. & ROGUSHINA J.V. & GRISHANOVA I.Y. et all. Experience of Semantic Technologies Use for Development of Intelligent Web Encyclopedia. UkrPROG, CEUR Workshoop Proc., 2021, Vol-2866, P.246-259. Available from: http://ceur-ws.org/ Vol-2866/ceur_246-259andon24.pdf. [Accessed: 22.06 2022].




DOI: https://doi.org/10.15407/pp2022.03-04.171

Refbacks

  • There are currently no refbacks.