Problems of scaling semantic information resources with a complex structure

J.V. Rogushina, I. Yu. Grishanova

Abstract


We analyze scaling problems arising in modern intelligent information systems (IISs) and classify main reasons for their occurrence in their practical solutions. IISs integrate various elements of artificial intelligence (AI) for acquisition of knowledge relevant to actual user tasks. Important properties of these IISs are use of data with complex structure and orientation on semantic information resources (IRs). Therefore we analyze main features of the Data-Centric AI and opportunities for acquiring domain knowledge in various representations from Big Data. Knowledge organization systems (KOS) provide models and methods for effective store, retrieval and use of information processed by the Web-oriented IISs, and we consider existing approaches for their software platforms.We analyse the specifics of the scaling for systems focused on the semantic information processing and its differences from traditional data and Big Data scaling. This specifics is caused by complexity of data structure, number of various semantic relations between information objects into IR and complexity of semantic queries executed by KOS.

On example of e-VUE – the Wiki-portal of the Great Ukrainian Encyclopedia – we analyze various situations that arise in process of practical development of semantic information resources with large volume and complex structure. Various ways of semantic retrieval into this information resource that use possibilities of the Semantic MediaWiki plugin are considered from the point of view of scaling aspects (such as increase of information objects, their relations and complication of their structure and characteristics). On base of this analysis we generate a set of recommendations aimed at ensuring more efficient development of such resources and their efficient functioning for practical use.

Prombles in programming 2022; 3-4: 171-182


Keywords


semantic information resource; scaling; ontology; Wiki-technology; metadata; semantic markup

References


Data-Centric AI. (2021). The ultimate guide to the new AI paradigm. . Available from: https://resources.kili-technology.com/dcai-ebook-2022. [Accessed: 11.07 2022].

DEMCHENKO Y. & DE LAAT C. (2014) Membrey P. Defining architecture components of the Big Data Ecosystem. In 2014 International Conference on Collaboration Technologies and Systems (CTS), P. 104-112. CrossRef

CHEN M. & MAO S. & LIU Y. Big data: A survey. Mobile networks and applications, 19(2), 2014, P.171-209. CrossRef

ROGUSHINA J. (2016) Semantic Wiki resources and their use for the construction of personalized ontologies, CEUR Workshop Proceedings 1631, P.188-195. Available from: http://ceur-ws.org/Vol-1631/188-195.pdf. [Accessed: 11.07 2022]. (in Ukrainian) CrossRef

SOERGEL D. (2009). Knowledge organization systems: overview. Available from: www.dsoergel.com/UBLIS514DS-08.2a- 1Reading4SoergelKOSOverview.pdf. [Accessed: 07 2015].

HJORLAND B. (2008). What is knowledge organization (KO)? KO Knowledge Organization, 35(2-3), P.86-101. Available from: https://www.researchgate.net/profile/Birger-Hjorland/publication/277803483_What_is_Knowledge_Organization_KO/links/ 55d8232608aed6a199a6afce/What-is-Knowledge-Organization-KO.pdf. [Accessed: 15.07 2022]. CrossRef

HENDLER J. A. & GOLBECK J. (2008). Metcalfe's law, Web 2.0, and the Semantic Web. Web Sem., 6 (1), P. 14-20. CrossRef

WAGNER C. (2004). Wiki: A technology for conversational knowledge management and group collaboration The Communications of the Association for Information Systems, 13(1), P.264-289. CrossRef

VÖLKEL M. & KRÖTZSCH M. & VRANDECIC D. et al. (2006). Semantic wikipedia. Proc.e of the 15th international conference on World Wide Web, P.585-594.

CrossRef

ROGUSHYNA J. (2022) Use of knowledge organization systems based on ontologies in wiki-resources. Problems on Programming , 1, P.23-33. doi.org/10.15407/pp2022.01.23. (in Ukrainian)

DUNNING T. & FRIEDMAN E. AI and Analytics at Scale. Lessons from Real-World Production Systems. 2021. O'Reilly Media. Available from: https://www.oreilly.com/library/view/ai-and-analytics/9781492094388/. [Accessed: 02.07 2022].

BENLACHMI Y. & HSNAOUI M.L. Current State and Challenges of Big Data, 2020, CrossRef

ANDON P.I. & ROGUSHINA J.V. & GRISHANOVA I.Y. et all. Experience of Semantic Technologies Use for Development of Intelligent Web Encyclopedia. UkrPROG, CEUR Workshoop Proc., 2021, Vol-2866, P.246-259. Available from: http://ceur-ws.org/ Vol-2866/ceur_246-259andon24.pdf. [Accessed: 22.06 2022].




DOI: https://doi.org/10.15407/pp2022.03-04.171

Refbacks

  • There are currently no refbacks.