Application of ontological analysis for metadata processing in the interpretation of BIG DATA at the semantic level
Abstract
The paper considers the main aspects of modern technologies applied for knowledge analysis to obtain information from Big Data. The analysis of the current state of research in this area shows that background knowledge subject areas of user interest represented by domain ontologies can be used both in order to effectively analysis of information acquried from certain sets of Big Data, and to make this acquisition more useful. With the help of such ontologies, users can formally describe the scope of their information needs, define the structure of the required information objects and explicitly highlight critical for current task domain aspects. Subject of rocessing in the semantics analysis of Big Data is their metadata usually represented by unstructured natural language text. We need to standardize the representation of meta-descriptions wit use of appropriate ontologies that determine the structure and content of individual elements of metadata.
Problems in programming 2020; 4: 55-70
Keywords
Full Text:
PDF (Українська)References
Metadata. - https://uk.wikipedia.org/wiki/Метадані
Dublin Core Metadata Initiative. DCMI TYPE Vocabulary.- http://dublincore.org/documents/demitype-vocabulary. (in Ukrainian)
Reznichenko V.A., Zakharova O.V., Zakharova E.G. Electronic libraries: information resources and services. Problems in programming. 2005. № 4. P.60-72. (in Ukrainian)
Berners-Lee T., Hendler J., Lassila O. The semantic web. Scientific american. 2001. 284(5). P. 34-43.
Dunsire G., Willer M. Standard library metadata models and structures for the Semantic Web. Library hi tech news. 2011. CrossRef
Kogalovsky M.R. Metadata, their properties, functions, classification and presentation means. Proc. of the 14th All-Russian Scientific Conference "Digital Libraries: Promising Methods and Technologies, Electronic Collections" - RCDL-2012, 2012. http:ceur-ws.org/Vol-934/paper3.pdf. (in Russian)
Grotschel M., Lugger J. Scientific Informa¬tion System and Metadata. Konrad-Zuse-Zentrum fur Informationstechnik. Berlin. http://www.zib.de/ groetschel/pubnew/paper/groetschelluegger 1999.pdf
Halshofer B., Klas W. A Survey of Techni¬ques for Achieving Metadata Interoperability. ACM Computing Surveys. 2010. Vol. 42. No. 2. Article 7. CrossRef
Taylor C. An Introduction to Metadata. The University of Queensland, Australia. http://www.libraty.uq.edu.au/papers/ctmeta4.html
Lagose C. Metadata for the Web. Cornell University. CS 431 - March 2. 2005.
Feng L., Brussee R., Blanken H., Veenstra M. Languages for Metadata. In: Multimedia Retrieval. Data-Centric Systems and Applications, Springer, 23-51. http://www.springerlink.com/ content/m276p88003533q86/. CrossRef
Jeusfeld M.A. Metadata. In: Encyclopedia of Database Systems, Springer. 2009. Р. 1723- 1724. http ://www. springerlink.com/content/ h241167167r35055/. CrossRef
Corcho O. Ontology based document annotation: trends and open research problems. Intern. Journal of Metadata, Semantics and Ontologies. 2006. Vol. 1. Is. 1. http://www.dia.fi.upm.es/~ocorcho/documents/IJMSO2006_Corcho.pdf CrossRef
Gladun A., Rogushina J. Repositories of ontologies as a means of knowledge reuse for recognition of information objects. Ontology of design. 2013. N 1 (7). P. 35-50. (in Russian)
Overbeek J. F. Meta Object Facility (MOF): investigation of the state of the art. 2006. http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.96.4092&rep=rep1&type=pdf.
OWL Web Ontology Language. Overview. W3C Recommendation: W3C, 2009. - http://www.w3.org/TR/owl-features/.
Kobelev A.E., Vyazilov E.D. Modern approaches to metadata creating. Modern problems of remote sensing of the Earth from space. 2010. 7 (4). P. 194-203. http://d33.infospace.ru/d33_conf/sb2010t4/194-203.pdf. (in Ukrainian)
Unstructured_data. - https://en.wikipedia.org/ wiki/Unstructured_data.
ROGUSHINA J. (2019) Means and methods of unstructured data analysis. // Problems in programming, N 1, P. 57-77. http://pp.isofts.kiev.ua/ojs1/article/view/348/346. (in Ukrainian)
Andon P., Rogushina J., Grishanova I., Reznichenko V., Kyrydon A., Aristova A., Tyschenko A. (2020) Experience of the semantic technologies use for intelligent Web encyclopedia creation (on example of the Great Ukrainian Encyclopedia portal). Problems in programming, N 2-3. P. 246-258. (in Ukrainian) CrossRef
Rogushina J. Use of Semantic Similarity Estimates for Unstructured Data Analysis CEUR Vol-2577, Selected Papers of the XIX International Scientific and Practical Conference "Information Technologies and Security" (ITS 2019). Kyiv. 2019. P. 246-258. http://ceur-ws.org/Vol-2577/ paper20.pdf.
Demchenko Y., De Laat C., Membrey P. Defining architecture components of the Big Data Ecosystem. In 2014 International Conference on Collaboration Technologies and Systems (CTS). 2014. P. 104-112. CrossRef
Smith K., Seligman L., Rosenthal A., Kurcz C., Greer M., Macheret C., Eckstein A. "Big Metadata" The Need for Principled Metadata Management in Big Data Ecosystems. Proceedings of Workshop on Data analytics in the Cloud. 2014. P. 1-4). CrossRef
Dey A., Chinchwadkar G., Fekete A., Ramachandran K. Metadata-as-a-service. 31st IEEE International Conference on Data Engineering Workshops. 2015. P. 6-9. CrossRef
Chen M., Mao S., Liu Y. Big data: A survey. Mobile networks and applications. 2014. 19(2). P. 171-209. CrossRef
Rogushina J., Gladun A., Pryima S. Use of Ontologies for Metadata Records Analysis in Big Data. Selected Papers of the XVIII International Scientific and Practical Conference "Information Technologies and Security" (ITS 2018). CEUR Vol-2318. http://ceur-ws.org/Vol-2318/paper5.pdf.
ISO 15489-1:2016 Information and documentation - Records management - Part 1: Concepts and principles.
ISO 15836-1:2017 Information and documentation - The Dublin Core metadata element set - Part 1: Core elements.
ISO 15836-2:2019 Information and documentation - The Dublin Core metadata element set - Part 2: DCMI Properties and classes.
DSTU ISO 15489-1: 2018 Information and documentation. Records management. Part 1. Concepts and principles (ISO 15489-1: 2016, IDT). (in Ukrainian)
DSTU ISO 15836-1: 2018 Information and documentation. Dublin Core Metadata Element Set. Part 1. Basic elements (ISO 15836-1: 2017, IDT). (in Ukrainian)
Weibel S.L., Koch T. The Dublin core metadata initiative. D-lib magazine. 2000. 6(12). P. 1082-9873. CrossRef
Rogushina J. The use of thesauri to search for complex Web information objects based on ontologies. Problems of programming. 2019. № 4, P. 11-27. (in Ukrainian) CrossRef
Gladun A., Rogushina J. Semantic technologies: principles and practices. 2016. Kyiv. ADEF-Ukraine. 308 p. (in Ukrainian)
Gladun A., Rogushina J. Data Mining: search for knowledge in data. 2016. Kyiv. ADEF-Ukraine. 452 p. (in Ukrainian)
Nigro H.O. ed. Data Mining with Ontologies: Implementations, Findings, and Frameworks: Implementations, Findings, and Frameworks. IGI Global. 2007. 289 p. CrossRef
Kosala R., Blockeel H. Web mining research: A survey. ACM Sigkdd Explorations Newsletter. 2000. 2(1). P. 1-15. https://arxiv.org/pdf/cs/0011033.pdf CrossRef
Berry M. W., Castellanos M. Survey of text mining. Survey of Text Mining:Clustering, Classification, and Retrieval. Computing Reviews. 2007. 45(9). P. 548. CrossRef
Krötzsch M., Vrandečić D., Völkel M. Semantic MediaWiki. International Semantic Web Conference. 2006. Р. 935-942. https://link.springer.com/content/pdf/10.1007/11926078_68.pdf. CrossRef
MediaWiki. URL: https://www.mediawiki.org/wiki/MediaWiki.
Rogushina J. Analysis of Automated Matching of the Semantic Wiki Resources with Elements of Domain Ontologies. International Journal of Mathematical Sciences and Computing (IJMSC). 2017. Vol. 3. N 3. P. 50-58. URL: http://www.mecs-press.org/ijmsc/ijmsc-v3-n3/IJMSC-V3-N3-5.pdf. CrossRef
DOI: https://doi.org/10.15407/pp2020.04.055
Refbacks
- There are currently no refbacks.