Big data metadata classification

O.V. Zakharova

Abstract


Now there are a lot of data of different structure (or not structured at all) and origin, their volumes are growing ex­ponentially. The problem is the existing software and hardware are not able to cope with so many dif­fe­rent types of data appearing with great speed. Big Data has become too complex and dynamic to pro­cess, store, analyze and manage with traditional tools. It caused the appearance of new platforms and ap­pro­a­ches for working with data, and at the same time, an understanding of the fact that to solve big data prob­lems, these raw data must be supplemented with me­ta­data. Metadata in this case is a means of classifying, organizing, and characterizing data and its content. Their main advantage is an ordered structure. Due to it, metadata is readable not only by a person, but also by a computer. Thus, they can be pro­ces­sed auto­ma­ti­cal­ly and used for indexing, searching, com­bining, auto­mated processing, classification of big data, etc. The creation of effective metadata management sys­tems, first of all, requires their coordinated general classification that take into account the types of data sour­ces (methods of their obtaining) that form the con­tent, tasks solved at different stages of the life cycle, existing formats of data presentation, principles of re­a­sonable efficiency, since often metadata size sig­ni­fi­can­tly exceeds the amount of described data (even big). Therefore, the aim of this work is to analyze exis­ting sources of big data, methods for creating and processing the corresponding metadata, as well as software products that allow them to be processed in a certain way, and building the classification of me­ta­da­ta on the basis of the analysis.

Problems in programming 2019; 4: 53-74


Keywords


big data source; metadata managment; Hadoop; metadata classification; metadata analysis; services for processing metadata; creation; reviewing; editing of metadata; metadata of images; metadata of audio files; metadata of video files; Data Warehouse metad

References


https://habr.com/ru/post/93119/

https://www.exif.org/category/specifications

http://exif.org/dcf.PDF

https://helpx.adobe.com/after-effects/using/xmp-metadata.html

https://www.dublincore.org/specifications/dublin-core/dces/

ISO 16684-1:2012, Graphic technology – Extensible metadata platform (XMP) specification – Part 1: Data model, serialization and core properties

https://www.adobe.com/devnet/xmp.html

https://forum.allnokia.ru/viewtopic.php?t=51934

https://habr.com/ru/post/103635/

http://id3.org/id3v2.3.0

http://www.xspf.org/xspf-v0.html

https://www.ibm.com/support/knowledgecenter/ru/SS88XH_1.6.0/iva/ov_metadata.html

https://mediaarea.net/AVIMetaEdit/tech_view_help

https://stackoverflow.com/questions/2075175/is-there-a-standard-schema-for-video-metadata

https://schema.org/

https://studref.com/379466/menedzhment/metadannye_dokumentov

http://prgssr.ru/development/posty-dannye-i-metadannye-v-wintersmith.html#heading-section-1

https://symfony.com.ua/doc/current/components/yaml/yaml_format.html

https://uk.wikipedia.org/wiki/YAML

https://frontender.info/like-able-content-spread-your-message-with-third-party-metadata/

https://www.ixbt.com/soft/audio-tag-editors.shtml

http://ogp.me/

https://hostenko.com/wpcafe/plugins/kak-nastroit-open-graph-i-twitter-karty-dlja-wordpress/

https://blogs.loc.gov/loc/2010/04/how-tweet-it-is-library-acquires-entire-twitter-archive/

https://www.oncrawl.com/oncrawl-seo-thoughts/a-complete-guide-to-twitter-cards/

https://www.datasciencecentral.com/profiles/blogs/importance-of-metadata-in-a-big-data-world

http://iso.ru/ru/press-center/journal/2122.phtml

https://fotoforensics.com/tutorial-meta.php

https://www.exif.org/Exif2-2.PDF

http://www.belursus.info/soft/i.php?c=exiftool

https://webznam.ru/blog/metadannye_fajlov_fotografij/2015-04-01-135

https://helpx.adobe.com/ru/premiere-pro/using/metadata.html

https://www.stopfake.org/metadannye-nevidimaya-informatsiya-o-fotografii/

https://ms.detector.media/ mediaprosvita/how_to/ 13_onlayninstrumentiv_dlya_perevirki_kontentu

https://developers.facebook.com/docs/sharing/webmasters/ optimizing?locale=ru_RU




DOI: https://doi.org/10.15407/pp2019.04.053

Refbacks

  • There are currently no refbacks.