Analysis of formal models and standards for structured electronic documents in corporate information systems

A.V. Sharypanov, I.E. Shchetynin, S.M. Ivanov

Abstract


The continuing acceleration of information technology development, and the resulting changes in the typical business processes of organizations, add new complexity to electronic document flow when modern corporate information systems are built. The main task in mapping structured documents to electronic form is to define a universal document markup mechanism and to identify the specific information fragments of a document so that software applications can interpret them correctly. The paper gives a comparative overview of the existing languages for describing structured electronic documents and of the standards on which they are based. It identifies the advantages and disadvantages of using them for the typical electronic document flow tasks that arise in developing corporate information systems.

Problems in programming 2018; 1: 128-146


Keywords


structured electronic document; electronic document flow; corporate information system

DOI: https://doi.org/10.15407/pp2018.01.128
