On the issue of automating the workflow design based on algebra-algorithmic and ontological tools

O.M. Ovdii

Abstract


The article presents the concept of a system for automated workflow design based on algebra-algorithmic and ontological tools. The previously developed applied ontology for program design is extended with new concepts related to workflow design. Existing approaches are analyzed, and an architecture for the system under development is proposed. The operation of the system is illustrated by the development of an Apache Oozie workflow for analyzing large amounts of data on the distributed Apache Hadoop platform. It is shown that combining ontological and algebra-algorithmic tools provides significant potential for adapting, optimizing, integrating and modifying workflows. In addition, the high level of abstraction these tools provide makes the designed workflows easier for both people and software agents to understand, which simplifies teamwork, workflow sharing and reuse. A quantitative assessment of the proposed tools indicates that they can significantly increase development productivity and reduce labor costs.

Problems in programming 2019; 1: 37-47


Keywords


workflow; ontology; algebra of algorithms; program design and synthesis; distributed computing

DOI: https://doi.org/10.15407/pp2019.01.037