JUCS - Journal of Universal Computer Science 27(4): 387-412, doi: 10.3897/jucs.67046
Integration Model between Heterogeneous Data Services in a Cloud
expand article infoMarcelo Aires Vieira, Elivaldo Lozer Fracalossi Ribeiro§, Daniela Barreiro Claro|, Babacar Mane
‡ Federal University of Bahia, Salvador, Brazil§ Federal University of Southern Bahia, Porto Seguro, Brazil| Federal University of Bahia, Salvador - Bahia, Brazil
Open Access
Abstract

With the growth of cloud services, many companies have begun to persist and make their data available through services such as Data as a Service (DaaS) and Database as a Service (DBaaS). The DaaS model provides on-demand data through an Application Programming Inter- face (API), while DBaaS model provides on-demand database management systems. Different data sources require efforts to integrate data from different models. These model types include unstructured, semi-structured, and structured data. Heterogeneity from DaaS and DBaaS makes it challenging to integrate data from different services. In response to this problem, we developed the Data Join (DJ) method to integrate heterogeneous DaaS and DBaaS sources. DJ was described through canonical models and incorporated into a middleware as a proof-of-concept. A test case and three experiments were performed to validate our DJ method: the first experiment tackles data from DaaS and DBaaS in isolation; the second experiment associates data from different DaaS and DBaaS through one join clause; and the third experiment integrates data from three sources (one DaaS and two DBaaS) based on different data type (relational, NoSQL, and NewSQL) through two join clauses. Our experiments evaluated the viability, functionality, integration, and performance of the DJ method. Results demonstrate that DJ method outperforms most of the related work on selecting and integrating data in a cloud environment.

Keywords
Cloud Computing, Data Integration, DaaS, DBaaS