JUCS - Journal of Universal Computer Science 14(18): 3031-3050, doi: 10.3217/jucs-014-18-3031
Comparative Aspects between the Cluster and Grid Implementations of BigBatch
expand article infoGiorgia de Oliveira Mattos, Andrei de Araújo Formiga§, Rafael Dueire Lins, Francisco Heron de Carvalho Júnior|, Fernando Mário Junqueira Martins
‡ Federal University of Pernambuco, Recife, Brazil§ Universidade Federal de Pernambuco, Brazil| Universidade Federal do Ceará, Fortaleza, Brazil¶ Universidade do Minho, Braga, Portugal
Open Access
Abstract
BigBatch is an image processing environment designed to process batches of thousands of monochromatic documents. One of the flexibilities and pioneer aspects of BigBatch is offering the possibility of working in distributed environments such as clusters and grids. This paper presents an overview of BigBatch image processing features and analyzes the results of a number of experiments devised to compare its cluster and grid configurations. Although preliminary results were published earlier on, the new data shown here that sheds new lights onto this aspect. The results obtained exhibit almost no difference in total execution times for some grid and cluster configurations, but significant differences for others, indicating that the choice between such configurations must take into account a number of details in order to reach peak performance. Besides those, there are other qualitative aspects that may impact this choice. This paper analyzes these aspects and provides a general picture of how to successfully use BigBatch to process document images employing computers in parallel for this task.
Keywords
cluster, grid, image processing, load-balancing