Automatically Deciding if a Document was Scanned or Photographed

Gabriel Pereira e Silva; Rafael Lins; Brenno Miro; Steven Simske; Marcelo Thielo

doi:10.3217/jucs-015-18-3364

JUCS - Journal of Universal Computer Science 15(18): 3364-3375, doi: 10.3217/jucs-015-18-3364

Automatically Deciding if a Document was Scanned or Photographed

Gabriel Pereira e Silva^‡, Rafael Dueire Lins^§, Brenno Miro^§, Steven J. Simske^|, Marcelo Thielo^¶

‡ ederal University of Pernambuco, Recife, Brazil§ Federal University of Pernambuco, Recife, Brazil| HP Labs, Fort Collins, United States of America¶ HP Labs, Porto Alegre, Brazil

Corresponding author: Gabriel Pereira e Silva ( gfps@cin.ufpe.br )

This article is freely available under the J.UCS Open Content License.

Citation: Silva GP, Lins RD, Miro B, Simske SJ, Thielo M (2009) Automatically Deciding if a Document was Scanned or Photographed. JUCS - Journal of Universal Computer Science 15(18): 3364-3375. https://doi.org/10.3217/jucs-015-18-3364

Abstract

Portable digital cameras are being used widely by students and professionals in different fields as a practical way to digitize documents. Tools such as PhotoDoc enable the batch processing of such documents, performing automatic border removal and perspective correction. A PhotoDoc processed document and a scanned one look very similar to the human eye if both are in true color. However, if one tries to automatically binarize a batch of documents digitized from portable cameras compared to scanners, they have different features. The knowledge of their source is fundamental for successful processing. This paper presents a classification strategy to distinguish between scanned and photographed documents. Over 16,000 documents were tested with a correct classification rate of over 99.96%.

Keywords

MPEG-7, content-based multimedia retrieval, hypermedia systems, Web-based services, XML, semantic web, multimedia