JUCS - Journal of Universal Computer Science 15(18): 3325-3342, doi: 10.3217/jucs-015-18-3325
Robust Extraction of Text from Camera Images using Colour and Spatial Information Simultaneously
Shyama Prosad Chowdhury, Soumyadeep Dhar§, Karen Rafferty, Amit Kumar Das|, Bhabatosh Chanda
‡ Queen's University Belfast, Belfast, United Kingdom
§ Videonetics Technology Pvt. Ltd., Kolkata, India
| Bengal Engineering and Science University, Shibpur, India
¶ Indian Statistical Institute, Kolkata, India
Open Access
The use of text extraction from camera-based colour scene images is growing rapidly. Text within a camera-grabbed image can carry a large amount of metadata about the scene, which can be useful for identification, indexing and retrieval. While the segmentation and recognition of text from document images is quite successful, the detection of coloured scene text remains a new challenge for camera-based images. Common problems in text extraction from such images are the lack of prior knowledge of text features such as colour, font, size and orientation, and of the location of the probable text regions. In this paper, we document the development of a fully automatic and extremely robust text segmentation technique that can be used for any type of camera-grabbed frame, be it a single image or video. A new algorithm is proposed that can overcome the current problems of text segmentation. The algorithm exploits text appearance in terms of colour and spatial distribution. When tested on a variety of camera-based images, the new text extraction technique was found to outperform existing techniques. The proposed technique also overcomes problems that can arise from an unconstrained complex background. The novelty of the work arises from the fact that this is the first time that colour and spatial information are used simultaneously for the purpose of text extraction.
text extraction, text localisation, camera image, video frame, discrete edge boundary