International Journal of Information & Computation Technology.
ISSN 0974-2239 Volume 4, Number 17 (2014), pp. 1839-1845
© International Research Publications House
http://www. irphouse.com
Recognition of Gurmukhi Text from Sign Board Images
Captured from Mobile Camera
Shilpa Arora
1
, Dharamveer Sharma
2,
Silky Arora
3
1
Department of Computer Science, Punjabi University, Patiala (Punjab)
2
Assistant Professor, Department of Computer Science,
Punjabi University, Patiala (Punjab)
3
Department of Computer Science, Punjabi University, Patiala (Punjab)
1
arorashilpa69@yahoo.com,
2
dveer72@hotmail.com,
3
arorasilky08@gmail.com
Abstract
This paper presents recognition of Gurmukhi text from sign board images
which are captured through mobile phone camera. The images are binarized
and noise free. This consists of three stages. In first step the extracted text is
segmented into characters. In second step the features which uniquely classify
the characters are extracted using Zoning. In third stage the classifier SVM is
used to recognize the text.
Keywords: SVM, classifier, recognition, Gurmukhi, images
1. Introduction
Automatic text recognition from images receives a growing attention because of
potential applications in image retrieval, robotics and intelligent transport system, etc.
In addition, extraction and recognition of texts in images is useful to blind and
foreigners with language barrier as well. However, developing a robust scheme for
extraction and recognition of texts from camera captured image is a great challenge
due to several factors which include variations of style, color, spacing, distribution
and alignment of texts, background complexity, influence of luminance, and so on. A
large number of algorithms have been proposed in the literature to cope with these
issues. Work for the development of complete OCR systems for Indian language
scripts is major field of research. Research in the field of recognition of Gurmukhi
script faces major problem mainly related to the unique characteristics of the script
like connectivity of characters on the headline, characters in a word present in both
horizontal and vertical directions, two or more characters in a word having
intersecting minimum bounding rectangles along horizontal direction, existence of a