International Journal For Technological Research In Engineering, Volume 2, Issue 3, November-2014. ISSN (Online): 2347 - 4718. www.ijtre.com. Copyright 2014. All rights reserved.

STUDY OF SPELL CHECKING TECHNIQUES AND AVAILABLE SPELL CHECKERS IN REGIONAL LANGUAGES: A SURVEY

Er. Sumreet Kaur Randhawa 1, Er. Charanjiv Singh Saroa 2
1 Student, 2 Assistant Professor, Department of Computer Engineering, Punjabi University, Patiala, Punjab, India

Abstract: A spell checker is software that identifies possible misspellings in a text. Spell checking is the process of detecting incorrectly spelled words and, in some cases, suggesting corrections for them. The larger the spell checker's dictionary, the higher its error detection and error correction rates. Although considerable work has been done for English, little has been done for the regional languages of India, including Punjabi. Punjabi is the world's 12th most widely spoken language. The only available spell checkers for Punjabi are AKHAR and SUDHAR. NLP (natural language processing) is a field of computer science concerned with the interaction between computers and human language. In this paper we discuss spell checking techniques and the spell checkers available for different Indian regional languages, study the spell checkers whose websites are available, and assess their efficiency through their ranking. In future work we intend to build a Punjabi spell checker that combines concepts from AI (artificial intelligence) and NLP so that the user gets an appropriate word for each misspelled word.

KEYWORDS: Punjabi spell checker, Gurmukhi spell checker, Error detection, Error correction, N-gram, Typing errors, NLP, misspelled words

I. INTRODUCTION
With the advent of the personal computer, it can be assumed that mistyping of words has increased. Thus, for many of us who do our own typing, the spell checkers in our word processors or other software have become indispensable.
A spell checker works through the following steps:
1. Take a word as input from a file.
2. Preprocess it.
3. Check in the dictionary whether that word is available.
4. If it is available, move to the next word.
5. If the word is not present, find the words that most closely match it and present them as suggestions.

Thus the two main issues for a spell checker are error detection and error correction. Spelling errors can be partitioned into two categories: real-word errors and non-word errors. In this paper we describe techniques for error detection and error correction, the available spell checkers, and the efficiency of spell checkers in different languages whose websites are available. In general, error means a measure of the estimated difference between the observed and calculated value. Spelling and typing errors are common errors made by humans. They may involve missing letters, extra letters, misspelled letters, or disordered letters.

II. LITERATURE SURVEY
As discussed above, the two main issues for a spell checker are error detection and error correction. Errors are of two types, non-word errors and real-word errors; alternatively, they may be classified as typographic errors and cognitive errors. Many techniques are available for non-word errors. The error detection process usually consists of checking whether an input string is a valid index or dictionary word, and efficient techniques have been devised for detecting such errors. The two best-known techniques are n-gram analysis and dictionary lookup. Error correction means replacing the incorrect word with the most likely correct word. The techniques available for error correction are edit distance, similarity keys, rule-based techniques, n-gram-based techniques, probabilistic techniques, and neural networks. Websites offering spell checkers for different languages include Hindikhoj.com, Star21.com, Shabdkosh.com, khandbahale.com, Shuddhoshabdo.com, and Spellweb.com.

III.
IMPORTANCE OF PUNJABI LANGUAGE
Punjabi, which has a literary history older than that of English and is the 12th most widely spoken language in the world, is now in danger of coming to an end. This is due to the absence of local languages from the educational system, because schools and colleges play a necessary role in preserving languages and culture. Almost 110 million people have Punjabi as their mother tongue, yet English has become a burdensome language for students in schools, even though one understands things better in one's mother tongue than in another language. Asian countries such as China and Japan, by contrast, teach their students in their mother tongue. Consider a science class in school: rather than understanding the concepts of science or gaining practical knowledge, students pay attention to the English words used in it. A Punjabi spell checker can therefore act as a savior of the Punjabi language.

IV. TECHNIQUES OF SPELL CHECKER
A. ERROR DETECTION
1. N-GRAM ANALYSIS
N-gram analysis is a method for finding incorrectly spelled words in a text and is used for non-word errors. Instead of comparing each entire word in the text to a dictionary, only its n-grams are checked. The check uses an n-dimensional matrix in which real n-gram frequencies are stored.
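The n-gram check described above can be illustrated with a minimal Python sketch. It is not the implementation of any spell checker surveyed here. It assumes character bigrams (n = 2), a tiny hypothetical word list called CORPUS in place of a real training text, and a Counter used as a sparse stand-in for the n-dimensional frequency matrix. The function names and the "^" and "$" word-boundary markers are also assumptions made for illustration.

```python
from collections import Counter


def ngrams(word, n=2):
    """Return the character n-grams of a word, padded with boundary markers."""
    padded = "^" + word.lower() + "$"
    return [padded[i:i + n] for i in range(len(padded) - n + 1)]


def build_table(words, n=2):
    """Count how often each n-gram occurs in a list of correctly spelled words."""
    table = Counter()
    for w in words:
        table.update(ngrams(w, n))
    return table


def suspicious_ngrams(word, table, n=2, min_count=1):
    """Return the n-grams of a word that occur fewer than min_count times."""
    return [g for g in ngrams(word, n) if table[g] < min_count]


# Hypothetical training words; a real system would use a large corpus.
CORPUS = ["spell", "checker", "error", "detection",
          "correction", "word", "text", "letter"]
TABLE = build_table(CORPUS)

print(suspicious_ngrams("spell", TABLE))   # [] : every bigram has been seen
print(suspicious_ngrams("speqll", TABLE))  # ['eq', 'ql'] : unseen bigrams
```

A word is flagged as a possible non-word error when the returned list is not empty. Because only n-grams are compared, a misspelling made entirely of frequent n-grams would pass this check, which is one reason n-gram analysis is usually discussed alongside dictionary lookup.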