International Journal of Engineering and Advanced Technology (IJEAT)
ISSN: 2249 – 8958, Volume-9 Issue-1, October 2019
2577
Published By:
Blue Eyes Intelligence Engineering
& Sciences Publication
Retrieval Number: A9348109119/2019©BEIESP
DOI: 10.35940/ijeat.A9348.109119
MT Embedded E-Learning in India - Challenges for
NLP/AI
Ritu Nidhi, Tanya Singh, D.K. Lobiyal
Abstract -Indian languages are spoken by more than 90% of
its population while most of the higher education happens in
English medium. The policy makers in Indian government have
realized that by introducing multilingual education
electronically, they can reach out to the remotest corner of India
and educate all in their mother tongue. The New Education
Policy (NEP) draft just released by the government puts a heavy
focus on mother tongues in education. The recent initiatives and
focus on Natural Language Processing/ Artificial Intelligence
(NLP/AI) in education through e-learning is not too surprising in
this context. The paper presents the current initiatives in these
directions by the government of India, surveys available NLP
technologies particularly those for automatic translation of
educational content developed by academia and industry and
focuses on the Maithili language community. India's education
needs are diverse and the success of e-learning depends heavily
on the availability of necessary NLP tools in all languages.
Almost all of major scheduled Indian languages are considered
"resource-poor". While some of these languages may have the
basic tools, they lack quality translation tools for delivering
education in native language. The situation is more challenging
in those languages where even the most basic resources and tools
do not exist. Maithili - a language of Bihar and Nepal is such a
language. The paper also presents an effort to develop MT
resources and tools for Maithili and its application in delivering
multilingual content for education.
Keyword- E-learning, NLP, MT, Indian Language, Maithili
I. INTRODUCTION
India is one of the fastest growing markets in multilingual
education technology because there is a need to reach out to
the remotest regions of India covering a vast multilingual
and socio cultural diversity. As per 2011 census report, there
are 26 % of Indian population who are illiterate. The
inherent socio cultural political diversity, exclusion,
complexity etc in Indian education have been nicely
presented by [1]. The potential growth, and opportunities for
business has been summed up by [2], [3], and [4]. India's
language scene can be described as follows - a vast
multilingual and multicultural country with 22 national but
resource poor languages and more than 1600 other
languages with less that10 % English knowing population
and significant illiteracy. This makes India a land of
opportunities in the area of using language technology in
education. Of the 22 scheduled languages, Hindi has the
dual status of National and Official Language (NOL) and
English as Associate Official Language (AOL).
Revised Manuscript Received on October 15, 2019
Ritu Nidhi, scholar at Amity University, Noida, India
Dr. Tanya Singh, Professor and Dy. Director (Academics), Amity
School of Engineering & Technology, Amity University, Uttar Pradesh.
Dr. DK Lobiyal, Professor, School of Computer & Systems Sciences,
Jawaharlal Nehru University, New Delhi.
All other 21 scheduled Indian languages enjoy national
status but do not have sufficient language technology
resources to compete with English. Factors like variability,
language mixing, rich morphology, weak syntax, lack of
technology standards have contributed to poor development
of computer technologies in Indian languages. Maithili is
one of the 22 schedule Indian languages spoken in the state
of Bihar and neighboring Nepal and is extremely resource
poor in terms of language technology resources. Lack of
digital resources has resulted in a near absence of computer
technologies in this language.
A.Landscape of Maithili
Maithili (ISO 639-3) is an Indo Aryan language spoken in
Bihar and Nepal. It is also spoken in neighboring states of
Jharkhand and West Bengal. As per the census of India
(2011), we have more than 13 million Maithili speakers in
India alone. Maithili is also spoken in the neighboring
country Nepal where it is the second official language and
the second most spoken language. A rough estimate puts the
total number of Maithili speakers in the world to be around
34 million
1
. Popularly written in Devanagari, Maithili has its
own script called Mithilakshara. Maithili literature dates
back to 8th century AD and has at least six varieties [5].
Maithili shows typical linguistic features of an eastern Indo
Aryan language. With a rich inflection system, absence of
agreement at the level of verb and reasonable scrambling
due to weaker syntax. Maithili has its unique features like
varying layers of pronominal and verbal honorifics. Maithili
has typical Indo Aryan Subject-Object-Verb (SOV) word
order, however flexibility in moving syntactic constituents
with relative ease poses challenges for automated linguistic
analyses [5].
Indo Aryan languages of north India can easily be contrasted
with English which is a rigid syntax SVO language with
little scope for scrambling. English also exhibits weaker
morphology compared to Maithili. These and other
contrasting features of the two languages create difficulties
for automatic translation.
II. E-LEARNING IN INDIA
India's education is based on the colonial model established
by Lord Macaulay during the British administration of India.
Successive post independence governments in India have
tried their best to adapt it to Indian diversity, but were not
successful given the complexity and challenges therein. The
question of languages, medium of instruction etc have been
more difficult to implement in India than the content.
Though the literacy rates have risen sharply, still there is a
massive 26% population below this threshold.
The 2001 and 2011 Census Report published by
Government of India present contrasting literacy figures.
1
https://www.ethnologue.com/language/mai