LANGUAGE IN INDIA

Strength for Today and Bright Hope for Tomorrow

Volume 25:6 June 2025
ISSN 1930-2940

Editors:
         Sam Mohanlal, Ph.D.
         B. Mallikarjun, Ph.D.
         A. R. Fatihi, Ph.D.
         G. Baskaran, Ph.D.
         T. Deivasigamani, Ph.D.
         Pammi Pavan Kumar, Ph.D.
         Soibam Rebika Devi, M.Sc., Ph.D.

Managing Co-Editors & Publishers: Selvi M. Bunce, M.A., Ph.D. Candidate and Nathan Mulder Bunce, M.A., Ph.D. Candidate
Published monthly in honor of M.S. Thirumalai, Ph.D. (1940-2025)

Celebrate India!
Unity in Diversity!!

HOME PAGE

Click Here for Back Issues of Language in India - From 2001

Poetic Encounter
Available in https://www.amazon.in/dp/B09TT86S4T

Poems
Naked: the honest musings of two brown women
Available in https://www.amazon.in

Decrees
Available in https://www.amazon.com




BOOKS FOR YOU TO READ AND DOWNLOAD FREE!


REFERENCE MATERIALS

BACK ISSUES


  • E-mail your articles and book-length reports in Microsoft Word to languageinindiaUSA@gmail.com.
  • PLEASE READ THE GUIDELINES GIVEN IN HOME PAGE IMMEDIATELY AFTER THE LIST OF CONTENTS.
  • Your articles and book-length reports should be written following the APA, MLA, LSA, or IJDL Stylesheet.
  • The Editorial Board has the right to accept, reject, or suggest modifications to the articles submitted for publication, and to make suitable stylistic adjustments. High quality, academic integrity, ethics and morals are expected from the authors and discussants.

Copyright © 2025
M. S. Thirumalai

Publisher: Selvi M. Bunce, M.A., Ed.D. Candidate and Nathan Mulder Bunce, M.A., Ph.D. Candidate
11249 Oregon Circle
Bloomington, MN 55438
USA


Custom Search

Maithili Language Technology: A Survey

Shantanu Kumar, Ph.D. and
Narayan Choudhary, Ph.D.


Abstract

Maithili is an Indo-Aryan language primarily spoken in the Indian states of Bihar, Jharkhand, and the lower Terai regions of Nepal. While there has been some recent progress in Maithili language technology, the field is still nascent, with limited resources and research, and there is still a long way to go before we have robust and reliable language processing tools. More research and resources are needed to develop tools such as spell-checkers, text-to-speech systems, and machine translation models, which can help unlock the language's full potential in the digital age. This paper presents an effort made to survey the tools, systems, or research available in the field to facilitate language technology systems in Maithili. In this paper, the technological development of Maithili has been classified under certain heads, and the research works completed, along with the ones being undertaken, have been discussed. Different language technology systems, including the corpus creation and development presented for other languages, and their status for Maithili, have been investigated in this paper.

Keywords: Maithili, language technology, Corpus, ASR, TTS, MT, low-resource languages

Introduction

Maithili is an Indo-Aryan language primarily spoken in the Indian states of Bihar, Jharkhand, and parts of Nepal. While there has been some recent progress in Maithili language technology, the field is still nascent, with limited resources and research.

One of the significant challenges in developing Maithili language technology is the lack of a standardized orthography. The language has been written using a variety of scripts, including Devanagari, Kaithi, and Tirhuta. This lack of standardization makes it difficult to develop tools such as spell-checkers and automated text-to-speech systems. Despite these challenges, some work has been done, discussed in the ensuing sections of the paper, in developing resources for Maithili language processing.

Although some progress has been made in Maithili language technology, there is still a long way to go before we have robust and reliable tools for processing the language. More research and resources are needed to develop tools such as spell-checkers, text-to-speech systems, and machine translation models, which can help to unlock the full potential of the language in the digital age.


This is only the beginning part of the article. PLEASE CLICK HERE TO READ THE ENTIRE ARTICLE IN PRINTER-FRIENDLY VERSION.


Shantanu Kumar, Ph.D.
Central Institute of Indian Languages Mysore
shantanuk.ciil@gmail.com
&
Narayan Choudhary, Ph.D.
Central Institute of Indian Languages Mysore
nchoudhary.ciil@gmail.com

Custom Search


  • Click Here to Go to Creative Writing Section

  • Send your articles
    as an attachment
    to your e-mail to
    languageinindiaUSA@gmail.com.
  • Please ensure that your name, academic degrees, institutional affiliation and institutional address, and your e-mail address are all given in the first page of your article. Also include a declaration that your article or work submitted for publication in LANGUAGE IN INDIA is an original work by you and that you have duly acknowledged the work or works of others you used in writing your articles, etc. Remember that by maintaining academic integrity we not only do the right thing but also help the growth, development and recognition of Indian/South Asian scholarship.