LANGUAGE IN INDIA

Strength for Today and Bright Hope for Tomorrow

Volume 11 : 5 May 2011
ISSN 1930-2940

Managing Editor: M. S. Thirumalai, Ph.D.
Editors: B. Mallikarjun, Ph.D.
         Sam Mohanlal, Ph.D.
         B. A. Sharada, Ph.D.
         A. R. Fatihi, Ph.D.
         Lakhan Gusain, Ph.D.
         Jennifer Marie Bayer, Ph.D.
         S. M. Ravichandran, Ph.D.
         G. Baskaran, Ph.D.
         L. Ramamoorthy, Ph.D.


HOME PAGE



BOOKS FOR YOU TO READ AND DOWNLOAD FREE!


REFERENCE MATERIAL

BACK ISSUES


  • E-mail your articles and book-length reports in Microsoft Word to languageinindiaUSA@gmail.com.
  • Contributors from South Asia may e-mail their articles to
    B. Mallikarjun,
    Central Institute of Indian Languages,
    Manasagangotri,
    Mysore 570006, India
    mallikarjun@ciil.stpmy.soft.net.
  • PLEASE READ THE GUIDELINES GIVEN IN HOME PAGE IMMEDIATELY AFTER THE LIST OF CONTENTS.
  • Your articles and book-length reports should be written following the APA, MLA, LSA, or IJDL Stylesheet.
  • The Editorial Board has the right to accept, reject, or suggest modifications to the articles submitted for publication, and to make suitable stylistic adjustments. High quality, academic integrity, ethics and morals are expected from the authors and discussants.

Copyright © 2010
M. S. Thirumalai


Custom Search

Named Entity Recognition and Transliteration for Telugu Language

Kommaluri Vijayanand and R. P. Seenivasan


1. Introduction

The concept of transliteration is a wonderful art in Machine Translation. The translation of named entities is said to be transliteration. Transliteration should not be confused with translation, which involves a change in language while preserving meaning. Transliteration performs a mapping from one alphabet into another. In a broader sense, the word transliteration is used to include both transliteration at the micro level and transcription.

Transliteration is a process in which words in one alphabet are represented in another alphabet. There are a number of rules which govern transliteration between different alphabets, designed to ensure that it is uniform, allowing readers to clearly understand transliterations. Transliteration is not quite the same thing as transcription, although the two are very similar; in transcription, people represent sounds with letters from another alphabet, while in transliteration, people attempt to map letters over each other, sometimes with accent marks or other clues to suggest particular sounds.

As we say technically the transliteration is the process of transforming the text in one writing system (Source language) to another writing system (Target Language) without changing its pronunciation. Transliteration is a very good asset for machine translation. Machine translation cannot translate some of the text. Because, there could not be correspond translation word in the bilingual dictionary. Those words are called out of vocabulary words (OOV). To overcome this OOV problem transliteration came into being. The transliteration involves the process of converting the character sequence in the source language to target language on the basis of how the characters are pronounced in source language.

Transliteration needs knowledge of characters in source and target language. Since the pronunciation is the aim goal of transliteration it is difficult to give exact transliteration. Because, the pronunciation of single character of the source language can have multiple character in the target language as the transliteration is done by character wise. In transliteration so far we can give possible transliterations and yet it is the great challenge to the researchers to give exact transliteration in target language.


This is only the beginning part of the article. PLEASE CLICK HERE TO READ THE ARTICLE IN PRINTER-FRIENDLY VERSION.


Kommaluri Vijayanand
kvixs@yahoo.co.in

R. P. Seenivasan
rpsv@yahoo.com

Department of Computer Science
School of Engineering and Technology
Pondicherry University
Puducherry – 605 014
India

Custom Search


  • Click Here to Go to Creative Writing Section

  • Send your articles
    as an attachment
    to your e-mail to
    languageinindiaUSA@gmail.com.
  • Please ensure that your name, academic degrees, institutional affiliation and institutional address, and your e-mail address are all given in the first page of your article. Also include a declaration that your article or work submitted for publication in LANGUAGE IN INDIA is an original work by you and that you have duly acknowledged the work or works of others you either cited or used in writing your articles, etc. Remember that by maintaining academic integrity we not only do the right thing but also help the growth, development and recognition of Indian scholarship.