LANGUAGE IN INDIA

A Grammar of Malayalam ...
Ravi Sankar S Nair, Ph.D.

The Evolution of Language Laws in Post-Independence India ...
B. Mallikarjun, Ph.D.

Impact of Commercialization on Language with Special Reference to Urdu Lexicon - Doctoral Dissertation ... Somana Fatimah,Ph.D.

Status of English among the Kokborok and Tripura Bangla Learners in Tripura - M.Phi. Dissertation ... Swapan Debnath, M.A., M.Phil., Ph.D.

Communicative Language Teaching Approach at Higher Secondary Level in Bangladesh – Teachers’ Perceptions and Classroom Practice ... Md. Khaled Bin Chowdhury, M.A. (Double)

A Study on Personality Factors Causing Stress among School Teachers
M.Phil. Dissertation ... C. Manjula, M.A., M.Phil., Ph.D.

English Language Teaching Updating the English Classroom with Techniques and Communication Skills
A Book on Current ELT ... Chandrika Mohan, M.A., M.A., M.Phil., C.G.T., Ph.D. Candidate

A Song for the Road-
Wole Soyinka’s Imagery and Tradition ... V. N. Manjula, Ph.D.

A Linguistic Survey of Katta Varadaraju’s Dwipada Ramayanam
A Ph.D. Dissertation in Telugu ... Pammi Pavan Kumar, M.A. (Telugu) M.A. (Linguistics), M. Phil., Ph.D. (Telugu)

Select Speeches of Mrs. Indira Gandhi
English to Tamil
M.Phil. Dissertation ... J. Abiraami, M.A., M.Phil.

Linguistics and Literature, Department of Linguistics Silver Jubilee Volume 1 ... Editors: C. Sivashanmugam, Ph.D., et al.

Recent Advances in Linguistics, Department of Linguistics Silver Jubilee Volume 2 ... Editors: C. Sivashanmugam, Ph.D., et al.

Engineering English: A Critical Evaluation - Ph.D. Dissertation ... Albert P’Rayan, Ph.D.

Kalidasa's Shakuntala and the Doctrine of Rasa ... Tripti Mund, M.A., M.Phil.

Evolving Strategies for Teaching Basic Vocabulary in L2 through Meaningful Input: An Ethnographic Study with First Generation Learners ... Rajakumar Guduru, M.Phil., Ph.D. Scholar

Aspects of Autobiography in Indian Writing in English ... Editors: Pauline Das, Ph.D., K. R. Vijaya Murari, Ph.D., and Charu Sheela, M.A., M.Phil., M.B.A.

Imagery in Donne's Songs and Sonnets ... Fatima Ali al-Khamisi, M.A.

Parsing in Indian Languages ... Editors: Kommaluri Vijayanand and L. Ramamoorthy

English Language Teaching (ELT) in Saudi Arabia: A Study of Learners' Needs Analysis with Special Reference to Community College, Najran University ... Dr. Mohd. Mahib ur Rahman, Ph.D.

Provision for Linguistic Diversity and Linguistic Minorities in India ... Vanishree V.M., MAPL and ELT, M.A., PGDHRM.

Impact of Students' Attitudes on their Achievement in English: A Study in the Yemeni Context ... Hassan Saeed Awadh Ba-Udhan

A Study of B.ED. Students' Attitude Towards Using Internet in Vellore District, Tamilnadu, India ... T. Pushpanathan, M.A., M.Phil., B.Ed.

Development of a Hindi to Punjabi Machine Translation System, A Doctoral Dissertation ... Vishal Goyal, Ph.D.

A Report on the State of Urdu Literacy in India, 2010 ...
Omar Khalidi, Ph.D.

English for Medical Students of Hodeidah University, Yemen - A Pre-sessional Course ...
Arif Ahmed Mohammed Hassan Al-Ahdal, Ph.D. Scholar

Global Perspective of Teaching English Literature in Higher Education in Pakistan ...
Rabiah Rustam, M.S., Ph.D. Candidate

Improving Chemmozhi Learning and Teaching - Descriptive Studies in Classical-Modern Tamil Grammar ...
A. Boologa Rambai, Ph.D.

A Phonetic and Phonological Study of the Consonants of English and Arabic ... Abdulghani A. Al-Hattami, Ph.D. Candidate

Some Aspects of Teaching-Learning English as a Second Language ...
R. Krishnaveni, M.A., M.Sc., M.Phil., Ph.D. Candidate

The Influence of First Language Grammar (L1) on the English Language (L2) Writing of Tamil School Students: A Case Study from Malaysia ...
Mahendran Maniam, Ph.D. (ESL)

Economics of Crime : A Comparative Analysis of the Socio-Economic Conditions of Convicted Female and Male Criminality In Selected Prisons in Tamil Nadu ...
S. Santhanalakshmi, Ph.D.

Technique as Voyage of Discovery: A Study of the Techniques in Dante's Paradiso ...
Raji Narasimhan, M.A.

A Critical Study of The Wasteland - Poetry as Metaphor ...
K. R. Vijaya, M.A., M.Phil.

Language and Literature: An Exposition - Papers Presented in the Karunya University National Seminar ... Editor: J. Sundar Singh, Ph.D.

Linguistic Purism and Language Planning in a Multilingual Context ...
L. Ramamoorthy, Ph.D.

Papers Presented in the All-India Conference on Multimedia Enhanced Language Teaching - MELT 2009 ...
L. Ramamoorthy, Ph.D. and J.R. Nirmala, Ph.D.

A Phonological Study of Variety of English Spoken by Oriya Speakers in Western Orissa - A Doctoral Dissertation ... Arun K. Behera, Ph.D.

Phonological Analysis of English Phonotactics of Syllable Initial and Final Consonant Clusters by Yemeni Speakers of English ... Abdulghani. M. A. Al-Shuaibi, M.A.

Journey of Self-discovery in Anita Nair's Ladies' Coupé ... V. Chandra, M.A.

The Literary Value of the Book of Isaiah ... Helen Unius Backiavathy, M.A.,M.Phil., Ph.D. Candidate

A Study of Structural Duplication in Tamil and Telugu - A Doctoral Dissertation ... Parimalagantham, Ph.D.

The Politics of Survival in the Novels of Margaret Atwood ... Pauline Das, Ph.D.

Nonverbal Communication in Tamil Novels - A Book in Tamil ...
M. S. Thirumalai, Ph.D.

Girish Karnad as a Modern Indian Dramatist - A Study ...
B. Reena, M.A., M.Phil.

A Study of English Loan Words in Selected Bahasa Melayu Newspaper Articles...
Shamimah Binti Haja Mohideen, M.HSc. (TESL)

The Internal Landscape and the Existential Agony of Women in Anjana Appachana's Novel LISTENING NOW, A Doctoral Dissertation ...
M. Poonkodi, Ph.D.

Trade in the Madras Presidency, 1941 - 1947 - A Doctoral Dissertation ...
R. Jayasurya, Ph.D.

Trends and Spatial Patterns of Crime in India - A Case Study of a District in India ...
M. Jayamala, Ph.D.

The Trading Community in Early Tamil Society Up To 900 AD ...
R. Jeyasurya, M.A., M.Phil., Ph.D.

A Study of Auxiliaries in the Old and the Middle Tamil ...
A.Boologarambai, M.A., Ph.D.

History of Growth and Reforms of British Military Administration in India, 1848-1949 ...
Hemalatha, M.A., M.Phil.

Language of Mass Media: A Study Based on Malayalam Broadcasts - A Doctoral Dissertation
K. Parameswaran, Ph.D.

Form and Function of Disorders in Verbal Narratives - A Doctoral Dissertation ...
Kandala Srinivasacharya, Ph.D.

Status Marking in Tamil - A Ph.D. Dissertation
P. Perumalsamy, Ph.D.

LANGUAGE AND POWER IN COMMUNICATION ...
Editors: Jennifer M. Bayer, Ph.D., and Pushpa Pai, Ph.D.

Onomatopoeia in Tamil ...
V. Gnanasundaram, Ph.D.

Linguistics and Literature ...
C.Shunmugom, Ph.D., and C. Sivashanmugam, Ph.D., V. Thayalan, Ph.D. and C. Sivakumar, Ph.D. (Editors)

Translation: New Dimensions ...
C.Shunmugom, Ph.D., and C. Sivashanmugam, Ph.D., Editors

Language of Headlines in Kannada Dailies ...
M. N. Leelavathi, Ph.D.

Cooperative Learning Incorporating Computer-Mediated Communication: Participation, Perceptions, and Learning Outcomes in a Deaf Education Classroom ...
Michelle Pandian, M.S.

The Effects of Age on the Ability to Learn English As a Second Language ... Mariam Dadabhai, B.A. Hons.

A STUDY OF THE SKILLS OF READING COMPREHENSION IN ENGLISH DEVELOPED BY STUDENTS OF STANDARD IX IN THE SCHOOLS IN TUTICORIN DISTRICT, TAMILNADU ...
A. Joycilin Shermila, Ph.D.

A Socio-Pragmatic Comparative Study of Ostensible Invitations in English and Farsi ...
Mohammad Ali Salmani-Nodoushan, Ph.D.

TEXT FAMILIARITY, READING TASKS, AND ESP TEST PERFORMANCE: A STUDY ON IRANIAN LEP AND NON-LEP UNIVERSITY STUDENTS - A DOCTORAL DISSERTATION ...
Mohammad Ali Salmani-Nodoushan, Ph.D.

A STUDY ON THE LEARNING PROCESS OF ENGLISH
BY HIGHER SECONDARY STUDENTS
WITH SPECIAL REFERENCE TO DHARMAPURI DISTRICT IN TAMILNADU
K. Chidambaram, Ph.D.

SPEAKING STRATEGIES TO OVERCOME COMMUNICATION DIFFICULTIES IN THE TARGET LANGUAGE SITUATION - BANGLADESHIS IN NEW ZEALAND ...
Harunur Rashid Khan

THE PROBLEMS IN LEARNING MODAL AUXILIARY VERBS IN ENGLISH AT HIGH SCHOOL LEVEL ...
Chandra Bose, Ph.D. Candidate

THE ROLE OF VISION IN LANGUAGE LEARNING in Children with Moderate to Severe Disabilities ...
Martha Louise Low, Ph.D.

SANSKRIT TO ENGLISH TRANSLATOR ...
S. Aparna, M.Sc.

A LINGUISTIC STUDY OF ENGLISH LANGUAGE CURRICULUM AT THE SECONDARY LEVEL IN BANGLADESH - A COMMUNICATIVE APPROACH TO CURRICULUM DEVELOPMENT by
Kamrul Hasan, Ph.D.

COMMUNICATION VIA EYE AND FACE in Indian Contexts by M. S. Thirumalai, Ph.D.

COMMUNICATION VIA GESTURE - Indian Contexts by
M. S. Thirumalai, Ph.D.

CIEFL Occasional Papers in Linguistics, Vol. 10

Language Acquisition, Thought and Disorder - Some Classic Positions by
M. S. Thirumalai, Ph.D.

English in India: Loyalty and Attitudes by
Annika Hohenthal

Language In Science by
M. S. Thirumalai, Ph.D.

Vocabulary Education by
B. Mallikarjun, Ph.D.

A Contrastive Analysis of Hindi and Malayalam by
V. Geethakumary, Ph.D.

Language of Advertisements in Tamil Mass Media by
Sandhya Nayak, Ph.D.

An Introduction to TESOL: Teaching English to Speakers of Other Languages by
M. S. Thirumalai, Ph.D.

Transformation of Natural Language into Indexing Language: Kannada - A Case Study by B. A. Sharada, Ph.D.

How to Learn Another Language? by M.S.Thirumalai, Ph.D.

Verbal Communication with CP Children by Shyamala Chengappa, Ph.D. and M.S.Thirumalai, Ph.D.

Bringing Order to Linguistic Diversity - Language Planning in the British Raj by
Ranjit Singh Rangila,
M. S. Thirumalai,
and B. Mallikarjun

REFERENCE MATERIALS

Some Inputs for Draft National Education Policy 2016
Ministry of Human Resource Development
Government of India

CENSUS OF INDIA 2011 General Data

CENSUS OF INDIA 2011 DATA ON DISABILITY

Action Plan of Centre for Classical Kannada

UNIVERSAL DECLARATION OF LINGUISTIC RIGHTS

Lord Macaulay and
His Minute on
Indian Education

In Defense of
Indian Vernaculars
Against
Lord Macaulay's Minute
By A Contemporary of
Lord Macaulay

Languages of India,
Census of India 1991

The Constitution of India:
Provisions Relating to
Languages

The Official
Languages Act, 1963
(As Amended 1967)

Mother Tongues of India,
According to
1961 Census of India

BACK ISSUES

FROM MARCH 2001

E-mail your articles and book-length reports in Microsoft Word to languageinindiaUSA@gmail.com.

PLEASE READ THE GUIDELINES GIVEN IN HOME PAGE IMMEDIATELY AFTER THE LIST OF CONTENTS.

Your articles and book-length reports should be written following the APA, MLA, LSA, or IJDL Stylesheet.

The Editorial Board has the right to accept, reject, or suggest modifications to the articles submitted for publication, and to make suitable stylistic adjustments. High quality, academic integrity, ethics and morals are expected from the authors and discussants.

Copyright © 2016
M. S. Thirumalai

Publisher: M. S. Thirumalai, Ph.D.
11249 Oregon Circle
Bloomington, MN 55438
USA

Custom Search

Creation and Compilation of Hindi Newspaper Text Corpus

Vandana Mishra and Niladri Sekhar Dash
Indian Statistical Institute

Abstract

Developing a corpus for the study of various aspects of a language is a highly challenging task which involves effective planning and implementation of the same. The prime concern in the development of a corpus is the overall design criteria. In this chapter we aim at presenting some theoretical guidelines on the design criteria of a one million words digital corpus of Hindi Newspaper Text Corpus (HNTC) which has been developed as a part of an on-going research activity. After the determination of the planning stage a comprehensive description of the various steps involved in the development of the corpus is discussed. An overview of the developed corpus is also highlighted with detailed specifications. Since the developed corpus has to be used subsequently for various kinds of linguistic analysis, it has been documented efficiently. This chapter also tends to give importance to documentation, storage and management of the developed corpus as it requires extreme care on the part of the corpus builder. It is a highly tedious task. Proper documentation of the corpus will ensure it authenticity and retrievability. Also, it will be utilizable for a wider range of potential areas in future.

Keywords: Corpus, Compilation, Hindi, Newspaper, Documentation

1. Introduction

The development of text corpus in Indian languages began with the generation of the Kolhapur Corpus of Indian English (KCIE) which was designed by Shastri (1988) in an effort at individual level to identify the types of similarity and difference among American English, British English and Indian English. From then onwards several attempts may have been made to develop corpora for all major Indian languages at the individual level but these are not much appreciated or attested in the history of corpus generation and application in India.

The next most important milestone in this route is the TDIL (Technology Development of Indian Languages) project which was initiated in early 1990s by Department of Electronics (DoE), Ministry of Communication and Information Technology (MCIT), Govt. of India in 1991. It was launched with a mission for developing corpora in electronic form in all Indian languages included in the 8th Schedule of the Constitution of India for subsequent works of language technology (Dash 2007). The Central Institute of Indian Languages (CIIL), Mysore was entrusted with the responsibility for coordinating the corpus development task on behalf of the MCIT as well as developing required tools and systems for conversion of the corpus into Unicode format as well as for its storage, management, dissemination, and utilization by interested researchers. The CIIL has collaborated with Lancaster University, UK for these tasks (Baker, McEnery 2003).

This is only the beginning part of the article. PLEASE CLICK HERE TO READ THE ENTIRE ARTICLE IN PRINTER-FRIENDLY VERSION.

Vandana Mishra
Senior Research Fellow
vandana.mishra87@gmail.com

Niladri Sekhar Dash
Associate Professor
ns_dash@yahoo.com

Linguistic Research Unit
Indian Statistical Institute
203 B.T Road
Kolkata-700108
West Bengal
India

Click Here to Access All the Papers of February 2018 Issue

Click Here for A PRINT VERSION OF ALL THE PAPERS OF FEBRUARY, 2018 ISSUE IN BOOK FORMAT.

Click Here for Back Issues of Language in India

Click Here for the HOME PAGE of Language in India

CONTACT EDITOR languageinindiaUSA@gmail.com

Custom Search

Click Here to Go to Creative Writing Section

Send your articles
as an attachment
to your e-mail to
languageinindiaUSA@gmail.com.
Please ensure that your name, academic degrees, institutional affiliation and institutional address, and your e-mail address are all given in the first page of your article. Also include a declaration that your article or work submitted for publication in LANGUAGE IN INDIA is an original work by you and that you have duly acknowledged the work or works of others you used in writing your articles, etc. Remember that by maintaining academic integrity we not only do the right thing but also help the growth, development and recognition of Indian/South Asian scholarship.

LANGUAGE IN INDIA

Strength for Today and Bright Hope for Tomorrow

Volume 18:2 February 2018 ISSN 1930-2940

Language in India www.languageinindia.com is included in the UGC Approved List of Journals. Serial Number 49042.

Creation and Compilation of Hindi Newspaper Text Corpus

Vandana Mishra and Niladri Sekhar Dash Indian Statistical Institute

Volume 18:2 February 2018
ISSN 1930-2940

Vandana Mishra and Niladri Sekhar Dash
Indian Statistical Institute