Skip to main content | Skip to Navigation | Text Size : | Language:

logo of Linguistic Data Consortium for Indian Languages (LDC-IL)
About Us | Official Website of Linguistic Data Consortium for Indian Languages

About Us

Introduction

The Linguistic Data Consortium for Indian Languages (LDC-IL), established in 2007, is a scheme of the Department of Higher Education under the Ministry of Education, Government of India. It is implemented by and housed inside the Central Institute of Indian Languages (CIIL), Mysore.

Fully funded by the Government of India, the Consortium develops and distributes language resources to researchers, developers, and organizations working with Indian languages.

Since April 4, 2019, LDC-IL has been distributing linguistic resources for Artificial Intelligence (AI) and Natural Language Processing (NLP) in Indian languages through its Data Distribution Portal, launched by the Hon'ble Vice President of India, Shri M. Venkaiah Naidu.

Language data is the key ingredient in terms of research and development in the area of language technology. As the time goes by, an increasing number of researchers are seeing the potential benefits of the use of an electronic corpus as a source of empirical language data for their research. The issues surrounding collection, processing and annotation of the quantities of linguistic data make it necessary to involve a number of disciplines like linguistics, computer science, statistics, engineering etc. Corpus linguists, as we all know, often use computational methods when analyzing their data whereas the computational linguists are dependent on computer-readable linguistic data to use in their research and in building practical tools and programmes.