LDC-IL (Linguistic Data Consortium for Indian Languages)

The Linguistic Data Consortium for Indian Languages (LDC-IL) has been established to provide annotated, quality language data in Indian languages for research and development. India has a wide repertoire of languages, and many initiatives have been taken in the field of text. Speech, however, is still a new venture in the field of NLP.
In view of the pressing need for quality data in Indian languages for research in the above-mentioned fields, the idea of conducting an event on ‘Creation of Multilingual Speech Resources’ came into existence.

We aim to:

Establish a national research initiative group that may lead to an international community working on Indian languages.
Create a knowledge group to address issues in building large-scale speech corpora and technologies in Indian languages.
Explore methodologies, new R&D information and emerging trends.
Exchange information on Indian-language speech resources, their applications and evaluation, and ongoing and planned activities.
Explore industrial practices and needs, and the requirements coming from the e-society, with respect both to policy issues and to technological and organizational entities.

Issues in the design, construction and use of language resources in speech and other associated media.
Guidelines, standards, specifications, models and best practices for language resources.
Methodologies and tools for language resources construction, annotation, extraction & acquisition of knowledge, ontologies and knowledge representation.
Integration between multilingual language resources, ontologies and speech technologies.
Metadata descriptions, validation, quality assurance and evaluation of language resources.
Utilization of language resources in different types of systems and applications, such as information extraction, information retrieval, speech dictation, mobile communication, summarization, semantic search, text mining, inferencing and reasoning; in different types of interfaces, such as (speech-based) dialogue systems, natural language and multimodal interactions, voice-activated services and cognitive systems; and in communication with neighboring fields of application, e.g. e-government, e-culture, e-health and e-participation.
Industrial language resources requirements.
Organizational, economical, legal and general issues regarding language resources & evaluation.
User needs, usability and user satisfaction evaluation.
Benchmarking of systems and products.
Priorities, perspectives, strategies in national and international policies for language resources.

Papers, including an abstract of about 100-200 words and not exceeding 10 pages, should be submitted to:
ldc-anjali/ciil@ciil.stpmy.soft.net on or before 31st Dec. 2009. Authors are requested to format their papers in ‘IEEE Format’ and to submit them in *.pdf or *.rtf format. Please note that papers outside the specified length are subject to rejection without review.

Minimal financial support for travel shall be provided to the authors of a few selected papers.