|
Work in Progress: Targets 2008-09
|
1. |
A five-million-word corpus in the following 6 languages:
Assamese, Bengali, Gujarati, Manipuri, Nepali and Tamil. |
2. |
I. Ten hours of speech recording for the Speech Corpus in the following 8 languages:
Assamese, Bengali, Gujarati, Kannada, Manipuri, Nepali and Tamil. |
|
II. Procurement of 50 hours of speech corpora in 8 languages. |
3. |
Multilingual dictionary (Lexipedia) in 5 Languages. |
4. |
Frequency dictionaries in 6 Languages. |
5. |
Creation of Pronunciation Dictionary in 5 Languages. |
6. |
Development of tools for analysis of the Text and Speech Corpus in Indian Languages:
I. |
Automatic Transliterator |
II. |
Frequency Analyzer |
III. |
KWIC and KWOC Retriever |
IV. |
Morphological Analyzer |
V. |
Speech Data Warehousing Interface |
VI. |
Speech Synthesizer |
|
7. |
Conduct of Project Advisory Committee meetings (2) and of national and regional-level training programmes, workshops, meetings and conferences (22). |
8. |
Conduct of one international seminar/conference. |
9. |
Conduct of faculty improvement programmes (4) for the staff of LDC-IL. |
10. |
Award of grants (4) for the creation of language/lexical resources in Indian languages. |
|