Central Institute of Indian Languages [CIIL]
MISSION STATEMENT: Annotated, quality language data (both text and speech) and tools in Indian Languages to Individuals, Institutions and Industry for Research & Development, created in-house, through outsourcing and acquisition.
Size of Speech Corpora

Language | Dialects | No. of Female Speakers | No. of Male Speakers | Total No. of Speakers | Female Speech Data (hours) | Male Speech Data (hours) | Total Speech Data (hours)
---------------------------------------------------------------------------------------------------------------------------------------------------------------------------
Assamese | Upper Assam, Lower Assam | 154 | 152 | 306 | 34.5 | 44.5 | 79
Bengali | SCB (Kolkata) & Barendri (North Bengal) | 231 | 238 | 469 | 60 | 62 | 122
English (Bengali accent) | Indian | 27 | 26 | 53 | 16.5 | 12 | 28.5
English (Kannada accent) | Indian | 27 | 27 | 54 | | |
English (Malayalam accent) | Indian | 5 | 5 | 10 | 8 | 9 | 17
English (Tamil accent) | Indian | 5 | 5 | 10 | 8.5 | 8.5 | 17
Gujarati | Standard (Central Gujarat) & South Gujarat | 125 | 110 | 235 | 33.5 | 27.5 | 61
Hindi | Standard, Bhojpuri & Magahi | 206 | 227 | 433 | 51.5 | 51.5 | 103
Kannada | North-East, North-West and Canara | 246 | 246 | 492 | 68 | 65 | 133
Konkani | Standard | 54 | 53 | 107 | 15.5 | 15.5 | 31
Maithili | Standard | 75 | 75 | 150 | | |
Malayalam | Standard | 79 | 81 | 160 | 27.5 | 25.5 | 53
Manipuri | Standard & Kakching | 80 | 82 | 162 | | |
Marathi | Standard | 75 | 75 | 150 | | |
Nepali | Darjeeling & Assamese | 99 | 97 | 196 | 26.5 | 30 | 56.5
Oriya | Standard | 80 | 82 | 162 | 18 | 19.5 | 37.5
Punjabi | Standard | 78 | 78 | 156 | 14.5 | 15 | 29.5
Tamil | Standard | 76 | 80 | 156 | 29.5 | 36 | 65.5
Telugu | Standard | 75 | 75 | 150 | | |
Urdu | Standard | 85 | 84 | 169 | 20 | 20 | 40
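
In each row of the table above, the total columns are the sum of the female and male columns. For readers who load the figures into a script, a minimal sketch of that consistency check follows. The sample rows are copied from the table; the names ROWS, inconsistent_rows and total_hours are hypothetical and chosen only for illustration, and None is an assumed way of marking cells that the table leaves blank.

```python
# A few rows copied from the table above:
# (language, female speakers, male speakers, total speakers,
#  female hours, male hours, total hours).
# None marks a cell that is blank in the table (an assumption of this sketch).
ROWS = [
    ("Assamese", 154, 152, 306, 34.5, 44.5, 79.0),
    ("Bengali", 231, 238, 469, 60.0, 62.0, 122.0),
    ("English (Kannada accent)", 27, 27, 54, None, None, None),
    ("Hindi", 206, 227, 433, 51.5, 51.5, 103.0),
    ("Maithili", 75, 75, 150, None, None, None),
    ("Tamil", 76, 80, 156, 29.5, 36.0, 65.5),
]


def inconsistent_rows(rows):
    """Return the languages whose totals do not equal female + male."""
    bad = []
    for lang, f_spk, m_spk, t_spk, f_hrs, m_hrs, t_hrs in rows:
        if f_spk + m_spk != t_spk:
            bad.append(lang)
        elif t_hrs is not None and abs(f_hrs + m_hrs - t_hrs) > 1e-9:
            bad.append(lang)
    return bad


def total_hours(rows):
    """Sum the total speech hours over rows that report them."""
    return sum(row[6] for row in rows if row[6] is not None)


if __name__ == "__main__":
    print("Inconsistent rows:", inconsistent_rows(ROWS))
    print("Hours in the sample rows:", total_hours(ROWS))
```

Rows with blank hour cells are skipped in the hours check and in the sum rather than counted as zero, so missing data does not silently lower the total.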

Developed & Maintained by:
LDC-IL, CIIL
Copyright © LDC-IL, Central Institute of Indian Languages
Central Institute of Indian Languages
Department of Higher Education
Ministry of Human Resource Development
Government of India
Manasagangothri, Hunsur Road, Mysore-570006, Karnataka, India.
Tel: (0821) 2515820 (Director)
Reception/PABX : (0821) 2345000
Fax: (0821) 2515032 (Off)