Skip to main content | Skip to Navigation | Text Size : | Language:

logo of Linguistic Data Consortium for Indian Languages (LDC-IL)
Syeda Mustafiza Tamim
Syeda Mustafiza Tamim | Official Website of Linguistic Data Consortium for Indian Languages
Academic Qualification

MA in Linguistics

Trained in
  • Leading Speech Segmentation and Data Annotation projects.

  • Proofreading, verifying, translating, and documenting Raw Speech and Text Corpora.

  • Co-authoring documentation for Raw Speech, Raw Text, and Annotated Corpora.

  • Primary author of Assamese Text-to-Speech Corpus Documentation.

  • Primary author of Assamese Sentence-Aligned Speech Corpus Documentation.

  • Actively engaged in Part-of-Speech (POS) Tagging initiatives.

  • Project Manager for Sentence-Level Annotation (SLAVAL) in the Assamese Project.

  • Project Manager for the Parallel Corpus Translation Project for Assamese and four other languages.

  • Language Expert and Project Manager for the Assamese Text-to-Speech Project.

Position held Junior Resource Person-I
Experience in research, training and documentation

Resource Person
LDC-IL, CIIL
Mysore, Karnataka
January 2020 – August 2022

Junior Resource Person- II
LDC-IL, CIIL
Mysore, Karnataka September 2022- Current

Presented/Participation in professional conferences/seminars/ workshops

• Workshop on Field Linguistics organized by Centre for Endangered Langauges, Tezpur University Assam from 2nd to 5th February,2018

• Workshop on Lexicography held by the department of EFL, and organized by Centre for Endangered Langauges, Tezpur University, Assam from 25th to 26th October,2018
• Attended lecture on “Parsing with Logic” by Prof. Gautam Sengupta, 12th August, 2020 - CALTS, University of Hyderabad.
• Attended “Basic Python” Workshop of 15 days in CIIL Organized by KM Institute of Hindi &Linguistics Collaborating with LDC-IL, CIIL .
• Attended Ntional seminar titled “Language Resources and Artificial Intelligence in Indian Language”, 15th June, 2021, LDC-IL, CIIL, Mysore.
• Participated “Summer school in Computational Linguistics” Organized by LDC-IL, CIIL, Mysore from 18th June to 3rd July 2023.
• Participated in "AI Benchmarking Conference" Organized by LDC-IL, CIIL, Mysore from 20th March to 21st March 2025.

Publications

• Main Authorship in Assamese Text to Speech Corpus, Published by Central Institute of Indian Languages, Mysore, Karnataka, ISBN - 978-93-48633-45-3.
• Main Authorship in Assamese Sentence Aligned Speech Corpus, Published by Central Institute of Indian Languages, Mysore, Karnataka, ISBN - 978-81-19411-34-4.
• Co-Authored in Assamese Raw Speech Corpus, Published by Central Institute of Indian Languages, Mysore, Karnataka, ISBN - 978-81-948885-5-0
• Co-Authored in Assamese Raw Text Corpus, Published by Central Institute of Indian Languages, Mysore, Karnataka, ISBN - 978-81-948885-4-3

Mother tongue Assamese
Other Languages known Assamese, Bengali, Hindi, English