Advancing Tibetan Text-to-Speech: Challenges and Innovations
- Title
- Advancing Tibetan Text-to-Speech: Challenges and Innovations
- Creator
- Choedon, Tenzin; Nagaraju, Shamanth; Rekha, V.
- Description
- This initiative aims to develop a platform for Tibetan Text-to-Speech (TTS) technology, addressing the significant demand for this technology for the Tibetan language. The main objective of this project is to create a system that is capable of converting text into natural and good quality speech. Through the compilation of Tibetan text-audio datasets, the project meets the increasing demand for technology that preserves oral traditions and allows Tibetans to communicate with other people interested in the language. The process includes the gathering of varied Tibetan text and audio samples, such as news articles, followed by processing of data through cleaning processes and statistical analysis. A benchmark dataset is created to enable the testing of models. The lack of certain resources for Tibetan TTS is addressed by the development of pre-trained machine learning models specific to acoustic modeling, using the adapted FastPitch model for waveform synthesis through the HiFi-GAN vocoder. The existing models were?further trained utilizing features particular to Tibetan phonetics and tonalities. The TTS approach is a key strategy for improving digital accessibility for Tibetan speakers and for safeguarding their cultural heritage; it finds applications in media, education, and communication, thus helping to preserve the Tibetan language in the digital era. The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2026.
- Source
- Lecture Notes in Networks and Systems;Volume;1737 LNNS;pp.182-192
- Date
- 01-01-2026
- Publisher
- Springer Science and Business Media Deutschland GmbH
- Subject
- FastPitch; Fine-tuned machine learning models; HiFi-GAN; Phonetic adaptation; Text to audio processing; Tibetan Text-to-Speech
- Coverage
- Choedon T., Department of Computer Science and Engineering, School of Technology and Engineering, Christ University, Bangalore, India; Nagaraju S., Christ University, Bangalore, India; Rekha V., Christ University, Bangalore, India
- Rights
- Restricted Access; Hardcopy may be available in the library
- Relation
- ISSN: 23673370; ISBN: 978-981955081-4;
- Format
- online
- Language
- English
- Type
- Conference paper
Collection
Citation
Choedon, Tenzin; Nagaraju, Shamanth; Rekha, V., “Advancing Tibetan Text-to-Speech: Challenges and Innovations,” CHRIST (Deemed To Be University) Institutional Repository, accessed June 18, 2026, https://archives.christuniversity.in/items/show/25445.
