Adibian / Persian-MultiSpeaker-Tacotron2View external linksLinks
Implementation of Transfer Learning from Speaker Verification to Multi-speaker Text-To-Speech Synthesis (SV2TTS) in Persian language.
☆13Oct 2, 2025Updated 4 months ago
Alternatives and similar repositories for Persian-MultiSpeaker-Tacotron2
Users that are interested in Persian-MultiSpeaker-Tacotron2 are comparing it to the libraries listed below
Sorting:
- Neural text to speech system that uses eSpeak as a text/phoneme front-end☆16Oct 20, 2021Updated 4 years ago
- A Grapheme to Phoneme model using LSTM implemented in pytorch☆13Jul 6, 2022Updated 3 years ago
- TTS for Singlish using Tacotron2, the IMDA corpus, and Pachyderm.☆11Jan 11, 2020Updated 6 years ago
- steps to perform text-based speaker diarization with kaldi toolkit☆12Nov 2, 2018Updated 7 years ago
- A simple command line tool to calculate WER for ASR.☆14Oct 14, 2024Updated last year
- Tihu dictionary for Persian language☆12Sep 8, 2019Updated 6 years ago
- DUSTED: Spoken-Term Discovery using Discrete Speech Units☆18Oct 2, 2024Updated last year
- SpeechPlus: Small LLM-Based Text-to-Speech Library 🚀☆20May 20, 2025Updated 8 months ago
- ☆30Jan 22, 2026Updated 3 weeks ago
- ManaTTS is the largest open Persian speech dataset with 114+ hours of transcribed audio. Includes data collection pipeline and tools. Sui…☆48Jul 12, 2025Updated 7 months ago
- Sisyphus recipies for ASR☆18Feb 9, 2026Updated last week
- Zero-shot voice cloning text-to-speech (TTS) with explicit emotion class conditioning built on F5-TTS☆28Jan 9, 2026Updated last month
- ☆17Apr 14, 2023Updated 2 years ago
- [EMNLP 2025 Findings] Official code for EZ-VC: Easy Zero-shot Any-to-Any Voice Conversion☆33Sep 9, 2025Updated 5 months ago
- Persian text-to-speech streamlit interface☆45Dec 9, 2024Updated last year
- ☆47Dec 9, 2023Updated 2 years ago
- General tools for voice analysis.☆25Jul 30, 2025Updated 6 months ago
- Multivoice: Enhance your foreign-language movie and TV show experience with personalized dubbed versions. Our project uses voice cloning …☆27Aug 1, 2023Updated 2 years ago
- ☆134Jul 26, 2018Updated 7 years ago
- pytorch implementation for MultiSpeech: Multi-Speaker Text to Speech with Transformer paper☆21Jun 23, 2022Updated 3 years ago
- System that ranks 2nd in DCASE 2022 Challenge Task 5: Few-shot Bioacoustic Event Detection☆28Jul 6, 2022Updated 3 years ago
- EfficientNet-Absolute Zero for Continuous Speech Keyword Spotting☆23Jun 16, 2022Updated 3 years ago
- ☆25Mar 6, 2024Updated last year
- ☆27Aug 10, 2024Updated last year
- A tool for translating Persian text to IPA (International Phonetic Alphabet).☆71Aug 26, 2022Updated 3 years ago
- Colab notebooks for Next-gen Kaldi☆29Oct 12, 2025Updated 4 months ago
- ☆26Nov 22, 2022Updated 3 years ago
- ☆26Dec 9, 2022Updated 3 years ago
- canvas-based talking head model using viseme data☆32Sep 4, 2023Updated 2 years ago
- ☆27Jan 19, 2021Updated 5 years ago
- A collection of inspiring lists, repos, datasets, models, tools and more for Persian language speech to text(stt) and text to speech(tts)…☆87Dec 9, 2024Updated last year
- Add Arabic diacritics (tashkeel/harakat) using Rust/Python/C++/WASM and NLP models☆45Oct 4, 2025Updated 4 months ago
- a kws demo on android☆40May 28, 2024Updated last year
- List of repositories relevant to VITS.☆36Feb 26, 2023Updated 2 years ago
- implementation of "EFFICIENT KEYWORD SPOTTING USING DILATED CONVOLUTIONS AND GATING"☆36Dec 8, 2019Updated 6 years ago
- Properly handle position-dependent phones in a subword lexicon FST☆31Oct 26, 2020Updated 5 years ago
- Sharif Emotional Speech Database☆39Jan 9, 2021Updated 5 years ago
- One-shot TTS with Improved Unseen Speaker and Style Transfer☆37Mar 2, 2022Updated 3 years ago
- Aiming to achieve ultimate Multilingual TTS pipeline with main focus on releasing COQUI🐸TTS(Text-to-Speech) based high performing neural…☆42Aug 24, 2023Updated 2 years ago