Adibian/Persian-MultiSpeaker-Tacotron2

Implementation of Transfer Learning from Speaker Verification to Multi-speaker Text-To-Speech Synthesis (SV2TTS) for the Persian language.

☆13

Alternatives and similar repositories for Persian-MultiSpeaker-Tacotron2

Users who are interested in Persian-MultiSpeaker-Tacotron2 are comparing it to the libraries listed below.

talhanai / kaldi-diar-latte
steps to perform text-based speaker diarization with kaldi toolkit
☆12 · Nov 2, 2018 · Updated 7 years ago
pkufool / simple-wer
A simple command line tool to calculate WER for ASR.
☆14 · Updated this week
bshall / dusted
DUSTED: Spoken-Term Discovery using Discrete Speech Units
☆17 · Oct 2, 2024 · Updated last year
sil-ai / tts-singlish
TTS for Singlish using Tacotron2, the IMDA corpus, and Pachyderm.
☆11 · Jan 11, 2020 · Updated 6 years ago
M-Taghizadeh / Persian_Question_Answering_Voice2Voice_AI
This repository hosts BonyadAI, a Persian question answering AI Model. We developed an initial web crawler and scraper to gather the data…
☆12 · Jul 7, 2024 · Updated 2 years ago
alumae / streaming-punctuator
☆17 · Apr 14, 2023 · Updated 3 years ago
rhasspy / glow-speak
Neural text to speech system that uses eSpeak as a text/phoneme front-end
☆16 · Oct 20, 2021 · Updated 4 years ago
tihu-nlp / tihudict
Tihu dictionary for Persian language
☆13 · Sep 8, 2019 · Updated 6 years ago
de-mh / g2p_fa
A Grapheme to Phoneme model using LSTM implemented in pytorch
☆14 · Jul 6, 2022 · Updated 4 years ago
savariamir / Finity
Finity is a .NET Core resilience and Fault tolerance library that allows developers to extend IHttpClientFactory such as Retry, Circuit …
☆20 · Dec 26, 2022 · Updated 3 years ago
rwth-i6 / i6_core
Sisyphus recipes for ASR
☆19 · Jul 17, 2026 · Updated last week
choiHkk / Transformer-TTS-V2
☆25 · Mar 6, 2024 · Updated 2 years ago
SadeghKrmi / pertts-streamlit
Persian text-to-speech streamlit interface
☆49 · Dec 9, 2024 · Updated last year
haraai / ParsiNorm
☆46 · Dec 9, 2023 · Updated 2 years ago
haoheliu / DCASE_2022_Task_5
System that ranks 2nd in DCASE 2022 Challenge Task 5: Few-shot Bioacoustic Event Detection
☆28 · Jul 6, 2022 · Updated 4 years ago
kyutai-labs / tts_longeval
☆30 · Apr 29, 2026 · Updated 3 months ago
thecmdrunner / remotion-gtts-template
Remotion text-to-speech template using Google Cloud and Firebase
☆18 · Feb 20, 2026 · Updated 5 months ago
m-wiesner / nnet_pytorch
Kaldi style neural network training in pytorch for use in place of nnet3 in Kaldi.
☆26 · Jul 25, 2024 · Updated 2 years ago
filipezabala / voice
General tools for voice analysis.
☆25 · May 13, 2026 · Updated 2 months ago
audiolabs / PESQ
PESQ (Perceptual Evaluation of Speech Quality) Wrapper for Python Users (narrow band and wide band) - including P.862 Corrigendum 2 (03/…
☆23 · May 27, 2025 · Updated last year
jeremybytes / csharp-channels-presentation
Code samples, slides, and links for "Better Parallel Code with C# Channels"
☆32 · Sep 6, 2024 · Updated last year
msalhab96 / MultiSpeech
pytorch implementation for MultiSpeech: Multi-Speaker Text to Speech with Transformer paper
☆21 · Jun 23, 2022 · Updated 4 years ago
csun22 / LibriVoc-Dataset
LibriVoc is a new open-source, large-scale dataset for vocoder artifact detection. LibriVoc is derived from the LibriTTS speech corpus, w…
☆16 · Nov 6, 2025 · Updated 8 months ago
pnkvalavala / multivoice
Multivoice: Enhance your foreign-language movie and TV show experience with personalized dubbed versions. Our project uses voice cloning …
☆27 · Aug 1, 2023 · Updated 2 years ago
Kagnite / Kernel-Ruler
☆18 · Aug 23, 2025 · Updated 11 months ago
CMsmartvoice / Unet-TTS
One-shot TTS with Improved Unseen Speaker and Style Transfer
☆37 · Mar 2, 2022 · Updated 4 years ago
AlisterTA / Persian-text-to-speech
☆132 · Jul 26, 2018 · Updated 8 years ago
gunnxx / indonesian-mt-data
Benchmarking Multidomain English-Indonesian Machine Translation
☆16 · Dec 19, 2020 · Updated 5 years ago
rhasspy / fa_kaldi-rhasspy
Persian Kaldi profile for Rhasspy built from open speech data
☆17 · Oct 13, 2021 · Updated 4 years ago
de-mh / persian_phonemizer
A tool for translating Persian text to IPA (International Phonetic Alphabet).
☆73 · Aug 26, 2022 · Updated 3 years ago
34j / awesome-vits
List of repositories relevant to VITS.
☆36 · Feb 26, 2023 · Updated 3 years ago
zhu-han / SpeechLLM
LLM-based ASR recipe with Zipformer encoder and Qwen LLM
☆35 · Sep 25, 2025 · Updated 10 months ago
PacktPublishing / Microservices-Design-Patterns-in-.NET---Second-Edition
Microservices Design Patterns in .NET - Second Edition, published by Packt
☆33 · Apr 30, 2026 · Updated 2 months ago
MahtaFetrat / ManaTTS-Persian-Speech-Dataset
ManaTTS is the largest open Persian speech dataset with 114+ hours of transcribed audio. Includes data collection pipeline and tools. Sui…
☆54 · Jul 12, 2025 · Updated last year
robin1001 / kws_on_android
a kws demo on android
☆40 · May 28, 2024 · Updated 2 years ago
rrkarim / unbounded-cache-lm
Unbounded cache model for online language modeling with open vocabulary
☆11 · Feb 15, 2019 · Updated 7 years ago
mdsecactivebreach / morphHTA
morphHTA - Morphing Cobalt Strike's evil.HTA
☆11 · Jun 3, 2017 · Updated 9 years ago
jetfontanilla / canvas-talking-head-model
canvas-based talking head model using viseme data
☆32 · Sep 4, 2023 · Updated 2 years ago
jongalloway / dotnet-mcp
MCP wrapper for the .NET SDK
☆29 · Updated this week