Sangramsingkayte / Speech

Text-to-Speech Recipe: Text-to-speech (TTS), also referred to as speech synthesis, lets users generate speech signals from input text. SpeechBrain supports popular TTS and vocoder models, such as Tacotron 2 and HiFi-GAN.

☆19

Alternatives and similar repositories for Speech:

Users who are interested in Speech are comparing it to the libraries listed below.

muhammad-ahmed-ghani / svoice_demo
A PyTorch demo of the paper "Voice Separation with an Unknown Number of Multiple Speakers", using Gradio and the NVIDIA NeMo ASR model.
☆36 Updated last year
clement-pages / gryannote
Provide Gradio custom components to make the diarization-based audio labeling process easier and faster.
☆60 Updated 2 weeks ago
roatienza / efficientspeech
PyTorch code implementation of EfficientSpeech - to be presented at ICASSP 2023.
☆166 Updated last year
neonbjb / tts-scores
Scripts for computing the Intelligibility and CLVP scores for evaluating TTS models
☆152 Updated last year
dunky11 / voicesmith
[WIP] VoiceSmith makes training text to speech models easy.
☆224 Updated 2 years ago
anhnh2002 / XTTSv2-Finetuning-for-New-Languages
☆125 Updated 3 months ago
coqui-ai / Trainer
🐸 - A general purpose model trainer, as flexible as it gets
☆211 Updated last year
Edresson / VoiceSplit
VoiceSplit: Targeted Voice Separation by Speaker-Conditioned Spectrogram
☆244 Updated 8 months ago
hubertsiuzdak / voice-conversion
Voice conversion using deep adversarial learning
☆16 Updated 3 years ago
unilight / seq2seq-vc
A sequence-to-sequence voice conversion toolkit.
☆96 Updated 8 months ago
rishikksh20 / FastSpeech2
PyTorch Implementation of FastSpeech 2: Fast and High-Quality End-to-End Text to Speech
☆231 Updated 2 years ago
miguelvalente / whisperer
Go from raw audio files to a text-audio dataset automatically with OpenAI's Whisper.
☆135 Updated last year
rishikksh20 / TalkNet2-pytorch
TalkNet 2: Non-Autoregressive Depth-Wise Separable Convolutional Model for Speech Synthesis with Explicit Pitch and Duration Prediction.
☆88 Updated 3 years ago
rishikksh20 / AdaSpeech
AdaSpeech: Adaptive Text to Speech for Custom Voice
☆157 Updated 3 years ago
ebadawy / voice_conversion
☆130 Updated last year
Mastering-Python-GT / Transcription-diarization-whisper-pyannote
Transcription and diarization (speaker identification)
☆31 Updated last year
keonlee9420 / Expressive-FastSpeech2
PyTorch Implementation of Non-autoregressive Expressive (emotional, conversational) TTS based on FastSpeech2, supporting English, Korean,…
☆297 Updated 3 years ago
haoheliu / voicefixer_main
General Speech Restoration
☆276 Updated last year
yl4579 / StyleTTS-VC
Official Implementation of StyleTTS-VC
☆177 Updated 2 months ago
cvqluu / simple_diarizer
Simplified diarization pipeline using some pretrained models - audio file to diarized segments in a few lines of code
☆146 Updated 10 months ago
KoljaB / WhoSpeaks
Efficient approach to speaker diarization using voice characteristics extraction
☆92 Updated 11 months ago
152334H / DL-Art-School
TorToiSe fine-tuning with DLAS
☆218 Updated 7 months ago
JarodMica / tortoise_dataset_tools
Misc. tools/scripts that I made to use for tortoise
☆21 Updated 7 months ago
ZDisket / TensorVox
Desktop application for neural speech synthesis written in C++
☆214 Updated 2 years ago
gokulkarthik / text2speech
Towards Building Text-To-Speech Systems for the Next Billion Users - Microsoft Research Intern Work - Accepted at ICASSP 2023
☆51 Updated last year
keonlee9420 / Cross-Speaker-Emotion-Transfer
PyTorch Implementation of ByteDance's Cross-speaker Emotion Transfer Based on Speaker Condition Layer Normalization and Semi-Supervised T…
☆192 Updated 2 years ago
zafarrafii / REPET-Python
REPeating Pattern Extraction Technique (REPET) in Python for audio source separation: original REPET, REPET extended, adaptive REPET, REP…
☆33 Updated last year
rioharper / VocalForge
Your one-stop solution for voice dataset creation
☆118 Updated last year
keonlee9420 / Comprehensive-Tacotron2
PyTorch Implementation of Google's Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions. This implementation supp…
☆48 Updated last year
yerfor / SyntaSpeech
SyntaSpeech: Syntax-aware Generative Adversarial Text-to-Speech; IJCAI 2022; Official code
☆199 Updated 2 years ago