Sangramsingkayte / Speech
Text-to-Speech Recipe Users can create speech signals from an input text by using text-to-speech (TTS), also referred to as speech synthesis. Popular TTS and Vocoder models, such as Tacotron 2, are supported by SpeechBrain (e.g, HiFIGAN).
☆19Updated 3 months ago
Alternatives and similar repositories for Speech:
Users that are interested in Speech are comparing it to the libraries listed below
- A PyTorch demo of the paper Voice Separation with an Unknown Number of Multiple Speakers using gradio and Nvidia NEMO ASR model.☆36Updated last year
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster.☆60Updated 2 weeks ago
- PyTorch code implementation of EfficientSpeech - to be presented at ICASSP2023.☆166Updated last year
- Scripts for computing the Intelligibility and CLVP scores for evaluating TTS models☆152Updated last year
- [WIP] VoiceSmith makes training text to speech models easy.☆224Updated 2 years ago
- ☆125Updated 3 months ago
- 🐸 - A general purpose model trainer, as flexible as it gets☆211Updated last year
- VoiceSplit: Targeted Voice Separation by Speaker-Conditioned Spectrogram☆244Updated 8 months ago
- Voice conversion using deep adversarial learning☆16Updated 3 years ago
- A sequence-to-sequence voice conversion toolkit.☆96Updated 8 months ago
- PyTorch Implementation of FastSpeech 2 : Fast and High-Quality End-to-End Text to Speech☆231Updated 2 years ago
- Go from raw audio files to a text-audio dataset automatically with OpenAI's Whisper.☆135Updated last year
- TalkNet 2: Non-Autoregressive Depth-Wise Separable Convolutional Model for Speech Synthesis with Explicit Pitch and Duration Prediction.☆88Updated 3 years ago
- AdaSpeech: Adaptive Text to Speech for Custom Voice☆157Updated 3 years ago
- ☆130Updated last year
- Transcription and diarization (speaker identification)☆31Updated last year
- PyTorch Implementation of Non-autoregressive Expressive (emotional, conversational) TTS based on FastSpeech2, supporting English, Korean,…☆297Updated 3 years ago
- General Speech Restoration☆276Updated last year
- Official Implementation of StyleTTS-VC☆177Updated 2 months ago
- Simplified diarization pipeline using some pretrained models - audio file to diarized segments in a few lines of code☆146Updated 10 months ago
- Efficient approach to speaker diarization using voice characteristics extraction☆92Updated 11 months ago
- TorToiSe fine-tuning with DLAS☆218Updated 7 months ago
- Misc. tools/scripts that I made to use for tortoise☆21Updated 7 months ago
- Desktop application for neural speech synthesis written in C++☆214Updated 2 years ago
- Towards Building Text-To-Speech Systems for the Next Billion Users - Microsoft Research Intern Work - Accepted at ICASSP 2023☆51Updated last year
- PyTorch Implementation of ByteDance's Cross-speaker Emotion Transfer Based on Speaker Condition Layer Normalization and Semi-Supervised T…☆192Updated 2 years ago
- REPeating Pattern Extraction Technique (REPET) in Python for audio source separation: original REPET, REPET extended, adaptive REPET, REP…☆33Updated last year
- Your one-stop solution for voice dataset creation☆118Updated last year
- PyTorch Implementation of Google's Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions. This implementation supp…☆48Updated last year
- SyntaSpeech: Syntax-aware Generative Adversarial Text-to-Speech; IJCAI 2022; Official code☆199Updated 2 years ago