resemble-ai / ResemblyzerLinks
A python package to analyze and compare voices with deep learning
β3,026Updated last year
Alternatives and similar repositories for Resemblyzer
Users that are interested in Resemblyzer are comparing it to the libraries listed below
Sorting:
- WaveRNN Vocoder + TTSβ2,166Updated 3 years ago
- π A list of accessible speech corpora for ASR, TTS, and other Speech Technologiesβ1,341Updated last year
- An implementation of Tacotron 2 that supports multilingual experiments with parameter-sharing, code-switching, and voice cloning.β838Updated last year
- HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesisβ2,171Updated 11 months ago
- π A comprehensive list of open-source datasets for voice and sound computing (95+ datasets).β1,966Updated last year
- π€π¬ Transformer TTS: Implementation of a non-autoregressive Transformer based neural network for text to speech.β1,149Updated last year
- A Flow-based Generative Network for Speech Synthesisβ2,331Updated last year
- A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.β1,769Updated 8 months ago
- Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN & HiFi-GAN & StyleMelGAN) with Pytorchβ1,609Updated last year
- A TensorFlow implementation of Google's Tacotron speech synthesis with pre-trained model (unofficial)β2,980Updated 2 years ago
- DeepMind's Tacotron-2 Tensorflow implementationβ2,309Updated 2 years ago
- Real Time Speech Enhancement in the Waveform Domain (Interspeech 2020)We provide a PyTorch implementation of the paper Real Time Speech Eβ¦β1,799Updated 2 years ago
- Tacotron 2 - PyTorch implementation with faster-than-realtime inferenceβ5,247Updated last year
- WaveNet vocoderβ2,359Updated last year
- Unofficial PyTorch implementation of Google AI's VoiceFilter systemβ1,144Updated 11 months ago
- Flowtron is an auto-regressive flow-based generative network for text to speech synthesis with control over speech variation and style trβ¦β899Updated 2 years ago
- PyTorch implementation of convolutional neural networks-based text-to-speech synthesis modelsβ1,980Updated last year
- πΈSTT - The deep learning toolkit for Speech-to-Text. Training and deploying STT models has never been so easy.β2,465Updated last year
- The PyTorch-based audio source separation toolkit for researchersβ2,414Updated 6 months ago
- An implementation of Microsoft's "FastSpeech 2: Fast and High-Quality End-to-End Text to Speech"β2,055Updated last year
- YourTTS: Towards Zero-Shot Multi-Speaker TTS and Zero-Shot Voice Conversion for everyoneβ992Updated 8 months ago
- TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for Tensorflow 2 (supported including English, French, Korean, Chinese, Germaβ¦β3,949Updated last year
- We provide a PyTorch implementation of the paper Voice Separation with an Unknown Number of Multiple Speakers In which, we present a new β¦β1,302Updated last year
- Mellotron: a multispeaker voice synthesis model based on Tacotron 2 GST that can make a voice emote and sing without emotive or singing tβ¦β861Updated last year
- AutoVC: Zero-Shot Voice Style Transfer with Only Autoencoder Lossβ1,068Updated 8 months ago
- A Python/Pytorch app for easily synthesising human voicesβ1,445Updated 7 months ago
- πΈ collection of TTS papersβ703Updated last year
- Command line utility for forced alignment using Kaldiβ1,525Updated this week
- GAN-based Mel-Spectrogram Inversion Network for Text-to-Speech Synthesisβ1,013Updated last year
- A TensorFlow Implementation of DC-TTS: yet another text-to-speech modelβ1,162Updated 2 years ago