resemble-ai / ResemblyzerLinks
A python package to analyze and compare voices with deep learning
β3,088Updated last year
Alternatives and similar repositories for Resemblyzer
Users that are interested in Resemblyzer are comparing it to the libraries listed below
Sorting:
- An implementation of Tacotron 2 that supports multilingual experiments with parameter-sharing, code-switching, and voice cloning.β838Updated last year
- π A comprehensive list of open-source datasets for voice and sound computing (95+ datasets).β2,010Updated last year
- WaveRNN Vocoder + TTSβ2,165Updated 3 years ago
- π€π¬ Transformer TTS: Implementation of a non-autoregressive Transformer based neural network for text to speech.β1,152Updated last year
- π A list of accessible speech corpora for ASR, TTS, and other Speech Technologiesβ1,357Updated last year
- HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesisβ2,221Updated last year
- TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for Tensorflow 2 (supported including English, French, Korean, Chinese, Germaβ¦β3,974Updated last year
- Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN & HiFi-GAN & StyleMelGAN) with Pytorchβ1,620Updated last year
- DeepMind's Tacotron-2 Tensorflow implementationβ2,313Updated 2 years ago
- YourTTS: Towards Zero-Shot Multi-Speaker TTS and Zero-Shot Voice Conversion for everyoneβ1,021Updated 10 months ago
- A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.β1,797Updated last month
- We provide a PyTorch implementation of the paper Voice Separation with an Unknown Number of Multiple Speakers In which, we present a new β¦β1,307Updated last year
- Real Time Speech Enhancement in the Waveform Domain (Interspeech 2020)We provide a PyTorch implementation of the paper Real Time Speech Eβ¦β1,820Updated 2 years ago
- Mellotron: a multispeaker voice synthesis model based on Tacotron 2 GST that can make a voice emote and sing without emotive or singing tβ¦β860Updated 2 years ago
- πΈ collection of TTS papersβ715Updated last year
- A TensorFlow implementation of Google's Tacotron speech synthesis with pre-trained model (unofficial)β2,985Updated 2 years ago
- An implementation of Microsoft's "FastSpeech 2: Fast and High-Quality End-to-End Text to Speech"β2,090Updated last year
- Flowtron is an auto-regressive flow-based generative network for text to speech synthesis with control over speech variation and style trβ¦β898Updated 2 years ago
- WaveNet vocoderβ2,366Updated 2 years ago
- The PyTorch-based audio source separation toolkit for researchersβ2,453Updated last month
- AutoVC: Zero-Shot Voice Style Transfer with Only Autoencoder Lossβ1,078Updated 10 months ago
- A Flow-based Generative Network for Speech Synthesisβ2,332Updated last year
- Simple text to phones converter for multiple languagesβ1,457Updated 11 months ago
- A TensorFlow Implementation of DC-TTS: yet another text-to-speech modelβ1,162Updated 2 years ago
- A Generative Flow for Text-to-Speech via Monotonic Alignment Searchβ698Updated 3 years ago
- Command line utility for forced alignment using Kaldiβ1,601Updated last week
- Tacotron 2 - PyTorch implementation with faster-than-realtime inferenceβ5,271Updated last year
- PyTorch implementation of convolutional neural networks-based text-to-speech synthesis modelsβ1,980Updated last year
- TensorFlowASR: Almost State-of-the-art Automatic Speech Recognition in Tensorflow 2. Supported languages that can use characters or subwβ¦β995Updated 3 months ago
- CNN-based audio segmentation toolkit. Allows to detect speech, music, noise and speaker gender. Has been designed for large scale gender β¦β836Updated 8 months ago