vincenzo-scotti / ITAcotron_2
Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)
☆10Updated 2 years ago
Alternatives and similar repositories for ITAcotron_2:
Users who are interested in ITAcotron_2 are comparing it to the libraries listed below:
- Tooling for producing an Italian model (public release available) for DeepSpeech and text corpus☆93Updated 2 years ago
- An implementation of the Contrast Predictive Coding (CPC) method to train audio features in an unsupervised fashion.☆356Updated 3 years ago
- DiffWave is a fast, high-quality neural vocoder and waveform synthesizer.☆805Updated 10 months ago
- These are Praat scripts I use in my research, implemented in Parselmouth for Python for use in Binder☆127Updated 3 years ago
- A PyTorch implementation of "Neural Speech Synthesis with Transformer Network"☆663Updated last year
- This repository contains PyTorch implementations of 4 different models for classifying emotions in speech.☆197Updated 2 years ago
- Audio transformations library for PyTorch☆230Updated 2 years ago
- Implementation of WaveGrad high-fidelity vocoder from Google Brain in PyTorch.☆403Updated 3 years ago
- Problem Agnostic Speech Encoder☆440Updated last year
- This repository contains the SpeechBrain Benchmarks☆107Updated this week
- HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis☆2,026Updated 6 months ago
- Keras (tensorflow) implementation of SincNet (Mirco Ravanelli, Yoshua Bengio - https://github.com/mravanelli/SincNet)☆72Updated 3 years ago
- A library for speech data augmentation in time-domain☆653Updated 3 years ago
- BYOL for Audio: Self-Supervised Learning for General-Purpose Audio Representation☆207Updated last year
- Variational auto-encoders for audio☆117Updated 4 years ago
- A PyTorch implementation of DNN-based source separation.☆291Updated 2 years ago
- PyTorch Implementation of Non-autoregressive Expressive (emotional, conversational) TTS based on FastSpeech2, supporting English, Korean,…☆295Updated 3 years ago
- Official PyTorch Implementation of CleanUNet (ICASSP 2022)☆302Updated last year
- ☆356Updated 10 months ago
- ☆44Updated 2 years ago
- ☆29Updated last year
- Live speech recognition using Facebook's wav2vec 2.0 model.☆339Updated 11 months ago
- Wav2Keyword is a keyword spotting (KWS) model based on Wav2Vec 2.0. This model shows state-of-the-art results on the Speech Commands dataset V1 and V2.☆103Updated 2 years ago
- A merged version of multiple open-source German speech datasets.☆31Updated 8 months ago
- Score-based Generative Models (Diffusion Models) for Speech Enhancement and Dereverberation☆560Updated this week
- ☆77Updated 2 years ago
- Signal-to-noise ratio in Python☆57Updated 5 months ago
- Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN & HiFi-GAN & StyleMelGAN) with PyTorch☆1,588Updated 9 months ago
- spafe: Simplified Python Audio Features Extraction☆464Updated 7 months ago
- Torch implementation of Soft-DTW, supports CUDA.☆36Updated last year