vincenzo-scotti / ITAcotron_2
Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)
☆10Updated 2 years ago
Related projects ⓘ
Alternatives and complementary repositories for ITAcotron_2
- This repository contains the SpeechBrain Benchmarks☆103Updated 2 weeks ago
- SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition☆70Updated 4 years ago
- This repository contains PyTorch implementation of 4 different models for classification of emotions of the speech.☆191Updated 2 years ago
- Code for the AAAI 2022 paper "SSAST: Self-Supervised Audio Spectrogram Transformer".☆364Updated 2 years ago
- Tooling for producing Italian model (public release available) for DeepSpeech and text corpus☆93Updated 2 years ago
- An implementation of the Contrast Predictive Coding (CPC) method to train audio features in an unsupervised fashion.☆349Updated 3 years ago
- ☆28Updated 11 months ago
- WaveNet with TensorFlow 2.0☆23Updated 4 years ago
- Neural HMMs are all you need (for high-quality attention-free TTS)☆156Updated 3 weeks ago
- ☆184Updated 6 months ago
- Official implementation of INTERSPEECH 2021 paper 'Emotion Recognition from Speech Using Wav2vec 2.0 Embeddings'☆126Updated 2 years ago
- Wav2Vec for speech recognition, classification, and audio classification☆249Updated 2 years ago
- Pytorch implementation of conformer with with training script for end-to-end speech recognition on the LibriSpeech dataset.☆25Updated 6 months ago
- Conditional Diffusion Probabilistic Model for Speech Enhancement☆213Updated last year
- Code for voicing silent speech from EMG. Official repository for the papers "Digital Voicing of Silent Speech" at EMNLP 2020 and "An Imp…☆119Updated 6 months ago
- Reproducability code for "Characterizing soundscapes across diverse ecosystems using a universal acoustic feature set" (Sethi et. al. 202…☆12Updated 4 years ago
- Audio transformations library for PyTorch☆226Updated 2 years ago
- My-Voice Analysis is a Python library for the analysis of voice (simultaneous speech, high entropy) without the need of a transcription. …☆298Updated 3 years ago
- A library for speech data augmentation in time-domain☆647Updated 3 years ago
- Foreign Accent Conversion by Synthesizing Speech from Phonetic Posteriorgrams (Interspeech'19)☆138Updated last year
- Keras (tensorflow) implementation of SincNet (Mirco Ravanelli, Yoshua Bengio - https://github.com/mravanelli/SincNet)☆72Updated 3 years ago
- These are praat scripts I use in my research, implemented in parselmouth for python for use in binder☆124Updated 3 years ago
- A summary on our attempts at using Deep Learning approaches for Emotional Text to Speech☆429Updated 4 months ago
- Implementation of the paper "wav2vec 2.0: A Framework for Self-Supervised Learning of Speech Representations" in Pytorch.☆34Updated last year
- Embed media in a 2D scatter plot.☆15Updated 4 years ago
- Matlab tools for pathological voice analysis☆12Updated last year
- PyTorch implementation of VQ-VAE + WaveNet by [Chorowski et al., 2019] and VQ-VAE on speech signals by [van den Oord et al., 2017]☆263Updated 5 years ago
- Audio feature extraction and multi-classification with the ECS-10 data set☆21Updated 6 years ago
- Torch implementation of Soft-DTW, supports CUDA.☆36Updated last year
- A unified dataset of multilingual emotional human utterances☆23Updated 2 years ago