vincenzo-scotti / ITAcotron_2
Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)
☆10Updated 2 years ago
Related projects ⓘ
Alternatives and complementary repositories for ITAcotron_2
- Tooling for producing Italian model (public release available) for DeepSpeech and text corpus☆93Updated 2 years ago
- This repository contains PyTorch implementation of 4 different models for classification of emotions of the speech.☆191Updated 2 years ago
- Implementation of the paper "wav2vec 2.0: A Framework for Self-Supervised Learning of Speech Representations" in Pytorch.☆32Updated last year
- Wav2Vec for speech recognition, classification, and audio classification☆249Updated 2 years ago
- PyTorch Implementation of Non-autoregressive Expressive (emotional, conversational) TTS based on FastSpeech2, supporting English, Korean,…☆280Updated 3 years ago
- A collection of datasets for the purpose of emotion recognition/detection in speech.☆290Updated last month
- Official implementation of INTERSPEECH 2021 paper 'Emotion Recognition from Speech Using Wav2vec 2.0 Embeddings'☆126Updated 2 years ago
- Official PyTorch Implementation of CleanUNet (ICASSP 2022)☆294Updated last year
- Speech Emotion Recognition using transfer learning with wav2vec on IEMOCAP.☆15Updated 3 years ago
- A unified dataset of multilingual emotional human utterances☆22Updated 2 years ago
- Transformer-based model for Speech Emotion Recognition(SER) - implemented by Pytorch☆31Updated 6 months ago
- An implementation of SoftDTW for PyTorch.☆216Updated 4 years ago
- An implementation of the Contrast Predictive Coding (CPC) method to train audio features in an unsupervised fashion.☆348Updated 3 years ago
- Mixture density network implemented in PyTorch.☆130Updated last year
- Thorsten-Voice: A free to use, offline working, high quality german TTS voice should be available for every project without any license s…☆547Updated 3 weeks ago
- Wav2Keyword is keyword spotting(KWS) based on Wav2Vec 2.0. This model shows state-of-the-art in Speech commands dataset V1 and V2.☆99Updated last year
- Supervised Speech Representation Learning for Parkinson's Disease Classification☆11Updated 3 years ago
- Simple, straight-forward extraction of acoustic and prosodic features from sound waves based on Praat and Parselmouth.☆20Updated 5 years ago
- HiFi-GAN: High Fidelity Denoising and Dereverberation Based on Speech Deep Features in Adversarial Networks☆208Updated 3 years ago
- Variational Bayes HMM over x-vectors diarization☆252Updated 9 months ago
- Speaker identification/verification models for Machine Learning for Computer Vision class at UNIBO☆58Updated 2 years ago
- Speech Emotion Recognition (SER) in real-time, using Deep Neural Networks (DNN) of Long Short Memory Term (LSTM).☆91Updated 2 years ago
- [RAVDESS] Speech Emotion Recognition with Convolutional Attention based Bi-GRU. (Best test accuracy of 87%)☆23Updated last year
- ☆20Updated 5 months ago
- These are praat scripts I use in my research, implemented in parselmouth for python for use in binder☆124Updated 3 years ago
- Speech Emotion Recognition from raw speech signals using 1D CNN-LSTM☆102Updated 3 years ago
- ☆97Updated 2 years ago
- INTERSPEECH 2023-2024 Papers: A complete collection of influential and exciting research papers from the INTERSPEECH 2023-24 conference. …☆638Updated 3 months ago
- ☆27Updated 2 years ago
- A summary on our attempts at using Deep Learning approaches for Emotional Text to Speech☆427Updated 4 months ago