vincenzo-scotti / ITAcotron_2
Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)
☆10 · Updated 3 years ago
Alternatives and similar repositories for ITAcotron_2
Users who are interested in ITAcotron_2 are comparing it to the libraries listed below.
- Fast CUDA implementation of (differentiable) soft dynamic time warping for PyTorch ☆716 · Updated last year
- Supervised Speech Representation Learning for Parkinson's Disease Classification ☆15 · Updated 4 years ago
- This repository contains PyTorch implementation of 4 different models for classification of emotions of the speech. ☆206 · Updated 2 years ago
- (Realtime) Temporal Convolutions in PyTorch ☆168 · Updated 7 months ago
- This repository is a Python implementation of HMM-DNN model. ☆15 · Updated 5 years ago
- simple version of our torch kaldi toolkit, developed at the LIA by 2 apprentices. (@Chaanks & @vbrignatz) ☆10 · Updated 4 years ago
- INTERSPEECH 2023-2024 Papers: A complete collection of influential and exciting research papers from the INTERSPEECH 2023-24 conference. … ☆685 · Updated 10 months ago
- Implementation of the paper "wav2vec 2.0: A Framework for Self-Supervised Learning of Speech Representations" in Pytorch. ☆51 · Updated 2 years ago
- This repository contains the SpeechBrain Benchmarks ☆128 · Updated 3 months ago
- End-to-End Neural Diarization ☆409 · Updated 4 years ago
- A library for speech data augmentation in time-domain ☆676 · Updated 4 years ago
- An implementation of the Contrast Predictive Coding (CPC) method to train audio features in an unsupervised fashion. ☆365 · Updated 4 years ago
- feature extraction from speech signals ☆385 · Updated 4 months ago
- A Pytorch Implementation of "Neural Speech Synthesis with Transformer Network" ☆685 · Updated 2 years ago
- PyTorch implementation of "FullSubNet: A Full-Band and Sub-Band Fusion Model for Real-Time Single-Channel Speech Enhancement." ☆579 · Updated 2 years ago
- Keras (tensorflow) implementation of SincNet (Mirco Ravanelli, Yoshua Bengio - https://github.com/mravanelli/SincNet) ☆74 · Updated 4 years ago
- Large, modern dataset for speech recognition ☆702 · Updated last year
- Speaker embedding (d-vector) trained with GE2E loss ☆287 · Updated last year
- Code for the AAAI 2022 paper "SSAST: Self-Supervised Audio Spectrogram Transformer". ☆403 · Updated 3 years ago
- Score-based Generative Models (Diffusion Models) for Speech Enhancement and Dereverberation ☆678 · Updated 2 months ago
- Python package for openSMILE ☆297 · Updated last week
- Multilingual datasets with raw audio for speech emotion recognition ☆30 · Updated 4 years ago
- VArious audio processing tasks ☆21 · Updated 3 years ago
- Official implementation of INTERSPEECH 2021 paper 'Emotion Recognition from Speech Using Wav2vec 2.0 Embeddings' ☆137 · Updated 10 months ago
- The Microsoft Scalable Noisy Speech Dataset (MS-SNSD) is a noisy speech dataset that can scale to arbitrary sizes depending on the number… ☆562 · Updated last year
- Official PyTorch Implementation of CleanUNet (ICASSP 2022) ☆334 · Updated 2 years ago
- LEAF is a learnable alternative to audio features such as mel-filterbanks, that can be initialized as an approximation of mel-filterbanks… ☆516 · Updated 3 years ago
- Wav2Vec for speech recognition, classification, and audio classification ☆267 · Updated 3 years ago
- [Unofficial] PyTorch implementation of "Conformer: Convolution-augmented Transformer for Speech Recognition" (INTERSPEECH 2020) ☆1,081 · Updated last year
- PyTorch Implementation of Non-autoregressive Expressive (emotional, conversational) TTS based on FastSpeech2, supporting English, Korean,… ☆309 · Updated 4 years ago