vincenzo-scotti / ITAcotron_2Links
Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)
☆10Updated 2 years ago
Alternatives and similar repositories for ITAcotron_2
Users that are interested in ITAcotron_2 are comparing it to the libraries listed below
Sorting:
- Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN & HiFi-GAN & StyleMelGAN) with Pytorch☆1,611Updated last year
- A Pytorch Implementation of "Neural Speech Synthesis with Transformer Network"☆679Updated last year
- A Generative Flow for Text-to-Speech via Monotonic Alignment Search☆693Updated 3 years ago
- This repository contains the SpeechBrain Benchmarks☆123Updated last week
- Fast CUDA implementation of (differentiable) soft dynamic time warping for PyTorch☆694Updated last year
- Problem Agnostic Speech Encoder☆442Updated 2 years ago
- feature extraction from speech signals☆376Updated last month
- Large, modern dataset for speech recognition☆683Updated last year
- DiffWave is a fast, high-quality neural vocoder and waveform synthesizer.☆852Updated last year
- End-to-End Neural Diarization☆404Updated 3 years ago
- Wav2Vec for speech recognition, classification, and audio classification☆265Updated 3 years ago
- GAN-based Mel-Spectrogram Inversion Network for Text-to-Speech Synthesis☆1,015Updated last year
- 🐸 collection of TTS papers☆707Updated last year
- The Microsoft Scalable Noisy Speech Dataset (MS-SNSD) is a noisy speech dataset that can scale to arbitrary sizes depending on the number…☆538Updated last year
- A library for speech data augmentation in time-domain☆668Updated 3 years ago
- An implementation of the Contrast Predictive Coding (CPC) method to train audio features in an unsupervised fashion.☆359Updated 3 years ago
- 1D CNN based classifier for Speech Commands Dataset☆9Updated 7 years ago
- HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis☆2,189Updated 11 months ago
- INTERSPEECH 2023-2024 Papers: A complete collection of influential and exciting research papers from the INTERSPEECH 2023-24 conference. …☆677Updated 6 months ago
- Unsupervised Speech Decomposition Via Triple Information Bottleneck☆691Updated 9 months ago
- This repository contains PyTorch implementation of 4 different models for classification of emotions of the speech.☆204Updated 2 years ago
- Open-Source Toolkit for End-to-End Speech Recognition leveraging PyTorch-Lightning and Hydra.☆705Updated last year
- These are praat scripts I use in my research, implemented in parselmouth for python for use in binder☆132Updated 3 years ago
- Supervised Speech Representation Learning for Parkinson's Disease Classification☆15Updated 3 years ago
- speech to text with self-supervised learning based on wav2vec 2.0 framework☆383Updated 3 years ago
- Allosaurus is a pretrained universal phone recognizer for more than 2000 languages☆642Updated last year
- Official implementation of the paper "Automatic Severity Assessment of Dysarthric speech by using Self-supervised Model with Multi-task L…☆10Updated last year
- PyTorch Implementation of Non-autoregressive Expressive (emotional, conversational) TTS based on FastSpeech2, supporting English, Korean,…☆304Updated 3 years ago
- The Implementation of FastSpeech based on pytorch.☆873Updated 2 years ago
- A summary on our attempts at using Deep Learning approaches for Emotional Text to Speech☆453Updated last year