skinahan / DIVA_PyTorchLinks
Implementation of the DIVA model of speech acquisition and production using PyTorch
☆21Updated 2 years ago
Alternatives and similar repositories for DIVA_PyTorch
Users that are interested in DIVA_PyTorch are comparing it to the libraries listed below
Sorting:
- Keras-based python framework to compute phonological posterior probabilities from audio files☆43Updated 2 years ago
- SERAB: a multi-lingual benchmark for speech emotion recognition☆28Updated 2 years ago
- ☆12Updated 4 years ago
- Syllable Segmentation and Cross-Lingual Generalization in a Visually Grounded, Self-Supervised Speech Model☆34Updated 2 years ago
- ☆15Updated 4 years ago
- A Python-based modular toolbox for building Deep Neural Network models (using PyTorch) for statistical parametric speech synthesis☆23Updated 3 years ago
- ☆40Updated 3 years ago
- Articulatory (text-to-) speech synthesis for Python☆26Updated 7 months ago
- Speech Parameter Estimation Using Differentiable Speech Synthesizer☆44Updated 2 years ago
- ☆26Updated 4 years ago
- ☆52Updated 6 months ago
- ☆37Updated 4 years ago
- Pypi installable TDNN and TDNN-F layers for PyTorch based acoustic model training☆41Updated 4 years ago
- Lightweight speaker anonymization [IEEE SLT2021]☆27Updated 3 years ago
- DUSTED: Spoken-Term Discovery using Discrete Speech Units☆18Updated last year
- System that ranks 2nd in DCASE 2022 Challenge Task 5: Few-shot Bioacoustic Event Detection☆28Updated 3 years ago
- This repo contains the baseline model recipes and pre-trained model for GramVanni hindi ASR challenge☆15Updated 3 years ago
- ☆16Updated 4 years ago
- Digital Speech Processing in PyTorch.☆15Updated 3 years ago
- Phoneme segmentation using pre-trained speech models☆55Updated 3 years ago
- Learnable STRF, from Riad et al. 2021 JASA☆13Updated 4 years ago
- Deep Articulatory Synthesis and Inversion☆54Updated last year
- Repository for multilingual speech data resources for native languages of Zambia.☆19Updated last year
- Unsupervised Voice Activity Detection by Modeling Source and System Information using Zero Frequency Filtering☆22Updated 2 years ago
- A toolset for easy formant extraction and visualization from wav files and TTS models☆32Updated 3 years ago
- A fundamental frequency estimation algorithm using features from the magnitude and phase spectrogram.☆24Updated 4 years ago
- ☆18Updated last year
- Semi-supervised Learning for Multi-speaker Text-to-speech Synthesis Using Discrete Speech Representation☆39Updated 5 years ago
- [SpeechCom Journal] Learning and controlling the source-filter representation of speech with a variational autoencoder☆45Updated 2 years ago
- Grapheme-to-phoneme (G2P) conversion is the process of generating pronunciation for words based on their written form. It has a highly es…☆19Updated 4 years ago