skinahan / DIVA_PyTorch
Implementation of the DIVA model of speech acquisition and production using PyTorch
☆21Updated 2 years ago
Alternatives and similar repositories for DIVA_PyTorch
Users who are interested in DIVA_PyTorch are comparing it to the libraries listed below.
- A Python-based modular toolbox for building Deep Neural Network models (using PyTorch) for statistical parametric speech synthesis☆24Updated 3 years ago
- ☆17Updated last year
- Articulatory (text-to-) speech synthesis for Python☆26Updated 4 months ago
- Keras-based Python framework to compute phonological posterior probabilities from audio files☆44Updated 2 years ago
- ☆12Updated 4 years ago
- SERAB: a multi-lingual benchmark for speech emotion recognition☆28Updated 2 years ago
- ☆15Updated 4 years ago
- ☆40Updated 3 years ago
- This repo contains the baseline model recipes and pre-trained model for the GramVanni Hindi ASR challenge☆15Updated 3 years ago
- ☆26Updated 4 years ago
- Learnable STRF, from Riad et al. 2021 JASA☆13Updated 4 years ago
- A toolset for easy formant extraction and visualization from wav files and TTS models☆31Updated 3 years ago
- Repository for multilingual speech data resources for native languages of Zambia.☆18Updated 11 months ago
- Syllable Segmentation and Cross-Lingual Generalization in a Visually Grounded, Self-Supervised Speech Model☆32Updated 2 years ago
- Speech Parameter Estimation Using Differentiable Speech Synthesizer☆44Updated 2 years ago
- ☆46Updated 3 months ago
- Unsupervised Voice Activity Detection by Modeling Source and System Information using Zero Frequency Filtering☆21Updated last year
- Lightweight speaker anonymization [IEEE SLT2021]☆27Updated 3 years ago
- A Python package of the dynamic compressive gammachirp filterbank (dcGC-FB)☆29Updated last year
- PyPI-installable TDNN and TDNN-F layers for PyTorch-based acoustic model training☆41Updated 4 years ago
- NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment☆16Updated 3 years ago
- Simple Kaldi recipe for forced alignment☆10Updated 2 years ago
- Grapheme-to-phoneme (G2P) conversion is the process of generating pronunciation for words based on their written form. It has a highly es…☆19Updated 4 years ago
- ABX discrimination task in Python☆45Updated 11 months ago
- Speechflow for emotion recognition related information decomposition☆10Updated 4 years ago
- System that ranks 2nd in DCASE 2022 Challenge Task 5: Few-shot Bioacoustic Event Detection☆28Updated 3 years ago
- ☆16Updated 4 years ago
- Multipurpose Multi Speaker Mixture Signal Generator☆45Updated 7 months ago
- Tools for the automatic detection of speech-related inhalation events and characterisation of the speech respiratory cycle.☆11Updated last year
- A handy dataset of noises for ASR☆22Updated 6 years ago