skinahan / DIVA_PyTorchLinks
Implementation of the DIVA model of speech acquisition and production using PyTorch
☆21Updated 2 years ago
Alternatives and similar repositories for DIVA_PyTorch
Users that are interested in DIVA_PyTorch are comparing it to the libraries listed below
Sorting:
- Keras-based python framework to compute phonological posterior probabilities from audio files☆43Updated 2 years ago
- ☆12Updated 4 years ago
- A Python-based modular toolbox for building Deep Neural Network models (using PyTorch) for statistical parametric speech synthesis☆24Updated 3 years ago
- ☆15Updated 4 years ago
- ☆40Updated 3 years ago
- Syllable Segmentation and Cross-Lingual Generalization in a Visually Grounded, Self-Supervised Speech Model☆33Updated 2 years ago
- SERAB: a multi-lingual benchmark for speech emotion recognition☆28Updated 2 years ago
- Repository for multilingual speech data resources for native languages of Zambia.☆17Updated last year
- Learnable STRF, from Riad et al. 2021 JASA☆13Updated 4 years ago
- Speech Parameter Estimation Using Differentiable Speech Synthesizer☆44Updated 2 years ago
- ☆17Updated last year
- Pypi installable TDNN and TDNN-F layers for PyTorch based acoustic model training☆41Updated 4 years ago
- This repo contains the baseline model recipes and pre-trained model for GramVanni hindi ASR challenge☆15Updated 3 years ago
- ☆26Updated 4 years ago
- Articulatory (text-to-) speech synthesis for Python☆26Updated 5 months ago
- ☆49Updated 4 months ago
- ☆16Updated 4 years ago
- A fundamental frequency estimation algorithm using features from the magnitude and phase spectrogram.☆24Updated 4 years ago
- ☆37Updated 4 years ago
- Lightweight speaker anonymization [IEEE SLT2021]☆27Updated 3 years ago
- Unsupervised Voice Activity Detection by Modeling Source and System Information using Zero Frequency Filtering☆21Updated last year
- Audio activity detector based on per-channel energy normalization (PCEN)☆29Updated 6 years ago
- Balanced Error Rate for Speaker Diarization☆32Updated 2 years ago
- System that ranks 2nd in DCASE 2022 Challenge Task 5: Few-shot Bioacoustic Event Detection☆28Updated 3 years ago
- Semi-supervised Learning for Multi-speaker Text-to-speech Synthesis Using Discrete Speech Representation☆39Updated 5 years ago
- A toolset for easy formant extraction and visualization from wav files and TTS models☆31Updated 3 years ago
- A handy dataset of noises for ASR☆22Updated 6 years ago
- A Python package of the dynamic compressive gammachirp filterbank (dcGC-FB)☆29Updated last year
- Multipurpose Multi Speaker Mixture Signal Generator☆45Updated 8 months ago
- ☆25Updated this week