skinahan / DIVA_PyTorch
Implementation of the DIVA model of speech acquisition and production using PyTorch
☆21Updated last year
Related projects ⓘ
Alternatives and complementary repositories for DIVA_PyTorch
- Keras-based python framework to compute phonological posterior probabilities from audio files☆37Updated last year
- Syllable Segmentation and Cross-Lingual Generalization in a Visually Grounded, Self-Supervised Speech Model☆30Updated last year
- ☆12Updated 3 years ago
- A Python-based modular toolbox for building Deep Neural Network models (using PyTorch) for statistical parametric speech synthesis☆23Updated 2 years ago
- ☆40Updated 2 years ago
- Sylber: Syllabic Embedding Representation of Speech from Raw Audio☆20Updated last month
- This repo contains the baseline model recipes and pre-trained model for GramVanni hindi ASR challenge☆14Updated 2 years ago
- Qualtric or Qualtreat? Generate Qualtrics listening tests for Text-To-Speech evaluations.☆31Updated 4 months ago
- Unsupervised Voice Activity Detection by Modeling Source and System Information using Zero Frequency Filtering☆18Updated last year
- ☆22Updated last year
- ☆15Updated 3 years ago
- A handy dataset of noises for ASR☆19Updated 5 years ago
- ☆26Updated 3 years ago
- SERAB: a multi-lingual benchmark for speech emotion recognition☆28Updated last year
- ☆31Updated last year
- DUSTED: Spoken-Term Discovery using Discrete Speech Units☆13Updated last month
- Pypi installable TDNN and TDNN-F layers for PyTorch based acoustic model training☆38Updated 3 years ago
- Repository for sharing the data in the Tamasheq language, one of the target languages for the low-resource speech translation track at IW…☆15Updated last year
- A Python package of the dynamic compressive gammachirp filterbank (dcGC-FB)☆27Updated 6 months ago
- Grapheme-to-phoneme (G2P) conversion is the process of generating pronunciation for words based on their written form. It has a highly es…☆18Updated 3 years ago
- Speech Parameter Estimation Using Differentiable Speech Synthesizer☆44Updated last year
- A library of speech gadgets.☆13Updated 2 years ago
- ☆16Updated 2 years ago
- SpeechGLUE is a speech version of the GLUE benchmark, driven by text-to-speech.☆13Updated last year
- (Hybrid) BYOL-S feature extractor using serab-byols package in pytorch.☆27Updated 7 months ago
- Learning and controlling the source-filter representation of speech with a variational autoencoder☆45Updated last year
- Code for the paper "Improving Sound Event Classification by Increasing Shift Invariance in Convolutional Neural Networks".☆13Updated last year
- A toolset for easy formant extraction and visualization from wav files and TTS models☆30Updated 2 years ago
- phone inventory library☆15Updated last year
- End-to-end diarization loss☆22Updated 3 years ago