Implementation of the DIVA model of speech acquisition and production using PyTorch
☆22Jan 18, 2023Updated 3 years ago
Alternatives and similar repositories for DIVA_PyTorch
Users that are interested in DIVA_PyTorch are comparing it to the libraries listed below
Sorting:
- Behavioral probing of language acquisition models at the lexical and syntactic level☆18Jul 17, 2023Updated 2 years ago
- This repo contains the baseline model recipes and pre-trained model for GramVanni hindi ASR challenge☆15Mar 26, 2022Updated 3 years ago
- ☆10Mar 20, 2021Updated 5 years ago
- ☆20Sep 20, 2024Updated last year
- ☆57May 29, 2025Updated 9 months ago
- Sylber: Syllabic Embedding Representation of Speech from Raw Audio☆74Mar 17, 2025Updated last year
- NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment☆16Apr 13, 2022Updated 3 years ago
- A Python package of the dynamic compressive gammachirp filterbank (dcGC-FB)☆32May 14, 2024Updated last year
- VoxAngeles Corpus☆14Aug 23, 2025Updated 6 months ago
- Unsupervised speech activity detection system.☆11Jul 2, 2018Updated 7 years ago
- A toolset for easy formant extraction and visualization from wav files and TTS models☆33Sep 2, 2022Updated 3 years ago
- ☆18Jan 17, 2022Updated 4 years ago
- ☆56Dec 19, 2022Updated 3 years ago
- Annotations and scripts for use with University of Wisconsin X-Ray Microbeam Speech Production Database (1994)☆13Oct 8, 2020Updated 5 years ago
- Speech Parameter Estimation Using Differentiable Speech Synthesizer☆44May 9, 2023Updated 2 years ago
- Neural Language Models as Psycholinguistic Subjects: Representations of Syntactic State☆17Mar 4, 2019Updated 7 years ago
- Project for HIDING SPEAKER’S SEX IN SPEECH USING ZERO-EVIDENCE SPEAKER REPRESENTATION IN AN ANALYSIS/SYNTHESIS PIPELINE☆15Nov 30, 2022Updated 3 years ago
- A Python toolbox for text based word segmentation☆19Jan 27, 2021Updated 5 years ago
- ☆15May 8, 2021Updated 4 years ago
- Articulatory (text-to-) speech synthesis for Python☆29May 7, 2025Updated 10 months ago
- Korean read speech corpus (about 120 hours, 17GB) from National Institute of Korean Language☆43Feb 28, 2018Updated 8 years ago
- Enable RNNLM lattice rescoring with Pytorch [kaldi]☆12Jun 5, 2020Updated 5 years ago
- Grammar rules and dictionaries for the phonetic transcription of Russian sentences☆33Sep 23, 2021Updated 4 years ago
- ☆33Nov 27, 2021Updated 4 years ago
- Wenet speech to text for react native☆10Nov 1, 2022Updated 3 years ago
- A free & open tool for transcribing audio interviews with offline ASR support☆25Dec 21, 2023Updated 2 years ago
- DysfluentWFST☆18Nov 13, 2025Updated 4 months ago
- Autovocoder: Fast Waveform Generation from a Learned Speech Representation using Differentiable Digital Signal Processing☆71Dec 2, 2022Updated 3 years ago
- ☆29May 3, 2023Updated 2 years ago
- Deep Articulatory Synthesis and Inversion☆55Feb 14, 2024Updated 2 years ago
- Official code for the "Optimal Variance Control of the Score Function Gradient Estimator for Importance Weighted Bounds"☆10Feb 16, 2023Updated 3 years ago
- Code accompanying VarGrad: A Low-Variance Gradient Estimator for Variational Inference☆12Oct 12, 2020Updated 5 years ago
- Speechflow for emotion recognition related information decomposition☆10Jul 27, 2021Updated 4 years ago
- Implementation of the paper "Variable Bitrate Residual Vector Quantization for Audio Coding"☆11Apr 10, 2025Updated 11 months ago
- Pumilio: A Web-Based Management System for Ecological Recordings☆13Oct 29, 2018Updated 7 years ago
- A corpus of diacritized Hebrew texts (טקסט מנוקד)☆11May 4, 2022Updated 3 years ago
- Synthesized singing voice demos of WeSinger 2 paper.☆26Feb 20, 2023Updated 3 years ago
- ☆19Nov 4, 2022Updated 3 years ago
- S3PRL for Speech Emotion Recognition (see s3prl > downstream)☆15Feb 28, 2026Updated 3 weeks ago