davidmarttila / vocal-tract-gradView external linksLinks
Vocal Tract Area Estimation by Gradient Descent
☆38Jul 16, 2023Updated 2 years ago
Alternatives and similar repositories for vocal-tract-grad
Users that are interested in vocal-tract-grad are comparing it to the libraries listed below
Sorting:
- ☆11Nov 7, 2024Updated last year
- Glottal Flow Model-based Iterative Adaptive Inverse Filtering☆27Sep 28, 2020Updated 5 years ago
- ☆27Sep 5, 2024Updated last year
- ☆19Sep 20, 2024Updated last year
- Unofficial implementation of ConvNeXt-TTS powered by lightning☆18Oct 20, 2024Updated last year
- A three-dimensional vocal tract acoustic model using the finite-difference time-domain (FDTD) numerical scheme.☆17Sep 25, 2022Updated 3 years ago
- Speech Parameter Estimation Using Differentiable Speech Synthesizer☆44May 9, 2023Updated 2 years ago
- This is not remotely close to a finished product, and does not intend to nor does this claim to be working fine-tuning code for MaskGCT. …☆13Dec 4, 2024Updated last year
- ☆19Mar 22, 2024Updated last year
- ☆14Aug 1, 2025Updated 6 months ago
- Reimplementation of Miipher☆29Aug 16, 2023Updated 2 years ago
- Fast and differentiable time domain all-pole filter in PyTorch.☆68Feb 5, 2026Updated last week
- Clean and modernized implementation of FastSpeech2/LightSpeech using IPA☆18Aug 16, 2024Updated last year
- Fast and differentiable hidden Markov model in C++☆19Jan 20, 2023Updated 3 years ago
- A solution to denoising and separating for two-speaker-mixed noisy speech, using a BSRNN inspired network.☆14Aug 22, 2023Updated 2 years ago
- ESLTTS dataset☆16Feb 6, 2025Updated last year
- ☆26Apr 21, 2021Updated 4 years ago
- A Python Library for Fundamental Frequency Estimation in Music Recordings☆54Jan 16, 2026Updated last month
- A multilingual phoneme recognizer capable of generalizing zero-shot to unseen phoneme inventories.☆27Mar 14, 2025Updated 11 months ago
- ☆13Sep 12, 2024Updated last year
- Implementation of 'Vocos: Closing the gap between time-domain and Fourier-based neural vocoders for high-quality audio synthesis', in MLX☆23Oct 30, 2024Updated last year
- Pitch Controllable DDSP Vocoders☆78Nov 9, 2024Updated last year
- GlottDNN vocoder and tools for training DNN excitation models☆32Feb 27, 2021Updated 4 years ago
- LIGHTVOC AN UPSAMPLING-FREE GAN VOCODER BASED ON CONFORMER AND INVERSE SHORT-TIME FOURIER TRANSFORM☆18May 17, 2024Updated last year
- ETH Zürich MSc Thesis: Accelerating Neural Audio Synthesis☆22Apr 10, 2023Updated 2 years ago
- Repository for multilingual speech data resources for native languages of Zambia.☆20Oct 9, 2024Updated last year
- Implementation of vocoders empowered with pytorch lightning☆18Jan 27, 2024Updated 2 years ago
- source code of EfficientTTS 2☆20Feb 18, 2024Updated 2 years ago
- [SpeechCom Journal] Learning and controlling the source-filter representation of speech with a variational autoencoder☆45Apr 18, 2023Updated 2 years ago
- ☆47Nov 13, 2021Updated 4 years ago
- A pipeline for recording datasets and running neural networks in Bela. In collaboration with @rodrigodzf, @adanlbenito and @apmcpherson☆29Dec 1, 2023Updated 2 years ago
- Pitch-shifting, time-stretching, and vocoding of speech with Controllable LPCNet (CLPCNet)☆163Aug 5, 2022Updated 3 years ago
- Speech enhancement in noisy and reverberant environments using deep neural networks☆22Oct 10, 2025Updated 4 months ago
- A family of efficient speech models for multilingual phone recognition☆42Updated this week
- This is the implementation our Interspeech 2022 paper " Disentanglement of Emotional Style and Speaker Identity for Expressive Voice Conv…☆21Sep 18, 2023Updated 2 years ago
- This repository contains source codes for SoftCTC. Original paper can be found here: https://arxiv.org/abs/2212.02135☆19Mar 7, 2023Updated 2 years ago
- Prosody and Pronunciation Modification Network☆62May 5, 2025Updated 9 months ago
- Code for the "NoiseBandNet: Controllable Time-Varying Neural Synthesis of Sound Effects Using Filterbanks" paper.☆39Jul 8, 2024Updated last year
- Source code and demo for INTERPSEECH 2023 paper: DuTa-VC: A Duration-aware Typical-to-atypical Voice Conversion Approach with Diffusion P…☆37Dec 5, 2023Updated 2 years ago