Fast Python implementation of the Yin algorithm: a fundamental frequency estimator
☆104Oct 19, 2022Updated 3 years ago
Alternatives and similar repositories for Yin
Users that are interested in Yin are comparing it to the libraries listed below
Sorting:
- PyTorch implementation of Retriever: Learning Content-Style Representation☆12Jan 27, 2023Updated 3 years ago
- ☆25Mar 12, 2022Updated 3 years ago
- Yin pitch estimator in PyTorch☆117Nov 7, 2022Updated 3 years ago
- Audio Generation model working with GPT-2 and VQVAE compressed representation of MelSpectrograms☆18Oct 8, 2023Updated 2 years ago
- A python wrapper for REAPER☆81Jan 22, 2025Updated last year
- ICASSP 2020 ESPnet-TTS: Merlin baseline system☆36Oct 28, 2019Updated 6 years ago
- ☆37May 8, 2021Updated 4 years ago
- [SpeechCom Journal] Learning and controlling the source-filter representation of speech with a variational autoencoder☆45Apr 18, 2023Updated 2 years ago
- Generate audio datasets for training Text-To-Speech models, through smart audio splitting with silence detection, and transcription using…☆30May 27, 2023Updated 2 years ago
- ☆18Feb 9, 2020Updated 6 years ago
- A fundamental frequency estimation algorithm using features from the magnitude and phase spectrogram.☆24Mar 29, 2021Updated 4 years ago
- ICASSP2022 TTS&VC Summary☆14Jun 9, 2022Updated 3 years ago
- ☆80Aug 8, 2025Updated 6 months ago
- An extensible speech synthesis system, build with PyTorch and the original code is from r9y9's https://github.com/r9y9/nnmnkwii_gallery☆26Jun 24, 2019Updated 6 years ago
- A pytroch implementation of the FB-MelGAN☆90May 26, 2020Updated 5 years ago
- ☆34Jul 16, 2019Updated 6 years ago
- This is an implementation of "Generative adversarial network-based postfilter for statistical parametric speech synthesis"☆16Jun 27, 2018Updated 7 years ago
- WIP Tensorflow implementation of https://github.com/mozilla/TTS☆15Apr 11, 2020Updated 5 years ago
- A python implementation of the Griffin Lim Algorithm for audio reconstruction from magnitudes☆34Jan 17, 2024Updated 2 years ago
- Please visit https://thuhcsi.github.io/SnakeGAN/☆37Apr 25, 2023Updated 2 years ago
- ERISHA is a mulitilingual multispeaker expressive speech synthesis framework. It can transfer the expressivity to the speaker's voice for…☆43Dec 17, 2020Updated 5 years ago
- Pitch-shifting, time-stretching, and vocoding of speech with Controllable LPCNet (CLPCNet)☆164Aug 5, 2022Updated 3 years ago
- ☆25Apr 24, 2019Updated 6 years ago
- ☆55Jan 13, 2023Updated 3 years ago
- ☆51Feb 15, 2019Updated 7 years ago
- BigVGAN with Neural Source-Filter☆56Sep 21, 2023Updated 2 years ago
- VAE Tacotron 2, an alternative of GST Tacotron☆90Jul 6, 2023Updated 2 years ago
- Speech Parameter Estimation Using Differentiable Speech Synthesizer☆44May 9, 2023Updated 2 years ago
- PyTorch Implementation of Google's Parallel Tacotron 2: A Non-Autoregressive Neural TTS Model with Differentiable Duration Modeling☆191Nov 18, 2021Updated 4 years ago
- TTS for pitch-accented language. Korean dialect DB.☆157May 12, 2023Updated 2 years ago
- NU-Wave: A Diffusion Probabilistic Model for Neural Audio Upsampling☆37May 25, 2021Updated 4 years ago
- A PyTorch implementation of the FFTNet: a Real-Time Speaker-Dependent Neural Vocoder☆94Jul 17, 2018Updated 7 years ago
- A PyTorch implementation of Location-Relative Attention Mechanisms For Robust Long-Form Speech Synthesis☆114Dec 2, 2020Updated 5 years ago
- ☆10Apr 8, 2024Updated last year
- A dataset of 173 progressive metal songs, in both GuitarPro and token formats, as per the specifications in DadaGP.☆17Nov 19, 2024Updated last year
- ☆25Oct 4, 2022Updated 3 years ago
- A differentiable version of SPTK☆193Feb 26, 2026Updated last week
- Joint CTC-S2S Phoneme-level ASR for Voice Conversion and TTS (Text-Mel Alignment)☆125Jun 16, 2022Updated 3 years ago
- NANSY++: Unified Voice Synthesis with Neural Analysis and Synthesis☆151Feb 11, 2023Updated 3 years ago