zjlww / dspView external linksLinks
Digital Speech Processing in PyTorch.
☆15Aug 12, 2022Updated 3 years ago
Alternatives and similar repositories for dsp
Users that are interested in dsp are comparing it to the libraries listed below
Sorting:
- Grapheme-to-phoneme (G2P) conversion is the process of generating pronunciation for words based on their written form. It has a highly es…☆19Jun 14, 2021Updated 4 years ago
- ☆36Mar 14, 2025Updated 11 months ago
- Simple torch.nn.module implementation of Alias-Free-GAN style filter and resample☆98Jul 26, 2022Updated 3 years ago
- ☆27Sep 5, 2024Updated last year
- Automatically Update LLM Papers Daily using Github Actions. Ref: https://github.com/Vincentqyw/cv-arxiv-daily☆10Updated this week
- Spatial Voice Conversion: Voice Conversion Preserving Spatial Information and Non-target Signals☆18Aug 8, 2024Updated last year
- This repository provides UNOFFICIAL Bunched LPCNet implementations with Pytorch.☆14Jun 17, 2021Updated 4 years ago
- Code for vec2wav 2.0, a speech token vocoder for VC. Paper: https://arxiv.org/abs/2409.01995☆78Dec 3, 2024Updated last year
- ☆26Apr 21, 2021Updated 4 years ago
- Include Basis-MelGAN, MelGAN, HifiGAN and Multiband-HifiGAN, maybe NHV in the future.☆157Jul 2, 2021Updated 4 years ago
- Multispeaker Community Vocoder Model for DiffSinger☆39Aug 11, 2025Updated 6 months ago
- Evaluation tool used in the BigVSAN paper☆14Mar 22, 2024Updated last year
- ☆14Aug 19, 2024Updated last year
- Models and code for RepCodec: A Speech Representation Codec for Speech Tokenization☆191Jul 12, 2024Updated last year
- ☆19Sep 20, 2024Updated last year
- GPT-style network for phonemization with durations of text☆68Mar 21, 2024Updated last year
- The Multi-band Excited WaveNet☆15Feb 2, 2023Updated 3 years ago
- Official PyTorch implementation for "Understanding Instance-based Interpretability of Variational Auto-Encoders."☆13Oct 21, 2021Updated 4 years ago
- Unofficial Implementation of Zero-Shot Text-to-Speech for Text-Based Insertion in Audio Narration☆34Sep 24, 2021Updated 4 years ago
- [Interspeech 2025] Official implementation of "Training-Free Voice Conversion with Factorized Optimal Transport"☆43Sep 24, 2025Updated 4 months ago
- ☆16Dec 23, 2021Updated 4 years ago
- Megatts2 use HierSpeechpp's vocoder☆18Dec 2, 2024Updated last year
- RepVgg + HiFiGAN☆36Aug 10, 2022Updated 3 years ago
- PyTorch Implementation of Google Brain's WaveGrad 2: Iterative Refinement for Text-to-Speech Synthesis☆69Aug 3, 2021Updated 4 years ago
- ABX and kaldi experiments on speech corpora made easy☆33Oct 7, 2024Updated last year
- High-Fidelity Neural Phonetic Posteriorgrams☆122Feb 23, 2025Updated 11 months ago
- ☆20Jul 22, 2022Updated 3 years ago
- ☆20Jun 5, 2022Updated 3 years ago
- [Official Implementation] Acoustic Autoregressive Modeling 🔥☆74Aug 24, 2024Updated last year
- Try to replicate the architecture of MiniMaxTTS mentioned in it's technical report☆49Sep 2, 2025Updated 5 months ago
- ☆16Dec 18, 2023Updated 2 years ago
- The official implementation of DMEL the method presented in the paper "DMEL: The differentiable log-Mel spectrogram as a trainable layer …☆22Dec 21, 2024Updated last year
- Repository for sharing the data in the Tamasheq language, one of the target languages for the low-resource speech translation track at IW…☆18Nov 30, 2022Updated 3 years ago
- Parallel waveform generation with DiffusionGAN☆17Mar 26, 2022Updated 3 years ago
- ☆20Jul 13, 2022Updated 3 years ago
- ☆19May 2, 2024Updated last year
- [EMNLP 2025 Findings] Official code for EZ-VC: Easy Zero-shot Any-to-Any Voice Conversion☆33Sep 9, 2025Updated 5 months ago
- Differentiable dynamic range controller in PyTorch.☆52Jan 11, 2026Updated last month
- Implementation of DCComix TTS: An End-to-End Expressive TTS with Discrete Code Collaborated with Mixer☆75Aug 21, 2023Updated 2 years ago