aqtq314 / VogenSVS
☆10Updated last week
Alternatives and similar repositories for VogenSVS:
Users who are interested in VogenSVS are comparing it to the libraries listed below.
- Spherical residual vector quantization (SRVQ)☆28Updated 7 months ago
- FREECODEC: A DISENTANGLED NEURAL SPEECH CODEC WITH FEWER TOKENS☆19Updated 6 months ago
- HiFTNet wav/audio super-resolution 16/24 kHz to 48 kHz☆24Updated last year
- ☆21Updated last year
- ☆12Updated 3 weeks ago
- ☆15Updated 6 months ago
- ☆16Updated 8 months ago
- ICASSP 2025 Dynamic Embedding Causal Target Speech Extraction☆2Updated 2 weeks ago
- Real-time speech enhancement☆13Updated last year
- The implementation of MDNet, which is in submission to Interspeech 2022☆13Updated 2 years ago
- ☆11Updated 2 years ago
- Please visit https://thuhcsi.github.io/SnakeGAN/☆36Updated last year
- A Chinese singing voice dataset, professional male singer, 105 songs, 132 minutes☆11Updated last year
- Multiband version of HiFi-GAN☆18Updated 4 years ago
- This is the implementation of the manuscript "Learning General All-Neural Speech Enhancement based on Taylor's Approximation Theory", whi…☆14Updated 2 years ago
- ☆14Updated last month
- ☆23Updated 11 months ago
- Zero-Shot Blind Audio Bandwidth Extension☆20Updated last year
- Streaming Vocos☆22Updated 2 months ago
- ☆13Updated 5 months ago
- ☆47Updated this week
- Tidy Tunes is an easy-to-use pipeline for mining high-quality audio data for speech generation models. To do so, it chains multiple open …☆16Updated this week
- Reimplementation of Miipher☆20Updated last year
- GlottDNN vocoder and tools for training DNN excitation models☆32Updated 4 years ago
- ☆11Updated last month
- ☆26Updated last year
- ☆21Updated 2 years ago
- A robust pitch tracker using synchro-squeezed FFT and frequency-domain autocorrelation☆33Updated last year
- ☆13Updated last year
- A toolkit for researchers in multimodal sound separation.☆16Updated last year