Yip-Jia-Qi / codecformerLinks
☆17Updated last year
Alternatives and similar repositories for codecformer
Users that are interested in codecformer are comparing it to the libraries listed below
Sorting:
- Spherical residual vector quantization (SRVQ)☆30Updated last year
- ☆13Updated 5 months ago
- (ICASSP 2025) Learning Source Disentanglement in Neural Audio Codec☆36Updated 3 months ago
- Source code and demo for INTERSPEECH 2024 paper: Noise-robust Speech Separation with Fast Generative Correction☆41Updated 9 months ago
- A neural speech codec based on discrete WavLM representations☆24Updated 11 months ago
- Baseline for DCASE 2024 Task 9: "Language-Queried Audio Source Separation"☆27Updated last year
- ☆48Updated 4 months ago
- Attention-Enhanced Short-Time Wiener Solution for Acoustic Echo Cancellation☆16Updated last month
- Whisper Speech Quality Assessment (WhiSQA)☆15Updated 8 months ago
- LLaSE: Maximizing Acoustic Preservation for LLaMA based Speech Enhancement☆16Updated last month
- ☆27Updated last year
- Please visit https://thuhcsi.github.io/SnakeGAN/☆37Updated 2 years ago
- Official Repository for "Training-Free Multi-Step Audio Source Separation"☆49Updated 2 months ago
- ☆54Updated 2 years ago
- ☆40Updated last month
- A toolkit for researchers in the multimodal sound separation.☆16Updated last year
- ☆48Updated 2 months ago
- Speech Resynthesis and Language Modeling☆26Updated 2 months ago
- HiFTNet wav/audio super-resolution 16/24 kHz to 48 kHz☆24Updated last year
- Official implementation of paper: Frame-Wise Breath Detection with Self-Training: An Exploration of Enhancing Breath Naturalness in Text-…☆28Updated 11 months ago
- Evaluation tool used in the BigVSAN paper☆14Updated last year
- offical code for Dense-TSNet☆12Updated 11 months ago
- PyTorch Implementation of [WMCodec: End-to-End Neural Speech Codec with Deep Watermarking for Authenticity Verification](https://arxiv.or…☆15Updated 3 weeks ago
- Inference code for Interspeech 2025 paper, "LSCodec: Low-Bitrate and Speaker-Decoupled Discrete Speech Codec"☆20Updated 2 weeks ago
- Bilingual Singing Voice Synthesis☆18Updated last year
- ☆63Updated 2 years ago
- DiFlow-TTS: Discrete Flow Matching with Factorized Speech Tokens for Low-Latency Zero-Shot Text-to-Speech☆37Updated 2 weeks ago
- Streaming Vocos☆29Updated 2 months ago
- A Singing Style Conversion Framework Based On Audio Infilling☆26Updated 3 months ago
- Speech enhancement in noisy and reverberant environments using deep neural networks☆21Updated 2 weeks ago