☆35Sep 24, 2024Updated last year
Alternatives and similar repositories for APCodec
Users that are interested in APCodec are comparing it to the libraries listed below
Sorting:
- ☆19Mar 2, 2024Updated 2 years ago
- Source code of APNet2, a vocoder☆58Nov 23, 2023Updated 2 years ago
- ☆54Mar 2, 2023Updated 3 years ago
- ICASSP 2024 - Generative De-Quantization for Neural Speech Codec via Latent Diffusion.☆56Nov 16, 2025Updated 4 months ago
- ☆36Jun 16, 2023Updated 2 years ago
- FREECODEC: A DISENTANGLED NEURAL SPEECH CODEC WITH FEWER TOKENS☆24Sep 9, 2024Updated last year
- ☆46Jul 7, 2025Updated 8 months ago
- High fidelity, lightweight, end-to-end, streaming, convolution-based neural audio codec☆116Jun 23, 2025Updated 8 months ago
- Official repository for the paper Singing Voice Graph Modeling for SingFake Detection (Interspeech 2024).☆25Sep 19, 2025Updated 6 months ago
- A neural speech codec based on discrete WavLM representations☆25Aug 28, 2024Updated last year
- An neural full-band audio codec for general audio sampled at 48 kHz with 7.5 kps or 4.5 kbps.☆198Jul 14, 2025Updated 8 months ago
- Towards High-Quality and Efficient Speech Bandwidth Extension with Parallel Amplitude and Phase Prediction☆13Jul 22, 2024Updated last year
- [NAACL 2025] WaveFM: A High-Fidelity and Efficient Vocoder Based on Flow Matching☆123Mar 27, 2025Updated 11 months ago
- Source code for the EMNLP 2025 paper “DM-Codec: Distilling Multimodal Representations for Speech Tokenization”☆56Jun 1, 2025Updated 9 months ago
- Psychoacoustic Calibration for Efficient Neural Audio Coding☆26Sep 26, 2023Updated 2 years ago
- Audio Codec Speech processing Universal PERformance Benchmark☆299Jan 8, 2026Updated 2 months ago
- Unofficial SoundStream implementation of Pytorch with training code and 16kHz pretrained checkpoint☆78Feb 9, 2026Updated last month
- A low-bitrate single-codebook 16 / 24 kHz speech codec based on focal modulation☆156Nov 30, 2025Updated 3 months ago
- AudioCodec-Hub is a Python library for encoding and decoding audio data, supporting various neural audio codec models☆25Sep 26, 2023Updated 2 years ago
- TechSinger: Technique Controllable Multilingual Singing Voice Synthesis via Flow Matching☆96Oct 9, 2025Updated 5 months ago
- ☆18Jan 10, 2024Updated 2 years ago
- ☆11Feb 14, 2025Updated last year
- [ACL 2025] OZSpeech: One-step Zero-shot Speech Synthesis with Learned-Prior-Conditioned Flow Matching☆45Feb 9, 2025Updated last year
- ☆13Sep 12, 2024Updated last year
- [Interspeech 2025] DualCodec: A Low-Frame-Rate, Semantically-Enhanced Neural Audio Codec☆63Mar 11, 2026Updated last week
- Causal Speech Enhancement Based on a Two-Branch Nested U-Net Architecture Using Self-Supervised Speech Embeddings☆19Jun 6, 2025Updated 9 months ago
- ☆156Nov 22, 2024Updated last year
- unofficial implementation of the High Fidelity Neural Audio Compression☆177Aug 15, 2024Updated last year
- In this project, we will perform 12-lead ECG Multi-label Classification. Specifically, we will design a multi-model utilizing the charact…☆11Aug 26, 2024Updated last year
- An Open-source Streaming High-fidelity Neural Audio Codec☆500Mar 4, 2025Updated last year
- Implementation of the paper "Variable Bitrate Residual Vector Quantization for Audio Coding"☆11Apr 10, 2025Updated 11 months ago
- [ACMMM'2024] Generative Expressive Conversational Speech Synthesis☆44Oct 28, 2024Updated last year
- 2D residual U-Net (ResUNet) and a lead combiner (LC) for 12-lead ECG Abnormality Classification☆15Jan 4, 2024Updated 2 years ago
- Variable Bitrate Residual Vector Quantization for Audio Coding☆50May 1, 2025Updated 10 months ago
- ☆28Jul 15, 2024Updated last year
- The GitHub open source software repository on interpreting super-resolution CNNs for sub-pixel motion compensation in video coding☆11May 20, 2022Updated 3 years ago
- 44100Hz日本語HuBERTに対応した QuickVC: Any-to-many Voice Conversion Using Inverse Short-time Fourier Transform for Faster Conversion です。☆16May 21, 2023Updated 2 years ago
- Unified Speech Language Model for paper "SpeechTokenizer: Unified Speech Tokenizer for Speech Large Language Models"(ICLR 2024)☆152Sep 14, 2023Updated 2 years ago
- CK-NNTest: collaboratively validating, benchmarking and optimizing neural net operators across platforms, frameworks and datasets☆15Jul 10, 2021Updated 4 years ago