A collection of audio codecs with a standardized API
☆36May 27, 2025Updated 10 months ago
Alternatives and similar repositories for audiocodecs
Users who are interested in audiocodecs are comparing it to the libraries listed below.
- ☆20Nov 28, 2024Updated last year
- ☆12Nov 13, 2024Updated last year
- Training, inference, and testing of the SAC speech codec model.☆100Nov 1, 2025Updated 4 months ago
- The demo page for ALMTokenizer☆59Apr 14, 2025Updated 11 months ago
- Official implementation of the paper "BigCodec: Pushing the Limits of Low-Bitrate Neural Speech Codec"☆213Sep 19, 2024Updated last year
- ☆32Oct 23, 2025Updated 5 months ago
- Unofficial PyTorch implementation of "Autoregressive Speech Synthesis without Vector Quantization (MELLE)"☆41Jun 28, 2025Updated 9 months ago
- [ACL 2025 Main] UniCodec: a unified audio codec with a single codebook to support multi-domain audio data, including speech, music, and s…☆154May 30, 2025Updated 10 months ago
- A lightweight audio codec based on a single quantizer☆69Aug 15, 2025Updated 7 months ago
- Official repository of Myna: Masking-Based Contrastive Learning of Musical Representations☆17Mar 31, 2025Updated 11 months ago
- A small rust-based data loader☆36Feb 20, 2026Updated last month
- semantic tokenizer for speech and music☆21Jul 6, 2025Updated 8 months ago
- [NeurIPS 2024] Exploring DCN-like Architectures for Fast Image Generation with Arbitrary Resolution☆34Dec 23, 2024Updated last year
- Codec for paper: LLaSA: Scaling Train-time and Inference-time Compute for LLaMA-based Speech Synthesis☆349Jul 21, 2025Updated 8 months ago
- [ICASSP 2026] Task Vector in TTS: Toward Emotionally Expressive Dialectal Speech Synthesis☆38Dec 24, 2025Updated 3 months ago
- [NAACL 2025] WaveFM: A High-Fidelity and Efficient Vocoder Based on Flow Matching☆124Mar 27, 2025Updated last year
- [ICML 2025 Tokenization Workshop] HH-Codec: High Compression High-fidelity Discrete Neural Codec for Spoken Language Modeling☆85Sep 28, 2025Updated 6 months ago
- A low-bitrate single-codebook 16 / 24 kHz speech codec based on focal modulation☆158Nov 30, 2025Updated 4 months ago
- ☆12Nov 7, 2024Updated last year
- The open source code for SimpleSpeech series☆145Oct 8, 2024Updated last year
- NEAR - Neonatal EEG Artifact Removal, an automated pipeline for pre-processing.☆12Feb 4, 2026Updated last month
- ☆14Jan 17, 2023Updated 3 years ago
- ☆157Nov 22, 2024Updated last year
- ☆40Jul 15, 2025Updated 8 months ago
- VoxInstruct: Expressive Human Instruction-to-Speech Generation with Unified Multilingual Codec Language Modelling☆97Nov 9, 2024Updated last year
- Lightweight Bayesian deep learning library for fast prototyping based on PyTorch☆14Feb 24, 2023Updated 3 years ago
- ☆46Aug 28, 2025Updated 7 months ago
- Official implementation of MGA-CLAP (ACM MM 2024)☆30Oct 25, 2024Updated last year
- ☆25Aug 2, 2024Updated last year
- Code for vec2wav 2.0, a speech token vocoder for VC. Paper: https://arxiv.org/abs/2409.01995☆79Dec 3, 2024Updated last year
- Code for "Sliced Maximal Information Coefficient: A Training-Free Approach for Image Quality Assessment Enhancement"☆11Apr 18, 2024Updated last year
- Multimodal paper reading (Visual, Audio)☆21Nov 24, 2025Updated 4 months ago
- This is the official implementation for εar-VAE model including inference and evaluation parts, more details coming soon...☆68Feb 13, 2026Updated last month
- [EMNLP 2024] ESC: Efficient Speech Coding with Cross-Scale Residual Vector Quantized Transformers☆125Mar 20, 2025Updated last year
- [InterSpeech 24] FreeV: Free Lunch For Vocoders Through Pseudo Inversed Mel Filter☆93Jul 4, 2024Updated last year
- ☆22Dec 19, 2023Updated 2 years ago
- Official PyTorch implementation of the paper "Graph-based Generative Face Anonymisation with Pose Preservation" in ICIAP 2021☆14Dec 13, 2021Updated 4 years ago
- Arabic Grapheme-to-Phoneme (G2P) Conversion☆13Mar 15, 2025Updated last year
- Official Repository of Paper: "SynParaSpeech: Automated Synthesis of Paralinguistic Datasets for Speech Generation and Understanding" (IC…☆67Jan 27, 2026Updated 2 months ago