A collections of audio codecs with a standardized API
☆39Apr 15, 2026Updated this week
Alternatives and similar repositories for audiocodecs
Users that are interested in audiocodecs are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆12Nov 13, 2024Updated last year
- ☆20Nov 28, 2024Updated last year
- [ACL 2026 Main] Training, inference, and testing of the SAC speech codec model.☆100Nov 1, 2025Updated 5 months ago
- The demo page for ALMTokenizer☆59Apr 14, 2025Updated last year
- Official implementation of the paper "BigCodec: Pushing the Limits of Low-Bitrate Neural Speech Codec"☆214Sep 19, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆33Oct 23, 2025Updated 5 months ago
- Unofficial PyTorch implementation of "Autoregressive Speech Synthesis without Vector Quantization (MELLE)"☆41Jun 28, 2025Updated 9 months ago
- [ACL 2025 Main] UniCodec: a unified audio codec with a single codebook to support multi-domain audio data, including speech, music, and s…☆154May 30, 2025Updated 10 months ago
- A lightweight audio codec based on a single quantizer☆69Aug 15, 2025Updated 8 months ago
- Official repository of Myna: Masking-Based Contrastive Learning of Musical Representations☆17Mar 31, 2025Updated last year
- A small rust-based data loader☆36Feb 20, 2026Updated last month
- Detect individual instruments activity in an audio file. 🎤🎹🎸🥁☆16Jun 29, 2021Updated 4 years ago
- semantic tokenizer for speech and music☆21Jul 6, 2025Updated 9 months ago
- [NeurIPS 2024] Exploring DCN-like Architectures for Fast Image Generation with Arbitrary Resolution☆35Dec 23, 2024Updated last year
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Codec for paper: LLaSA: Scaling Train-time and Inference-time Compute for LLaMA-based Speech Synthesis☆349Jul 21, 2025Updated 8 months ago
- [ICASSP 2026] Task Vector in TTS: Toward Emotionally Expressive Dialectal Speech Synthesis☆39Dec 24, 2025Updated 3 months ago
- [NAACL 2025] WaveFM: A High-Fidelity and Efficient Vocoder Based on Flow Matching☆125Apr 8, 2026Updated last week
- ☆12Nov 7, 2024Updated last year
- The open source code for SimpleSpeech series☆144Oct 8, 2024Updated last year
- NEAR - Neonatal EEG Artifact Removal, an automated pipeline for pre-processing.☆12Feb 4, 2026Updated 2 months ago
- ☆14Jan 17, 2023Updated 3 years ago
- ☆157Nov 22, 2024Updated last year
- A low-bitrate single-codebook 16 / 24 kHz speech codec based on focal modulation☆162Nov 30, 2025Updated 4 months ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- A Bach music generator with Artificial Intelligence. This model is made by a VQ-VAE + Transformer (decoder-only). Sequences of midi 1 qu…☆44Sep 21, 2023Updated 2 years ago
- [ICML 2025 Tokenization Workshop] HH-Codec: High Compression High-fidelity Discrete Neural Codec for Spoken Language Modeling☆90Sep 28, 2025Updated 6 months ago
- ☆40Jul 15, 2025Updated 9 months ago
- VoxInstruct: Expressive Human Instruction-to-Speech Generation with Unified Multilingual Codec Language Modelling☆98Nov 9, 2024Updated last year
- Lightweight Bayesian deep learning library for fast prototyping based on PyTorch☆14Feb 24, 2023Updated 3 years ago
- official implementation of MGA-CLAP (ACM MM 2024)☆30Oct 25, 2024Updated last year
- ☆48Aug 28, 2025Updated 7 months ago
- ☆26Aug 2, 2024Updated last year
- Code for vec2wav 2.0, a speech token vocoder for VC. Paper: https://arxiv.org/abs/2409.01995☆79Dec 3, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Code for "Sliced Maximal Information Coefficient: A Training-Free Approach for Image Quality Assessment Enhancement"☆11Apr 18, 2024Updated 2 years ago
- MutiModel paper reading (Visual, Audio)☆21Nov 24, 2025Updated 4 months ago
- This is the official implementation for εar-VAE model including inference and evaluation parts, more details coming soon...☆69Feb 13, 2026Updated 2 months ago
- [EMNLP 2024] ESC: Efficient Speech Coding with Cross-Scale Residual Vector Quantized Transformers☆125Mar 20, 2025Updated last year
- [InterSpeech 24] FreeV: Free Lunch For Vocoders Through Pseudo Inversed Mel Filter☆97Jul 4, 2024Updated last year
- ☆21Dec 19, 2023Updated 2 years ago
- Arabic Grapheme-to-Phoneme (G2P) Conversion☆13Mar 15, 2025Updated last year