DBraun/DAC-JAX

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/DBraun/DAC-JAX)

DBraun / DAC-JAX

JAX Implementations of Descript Audio Codec and EnCodec

☆37

Alternatives and similar repositories for DAC-JAX

Users that are interested in DAC-JAX are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

ETH-DISCO / discoder
View on GitHub
Official repo for DisCoder: High-Fidelity Music Vocoder using Neural Audio Codecs presented at ICASSP 2025
☆42Feb 24, 2025Updated last year
YangXusheng-yxs / CodecFormer_5Hz
View on GitHub
☆35Oct 23, 2025Updated 9 months ago
exercise-book-yq / Supercodec
View on GitHub
☆51Mar 5, 2026Updated 4 months ago
exercise-book-yq / FreeCodec
View on GitHub
FREECODEC: A DISENTANGLED NEURAL SPEECH CODEC WITH FEWER TOKENS
☆24Sep 9, 2024Updated last year
orchidas / DiffGFDN
View on GitHub
Differentiable grouped feedback delay networks for late reverberation modelling in coupled spaces
☆16Jun 4, 2026Updated last month
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
slSeanWU / beats-conformer-bart-audio-captioner
View on GitHub
PyTorch implementation of the ICASSP-24 paper: "Improving Audio Captioning Models with Fine-grained Audio Features, Text Embedding Superv…
☆41Jan 6, 2024Updated 2 years ago
line / WaveTrainerFit
View on GitHub
Official implementation of "Wave-Trainer-Fit: Neural Vocoder with Trainable Prior and Fixed-Point Iteration towards High-Quality Speech G…
☆16Feb 6, 2026Updated 5 months ago
xkx-hub / KALL-E
View on GitHub
[AAAI 2026 oral] KALL-E:Autoregressive Speech Synthesis with Next-Distribution Prediction
☆42Sep 25, 2025Updated 9 months ago
ozspeech / OZSpeech
View on GitHub
[ACL 2025] OZSpeech: One-step Zero-shot Speech Synthesis with Learned-Prior-Conditioned Flow Matching
☆45Feb 9, 2025Updated last year
boris-kuz / jaxloudnorm
View on GitHub
Jax implementation of a flexible audio loudness meter in Python with implementation of ITU-R BS.1770-4 loudness algorithm
☆13Jan 29, 2025Updated last year
dinhoitt / BemaGANv2
View on GitHub
☆21Mar 3, 2026Updated 4 months ago
bineferg / thunder-synthesis
View on GitHub
A generative synthesised sound effect that can dynamically alter context-dependent mixing techniques and audio effects found at nemisindo…
☆13Apr 19, 2022Updated 4 years ago
lucadellalib / discrete-wavlm-codec
View on GitHub
A neural speech codec based on discrete WavLM representations
☆26Aug 28, 2024Updated last year
AbrahamSanders / codec-bpe
View on GitHub
Implementation of Acoustic BPE (Shen et al., 2024), extended for RVQ-based Neural Audio Codecs
☆76Dec 3, 2025Updated 7 months ago
Open source password manager - Proton Pass • Ad
Securely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
furkanyesiler / re-move
View on GitHub
Training and evaluation code for Re-MOVE models with embedding distillation
☆31Jul 6, 2023Updated 3 years ago
zhai-lw / L3AC
View on GitHub
A lightweight audio codec based on a single quantizer
☆35Sep 4, 2025Updated 10 months ago
brianfitzgerald / jax-mmdit
View on GitHub
Implementation of Diffusion Transformers and Rectified Flow in Jax
☆27Jul 9, 2024Updated 2 years ago
google / sequence-layers
View on GitHub
A neural network layer API and library for sequence modeling, designed for easy creation of sequence models that can be executed layerwis…
☆64Jun 26, 2026Updated 3 weeks ago
haoheliu / SemantiCodec
View on GitHub
☆45Jun 11, 2024Updated 2 years ago
luotianze666 / WaveFM
View on GitHub
[NAACL 2025] WaveFM: A High-Fidelity and Efficient Vocoder Based on Flow Matching
☆133Apr 8, 2026Updated 3 months ago
thuhcsi / VoxInstruct
View on GitHub
VoxInstruct: Expressive Human Instruction-to-Speech Generation with Unified Multilingual Codec Language Modelling
☆100Nov 9, 2024Updated last year
zhai-lw / SQCodec
View on GitHub
A lightweight audio codec based on a single quantizer
☆72Aug 15, 2025Updated 11 months ago
flamed-tts / Flamed-TTS
View on GitHub
This repository implement a novel zero-shot TTS framework, named Flamed-TTS, focusing on the efficient generation and dynamic pacing in …
☆57Aug 9, 2025Updated 11 months ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
seongho608 / RingFormer
View on GitHub
☆52Jun 24, 2025Updated last year
symoon11 / dreamerv3-flax
View on GitHub
Flax Implementation of DreamerV3 on Crafter
☆18Nov 29, 2025Updated 7 months ago
zjlww / dsp
View on GitHub
Digital Speech Processing in PyTorch.
☆15Aug 12, 2022Updated 3 years ago
line / open-universe
View on GitHub
Open implementation of UNIVERSE and UNIVERSE++ diffusion-based speech enhancement models.
☆118Aug 29, 2024Updated last year
supki / liblastfm
View on GitHub
Lastfm API interface.
☆13Mar 22, 2020Updated 6 years ago
gregogiudici / python-stretch
View on GitHub
Simple python library for pitch shifting and time stretching. Wrapper of Signalsmith Stretch C++ Library
☆16May 24, 2026Updated last month
MuyangDu / T5Voice
View on GitHub
T5Voice is a lightweight PyTorch implementation of T5-based text-to-speech synthesis, supporting both streaming and non-streaming speech …
☆28Nov 7, 2025Updated 8 months ago
RayYuki / CodecBench
View on GitHub
☆24Nov 16, 2025Updated 8 months ago
bfs18 / armel
View on GitHub
poorman's ar-dit tts
☆45Dec 31, 2025Updated 6 months ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
DiffAPF / LA-2A
View on GitHub
Feed-forward compressor experiments source code for "Differentiable All-pole Filters for Time-varying Audio Systems".
☆24Jun 10, 2024Updated 2 years ago
ryota-komatsu / speaker_disentangled_hubert
View on GitHub
Official repository of the IEEE OJSP paper "Speaker-Disentangled Chunk-Wise Regression for Syllabic Tokenization"
☆46Updated this week
OlaWod / PitchVC
View on GitHub
PitchVC: Pitch Conditioned Any-to-Many Voice Conversion
☆35Jun 6, 2024Updated 2 years ago
steckes / rust-audio-plugin
View on GitHub
The most simple scaffold for an audio plugin in Rust using nih-plug
☆15May 18, 2025Updated last year
ydqmkkx / ShallowFlowMatching-TTS
View on GitHub
Official implementation of paper: Shallow Flow Matching for Coarse-to-Fine Text-to-Speech Synthesis
☆55Sep 20, 2025Updated 10 months ago
SarthakYadav / audax
View on GitHub
A home for audio ML in JAX. Has common features, learnable frontends, pretrained supervised and self-supervised models.
☆72Jul 24, 2022Updated 3 years ago
pier-maker92 / ADT_STR
View on GitHub
Automatic Drum Transcription with CLAP-based unsupervised sample curation for synthetic training data generation.
☆18May 5, 2026Updated 2 months ago