SonyCSLParis/codicodec

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/SonyCSLParis/codicodec)

SonyCSLParis / codicodec

Encode and decode audio samples to/from continuous and discrete compressed representations!

☆121

Alternatives and similar repositories for codicodec

Users that are interested in codicodec are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

SonyCSLParis / music2latent
View on GitHub
Encode and decode audio samples to/from compressed latent representations!
☆267Sep 19, 2025Updated 10 months ago
YoonjinXD / kadtk
View on GitHub
A standardized toolkit of Kernel Audio Distance (KAD)—a distribution-free, unbiased, and computationally efficient metric for evaluating …
☆104Jun 12, 2025Updated last year
kandinskylab / kvae-audio
View on GitHub
KVAE-Audio: a continuous full-band audio waveform autoencoder
☆98Jun 30, 2026Updated 2 weeks ago
astradzhao / music-rfm
View on GitHub
Open Source code for our paper, Steering Autoregressive Music Generation with Recursive Feature Machines (Zhao et al., 2025). aka MusicRF…
☆40Oct 26, 2025Updated 8 months ago
bernardo-torres / linear-autoencoders
View on GitHub
Official code and pretrained models for Linear Consistency Autoencoders (Lin-CAE), a method to induce linearity in audio autoencoders via…
☆17Feb 12, 2026Updated 5 months ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
jasper-zheng / music2latent-scripted
View on GitHub
Scripting Music2Latent to TorchScript for streamable continuous inference in MaxMSP/PureData
☆22Feb 5, 2026Updated 5 months ago
SonyResearch / VRVQ
View on GitHub
Variable Bitrate Residual Vector Quantization for Audio Coding
☆54May 1, 2025Updated last year
yoongi43 / VRVQ
View on GitHub
Implementation of the paper "Variable Bitrate Residual Vector Quantization for Audio Coding"
☆11Apr 10, 2025Updated last year
sh-lee97 / grafx
View on GitHub
GRAFX: An Open-Source Library for Audio Processing Graphs in PyTorch
☆139Jun 29, 2026Updated 3 weeks ago
KyungsuKim42 / tokensynth
View on GitHub
The official implementation of TokenSynth (ICASSP 2025)
☆91Jun 24, 2026Updated 3 weeks ago
Aria-K-Alethia / BigCodec
View on GitHub
Official implementation of the paper "BigCodec: Pushing the Limits of Low-Bitrate Neural Speech Codec"
☆218Sep 19, 2024Updated last year
NilsDem / control-transfer-diffusion
View on GitHub
Repository for the paper "Combining audio control and style transfer using latent diffusion", accepted at ISMIR 2024
☆67Feb 19, 2025Updated last year
yhj137 / PianistTransformer
View on GitHub
This is the official implementation for the paper "Pianist Transformer: Towards Expressive Piano Performance Rendering via Scalable Self-…
☆43Jun 25, 2026Updated 3 weeks ago
sony / diffusion-timbre-transfer
View on GitHub
☆56Nov 5, 2024Updated last year
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
lonzi / mrflow_dpo
View on GitHub
☆22Jan 3, 2026Updated 6 months ago
facebookresearch / dacvae
View on GitHub
DACVAE
☆226Dec 22, 2025Updated 6 months ago
loubbrad / aria-midi
View on GitHub
Official repository for Aria-MIDI: a MIDI dataset of 1,186,253 transcribed solo-piano recordings.
☆98Jun 19, 2025Updated last year
bfs18 / armel
View on GitHub
poorman's ar-dit tts
☆45Dec 31, 2025Updated 6 months ago
SonyCSLParis / audioic
View on GitHub
Estimating musical surprisal/information content in Audio
☆34Apr 9, 2026Updated 3 months ago
JusperLee / Gull-Codec-Training
View on GitHub
☆12Mar 11, 2025Updated last year
acids-ircam / platune
View on GitHub
This is the official repository of PLaTune, our Pretrained Latents Tuner model that enables to add temporal musical controls on top of pr…
☆18Jun 28, 2025Updated last year
primepake / dac_vae
View on GitHub
Descript Audio Codec - VAE Variant (.dac-vae): High-Fidelity Audio Compression with Variational Autoencoder
☆38Aug 30, 2025Updated 10 months ago
MTG / omar-rq
View on GitHub
Training, validation, and inference code for various SSL approaches and architectures.
☆87Apr 7, 2026Updated 3 months ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
SonyCSLParis / pesto
View on GitHub
Self-supervised learning for real-time pitch estimation
☆297Oct 15, 2025Updated 9 months ago
SonyCSLParis / ssl-singer-identity
View on GitHub
☆69Nov 6, 2023Updated 2 years ago
tencent-ailab / MuCodec
View on GitHub
☆168Nov 22, 2024Updated last year
minzwon / musicfm
View on GitHub
☆268Feb 14, 2024Updated 2 years ago
microsoft / fadtk
View on GitHub
A simple library for Fréchet Audio Distance (FAD) calculation
☆266Aug 22, 2025Updated 10 months ago
jjunak-yun / FLowHigh_code
View on GitHub
[ICASSP 2025] "FLowHigh: Towards efficient and high-quality audio super-resolution with single-step flow matching"
☆118Jan 17, 2025Updated last year
k2-fsa / Flow2GAN
View on GitHub
Hybrid Flow Matching and GAN with Multi-Resolution Network for Few-Step High-Fidelity Audio Generation
☆144Mar 8, 2026Updated 4 months ago
chrispla / mir_ref
View on GitHub
A Representation Evaluation Framework for Music Information Retrieval tasks
☆54Apr 9, 2024Updated 2 years ago
tencent-ailab / MuQ
View on GitHub
Official repository of the paper "MuQ: Self-Supervised Music Representation Learning with Mel Residual Vector Quantization".
☆357Aug 4, 2025Updated 11 months ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
facebookresearch / FlowDec
View on GitHub
An neural full-band audio codec for general audio sampled at 48 kHz with 7.5 kps or 4.5 kbps.
☆212Jun 22, 2026Updated 3 weeks ago
hhguo / SoCodec
View on GitHub
Ultra-low-bitrate Speech Codec for Speech Language Modeling Applications
☆92Dec 20, 2024Updated last year
WWWWxp / M3-TTS
View on GitHub
Pytorch Implementation of the paper "M3-TTS: Multi-modal DiT Alignment & Mel-latent for Zero-shot High-fidelity Speech Synthesis"
☆122Dec 18, 2025Updated 7 months ago
fcaspe / BRAVE
View on GitHub
Low-latency timbre transfer models for instrumental interaction.
☆106Oct 10, 2025Updated 9 months ago
xiquan-li / MeanAudio
View on GitHub
[ACL 2026 Main] MeanAudio: Fast and Faithful Text-to-Audio Generation with Mean Flows
☆142Sep 2, 2025Updated 10 months ago
Soul-AILab / SAC
View on GitHub
[ACL 2026 Main] Training, inference, and testing of the SAC speech codec model.
☆108Nov 1, 2025Updated 8 months ago
meaningTeam / tidy-tunes
View on GitHub
Tidy Tunes is an easy-to-use pipeline for mining high-quality audio data for speech generation models. To do so, it chains multiple open …
☆23May 19, 2026Updated 2 months ago