RoganInglis/AudioLM

RoganInglis / AudioLM

☆24

Alternatives and similar repositories for AudioLM

Users that are interested in AudioLM are comparing it to the libraries listed below.

yoyolicoris / spectrogram-inversion
spectrogram inversion tools in PyTorch. Documentation: https://spectrogram-inversion.readthedocs.io
☆51 · Jun 12, 2025 · Updated last year
ArenAcikgoz / Whisper-Alignment
Forced alignment decoder for Whisper.
☆16 · Mar 13, 2024 · Updated 2 years ago
yoyolicoris / wavenet-like-vocoder
Basic wavenet and fftnet vocoder model.
☆19 · Feb 7, 2022 · Updated 4 years ago
MingjieChen / VoiceConversionGANs
GAN series for voice conversion on VCC2018 dataset
☆17 · Aug 27, 2020 · Updated 5 years ago
yazgoo / blockish-caca
video players in the terminal with blockish over libcaca with LD_PRELOAD magic
☆10 · Oct 16, 2023 · Updated 2 years ago
exercise-book-yq / Supercodec
☆51 · Mar 5, 2026 · Updated 4 months ago
colaudiolab / AudioSet-R
Official implementation: "AudioSet-R: A Refined AudioSet with Multi-Stage LLM Label Reannotation"
☆19 · Oct 9, 2025 · Updated 9 months ago
jwr1995 / DTCN
☆19 · Oct 26, 2023 · Updated 2 years ago
BrightGu / MediumVC
Any-to-any voice conversion using synthetic specific-speaker speeches as intermedium features
☆55 · Oct 11, 2021 · Updated 4 years ago
taehong-moon / ee-diffusion
Official Code Implementation for 'A Simple Early Exiting Framework for Accelerated Sampling in Diffusion Models'
☆20 · Jul 24, 2024 · Updated 2 years ago
AgentCooper2002 / EDMSound
Codebase and project page for EDMSound
☆35 · Nov 20, 2023 · Updated 2 years ago
kyutai-labs / jax-flash-attn3
JAX bindings for the flash-attention3 kernels
☆23 · Jan 2, 2026 · Updated 6 months ago
topel / audioset-convnext-inf
Adapting a ConvNeXt model to audio classification on AudioSet
☆27 · Feb 19, 2025 · Updated last year
Ereboas / MagiCodec
A single-layer, streaming codec model providing SOTA audio quality and discrete tokens designed for superior downstream modelability.
☆125 · Jun 4, 2025 · Updated last year
CarlWangChina / MuChin
MuChin: A Chinese Colloquial Description Benchmark for Evaluating Language Models in the Field of Music
☆27 · Jan 7, 2026 · Updated 6 months ago
AlessioSam / LADiff
The official PyTorch implementation of "The 18th European Conference on Computer Vision" (ECCV 2024) paper Length-Aware Motion Synthesis …
☆19 · Dec 15, 2024 · Updated last year
music-x-lab / Self-Supervised-Metrical-Structure
☆15 · Sep 20, 2023 · Updated 2 years ago
zachary-shah / riff-cnet
Controlled audio inpainting using SD-fine tuned model Riffusion in a ControlNet Architecture
☆33 · May 31, 2023 · Updated 3 years ago
jaeyeun97 / MelNet
A Pytorch Implementation of MelNet
☆26 · Apr 13, 2020 · Updated 6 years ago
paarthneekhara / advoc
Vocode spectrograms to audio with generative adversarial networks
☆64 · Aug 8, 2019 · Updated 6 years ago
4AI / langml
A Keras-based and TensorFlow-backend NLP Models Toolkit.
☆12 · Jul 7, 2022 · Updated 4 years ago
menajosep / tensorflow-doc2vec
Reproduce the paper Distributed Representations of Sentences and Documents in tensorflow
☆14 · Apr 8, 2017 · Updated 9 years ago
Mddct / simple-tts
(WIP) long-form speech generation
☆30 · Apr 2, 2025 · Updated last year
johncf / text2phones
Attentional Neural Network that translates text to phones.
☆11 · Jan 25, 2018 · Updated 8 years ago
ictnlp / GMA
Code for ACL 2022 findings paper "Gaussian Multi-head Attention for Simultaneous Machine Translation"
☆11 · Mar 31, 2022 · Updated 4 years ago
multimodal-art-projection / Open-Suno
trying to reproduce suno v3
☆35 · Jan 29, 2025 · Updated last year
jinxulin / chinese-text2vec
A simple PyTorch implementation of vector representation methods for Chinese text (Sentence-BERT, CoSENT), usable for computing text similarity.
☆10 · Mar 27, 2022 · Updated 4 years ago
Mddct / cosyvoice2-flow-optimized
faster inference
☆27 · Jan 20, 2025 · Updated last year
lucidrains / audiolm-pytorch
Implementation of AudioLM, a SOTA Language Modeling Approach to Audio Generation out of Google Research, in Pytorch
☆2,623 · Jan 12, 2025 · Updated last year
tennisonliu / noise_reduction
Using Spectral Noise Gating (SNG) techniques to reduce background noise in streaming microphone input for enhanced vocal recognition
☆24 · Dec 10, 2018 · Updated 7 years ago
sigsep / bsseval
audio source separation evaluation metrics
☆29 · Aug 27, 2019 · Updated 6 years ago
Fight-hawk / TextCNN-keras
☆10 · May 6, 2020 · Updated 6 years ago
heraclex12 / vietpunc
Vietnamese Punctuation Prediction using Pretrained Language Models
☆14 · May 8, 2022 · Updated 4 years ago
lonce / SPSI_Python
Single Pass Spectrogram Inversion in a Jupyter Python notebook
☆34 · Aug 10, 2017 · Updated 8 years ago
Mddct / transformer-vocos
☆35 · Sep 6, 2025 · Updated 10 months ago
sjhan91 / MusicBERT
The implementation of "Systematic Analysis of Music Representations from BERT"
☆28 · May 23, 2023 · Updated 3 years ago
Hiroshiba / openjtalk-label-getter
☆10 · Dec 10, 2021 · Updated 4 years ago
lifeiteng / NotebookTTS
Text-To-Speech for NotebookLM
☆39 · Jul 20, 2025 · Updated last year
scottishfold0621 / ACMID
☆26 · Apr 30, 2026 · Updated 2 months ago