SarthakYadav/audax

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/SarthakYadav/audax)

SarthakYadav / audax

A home for audio ML in JAX. Has common features, learnable frontends, pretrained supervised and self-supervised models.

☆72

Alternatives and similar repositories for audax

Users that are interested in audax are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

boris-kuz / jaxloudnorm
View on GitHub
Jax implementation of a flexible audio loudness meter in Python with implementation of ITU-R BS.1770-4 loudness algorithm
☆13Jan 29, 2025Updated last year
mlysy / rodeo-legacy
View on GitHub
Probabilistic Solution of Differential Equations
☆13Jun 19, 2022Updated 4 years ago
Joshuaalbert / jaxnlds
View on GitHub
Inference on non-linear dynamical systems written in JAX
☆11Aug 20, 2020Updated 5 years ago
jenni-ai / T2FW
View on GitHub
Fine-Tuning Pre-trained Transformers into Decaying Fast Weights
☆20Oct 9, 2022Updated 3 years ago
ga642381 / RobustVC
View on GitHub
**ICASSP 2022** 《Toward Degradation-Robust Voice Conversion》Using speech enhancement and end-to-end denoising training to improve degrada…
☆24Sep 27, 2022Updated 3 years ago
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
brianfitzgerald / jax-mmdit
View on GitHub
Implementation of Diffusion Transformers and Rectified Flow in Jax
☆27Jul 9, 2024Updated 2 years ago
popcornell / MicRank
View on GitHub
MicRank is a Learning to Rank neural channel selection framework where a DNN is trained to rank microphone channels.
☆22Apr 8, 2021Updated 5 years ago
mutiann / speech_rankings
View on GitHub
A CSRankings-like index for speech researchers
☆35Oct 16, 2024Updated last year
SarthakYadav / fsd50k-pytorch
View on GitHub
Unofficial implementation of FSD50k baselines for Sound Event Recognition
☆27Apr 27, 2024Updated 2 years ago
nc-ai / speech
View on GitHub
☆17Aug 27, 2025Updated 11 months ago
cgarciae / treex
View on GitHub
A Pytree Module system for Deep Learning in JAX
☆212Feb 26, 2023Updated 3 years ago
NTT123 / pax
View on GitHub
A stateful pytree library for training neural networks.
☆22Aug 31, 2025Updated 11 months ago
asteroid-team / pytorch-pit
View on GitHub
Permutation invariant training in PyTorch
☆13Oct 2, 2020Updated 5 years ago
ICLR-DAP / Deep-Audio-Prior
View on GitHub
Anonymous ICLR Submission
☆14Sep 25, 2019Updated 6 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
MTG / playlists-stat-analysis
View on GitHub
Tools for Analyzing Popularity and Semantic Diversity of a Playlist Dataset
☆10Jun 17, 2024Updated 2 years ago
tuanio / nextformer
View on GitHub
PyTorch implementation of "Nextformer: A ConvNeXt Augmented Conformer For End-To-End Speech Recognition"
☆10Dec 15, 2022Updated 3 years ago
DiffEqML / tutorials
View on GitHub
☆11Apr 14, 2022Updated 4 years ago
chrisdonahue / fall23-phd-prospectives
View on GitHub
Info for prospective PhD students for Chris Donahue's lab at CMU starting Fall 23.
☆12Nov 13, 2022Updated 3 years ago
vliu15 / adversarial-tts
View on GitHub
End-to-end Text-to-Speech with Generative Adversarial Networks
☆20Feb 6, 2021Updated 5 years ago
WangHelin1997 / DuTa-VC
View on GitHub
Source code and demo for INTERPSEECH 2023 paper: DuTa-VC: A Duration-aware Typical-to-atypical Voice Conversion Approach with Diffusion P…
☆38Dec 5, 2023Updated 2 years ago
DBraun / DAC-JAX
View on GitHub
JAX Implementations of Descript Audio Codec and EnCodec
☆38Mar 30, 2025Updated last year
michalwols / yann
View on GitHub
Yet Another Neural Network Library 🤔
☆27Jul 21, 2026Updated last week
google-deepmind / jaxline
View on GitHub
☆164Dec 13, 2023Updated 2 years ago
Open source password manager - Proton Pass • Ad
Securely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
WangHelin1997 / Aty-TTS
View on GitHub
Aty-TTS: Improving fairness for spoken language understanding in atypical speech with Text-to-Speech
☆11May 14, 2025Updated last year
rhoposit / icassp2021
View on GitHub
☆15May 8, 2021Updated 5 years ago
dafyddg / RFA
View on GitHub
Implementation of the Rhythm Formant Analysis methodology for identifying speech rhythms and rhythm variation in the low frequency spectr…
☆17Apr 27, 2023Updated 3 years ago
robflynnyh / long-context-asr
View on GitHub
Code for the paper: How Much Context Does My Attention-Based ASR System Need?
☆11Jul 3, 2026Updated 3 weeks ago
lucidrains / CLAP
View on GitHub
Contrastive Language-Audio Pretraining
☆15May 18, 2021Updated 5 years ago
patil-suraj / simple-diffusion
View on GitHub
An implementation of simple diffusion in PyTorch (and JAX)
☆34Jan 28, 2023Updated 3 years ago
microsoft / knossos-ksc
View on GitHub
Compiler with automatic differentiation
☆49Oct 18, 2023Updated 2 years ago
WingZLeung / TTDS
View on GitHub
Text-to-dysarthric speech (TTDS) synthesis. An implementation using the Grad-TTS model with the TORGO database.
☆13Mar 15, 2025Updated last year
topel / audioset-convnext-inf
View on GitHub
Adapting a ConvNeXt model to audio classification on AudioSet
☆27Feb 19, 2025Updated last year
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
otnemrasordep / ProgGP
View on GitHub
A dataset of 173 progressive metal songs, in both GuitarPro and token formats, as per the specifications in DadaGP.
☆18Nov 19, 2024Updated last year
idontgetoutmuch / Kalman
View on GitHub
Extended Kalman filtering in Haskell
☆23Oct 31, 2018Updated 7 years ago
sbi-benchmark / diffeqtorch
View on GitHub
DifferentialEquations.jl with PyTorch
☆11Oct 12, 2022Updated 3 years ago
xinjli / asr2k
View on GitHub
asr2k
☆51Jun 2, 2024Updated 2 years ago
fschmid56 / EfficientAT_HEAR
View on GitHub
Evaluate EfficientAT models on the Holistic Evaluation of Audio Representations Benchmark.
☆34Jun 23, 2023Updated 3 years ago
groupmm / libf0
View on GitHub
A Python Library for Fundamental Frequency Estimation in Music Recordings
☆55Jun 5, 2026Updated last month
tzuhsien / Voice-conversion-evaluation
View on GitHub
An evaluation toolkit for voice conversion models.
☆42Jul 11, 2021Updated 5 years ago