SarthakYadav / audax
A home for audio ML in JAX. Has common features, learnable frontends, pretrained supervised and self-supervised models.
☆68Updated 2 years ago
Alternatives and similar repositories for audax
Users who are interested in audax are comparing it to the libraries listed below
- ☆66Updated 9 months ago
- ☆31Updated 2 years ago
- ☆30Updated 2 years ago
- Continuous Augmented Positional Embeddings (CAPE) implementation for PyTorch☆40Updated 2 years ago
- Framework for writing deep learning training loops. Lightweight, and retaining full freedom to design as you see fit. It handles checkpo…☆112Updated last year
- Pytorch Implementation of WaveNODE☆64Updated 4 years ago
- ☆15Updated 2 years ago
- [ICML 2024] SIRFShampoo: Structured inverse- and root-free Shampoo in PyTorch (https://arxiv.org/abs/2402.03496)☆14Updated 7 months ago
- A minimal Pytorch Implementation of Stochastically Quantized Variational AutoEncoder (SQ-VAE) by Sony☆31Updated last year
- Fast and differentiable hidden Markov model in C++☆17Updated 2 years ago
- ☆22Updated 7 months ago
- Code for the paper "Improving Sound Event Classification by Increasing Shift Invariance in Convolutional Neural Networks".☆13Updated 2 years ago
- Implementation of the DIVA model of speech acquisition and production using PyTorch☆21Updated 2 years ago
- ☆56Updated 2 years ago
- Accompanying code for our paper "Optimizing Short-Time Fourier Transform Parameters via Gradient Descent"☆33Updated 4 years ago
- Implementation of DiffWave and SaShiMi audio generation models☆122Updated 2 years ago
- Speech in Flax/JAX☆15Updated 2 years ago
- Evaluation kit for the HEAR Benchmark☆59Updated last week
- Local Attention - Flax module for Jax☆22Updated 4 years ago
- Applications using the GTN library and code to reproduce experiments in "Differentiable Weighted Finite-State Transducers"☆83Updated 2 years ago
- logWMSE, an audio quality metric with support for digital silence target. Useful for evaluating audio source separation systems, even whe…☆35Updated 8 months ago
- My attempts at applying Soundstream design on learned tokenization of text and then applying hierarchical attention to text generation☆87Updated 7 months ago
- Transcribing Speech with Multinomial Diffusion, training code and models.☆76Updated last year
- PyTorch Dataset for Speech and Music audio☆76Updated 10 months ago
- Jax/Flax implementation of Variational-DiffWave.☆40Updated 3 years ago
- PyTorch implementation of the paper "NanoFlow: Scalable Normalizing Flows with Sublinear Parameter Complexity." (NeurIPS 2020)☆66Updated 4 years ago
- A collection of utilities for handling IPA phones.☆25Updated last year
- Contrastive Language-Audio Pretraining☆15Updated 4 years ago
- Applying reinforcement learning to perform source separation.☆23Updated 4 years ago
- ☆32Updated 3 years ago