google-deepmind / slowfast_nfnetsLinks
☆30Updated 3 years ago
Alternatives and similar repositories for slowfast_nfnets
Users that are interested in slowfast_nfnets are comparing it to the libraries listed below
Sorting:
- EVAR ~ Evaluation package for Audio Representations☆73Updated last week
- PyTorch Dataset for Speech and Music audio☆80Updated last year
- Codebase for the paper 'EncodecMAE: Leveraging neural codecs for universal audio representation learning'☆100Updated last year
- A list of papers about audio captioning☆79Updated 3 years ago
- Asteroid's filterbanks☆88Updated last year
- A list of resources that can help in research for automated audio captioning☆34Updated 4 years ago
- Pytorch Reimplementation of DiffWave Vocoder: a high quality, fast, and small neural vocoder.☆91Updated 4 years ago
- Benchmark popular audio i/o packages☆151Updated 2 years ago
- Conditioned U-Net for Music Source Separation☆20Updated 4 years ago
- Simple baseline model for the HEAR benchmark☆23Updated last week
- Implementation of DiffWave and SaShiMi audio generation models☆128Updated 2 years ago
- A library built for easier audio self-supervised training, downstream tasks evaluation☆136Updated 4 months ago
- audioLIME: Listenable Explanations Using Source Separation☆37Updated 4 years ago
- Masked Modeling Duo: Towards a Universal Audio Pre-training Framework☆136Updated 2 months ago
- ☆42Updated 5 years ago
- Inference code for PaSST, using the HEAR API.☆33Updated 2 years ago
- ☆32Updated 5 years ago
- Evaluation kit for the HEAR Benchmark☆62Updated last week
- A repository for benchmarking neural vocoders by their quality and speed.☆212Updated 8 months ago
- Source code for training models and using the hyperbolic interface proposed in our ICASSP 2023 paper, “Hyperbolic Audio Source Separation…☆69Updated 2 years ago
- ☆86Updated 2 years ago
- Official repository for the paper "Audio xLSTMs: Learning Self-supervised audio representations with xLSTMs"☆21Updated 5 months ago
- UnivNet: A Neural Vocoder with Multi-Resolution Spectrogram Discriminators for High-Fidelity Waveform Generation☆76Updated 4 years ago
- Dataset and baseline code for the VocalSound dataset (ICASSP2022).☆159Updated 3 years ago
- easy-to-use implementation of the ISMIR 2013 Audio Degradation Toolbox☆53Updated 6 years ago
- Official repository of the paper "Solving Audio Inverse Problems with a Diffusion Model", submitted to ICASSP 23☆121Updated 2 years ago
- A collection of useful audio datasets and transforms for PyTorch.☆144Updated 3 years ago
- million song dataset split for extended clean tag & artist-level stratified☆52Updated 2 years ago
- A collection of audio autoencoders, in PyTorch.☆44Updated 2 years ago
- Public Code for the paper MAE-AST: Masked Autoencoding Audio Spectrogram Transformer☆90Updated 3 years ago