google-deepmind / slowfast_nfnets
☆30Updated 2 years ago
Alternatives and similar repositories for slowfast_nfnets
Users that are interested in slowfast_nfnets are comparing it to the libraries listed below
Sorting:
- Official implementation of "Learning Music Audio Representations Via Weak Language Supervision" (ICASSP 2022)☆46Updated 5 months ago
- ARCH: Audio Representations benCHmark☆45Updated 8 months ago
- ☆32Updated 4 years ago
- Codebase for the paper 'EncodecMAE: Leveraging neural codecs for universal audio representation learning'☆96Updated 9 months ago
- EVAR ~ Evaluation package for Audio Representations☆54Updated last week
- Asteroid's filterbanks☆84Updated 4 months ago
- UnivNet: A Neural Vocoder with Multi-Resolution Spectrogram Discriminators for High-Fidelity Waveform Generation☆74Updated 3 years ago
- Source code for training models and using the hyperbolic interface proposed in our ICASSP 2023 paper, “Hyperbolic Audio Source Separation…☆65Updated 2 years ago
- PyTorch Dataset for Speech and Music audio☆75Updated 10 months ago
- ☆59Updated 4 years ago
- A library built for easier audio self-supervised training, downstream tasks evaluation☆117Updated 8 months ago
- Masked Modeling Duo: Towards a Universal Audio Pre-training Framework☆99Updated 9 months ago
- Evaluation kit for the HEAR Benchmark☆59Updated this week
- Official repository for the paper "Audio xLSTMs: Learning Self-supervised audio representations with xLSTMs"☆14Updated 7 months ago
- ☆36Updated 4 years ago
- Inference code for PaSST, using the HEAR API.☆33Updated last year
- A collection of audio autoencoders, in PyTorch.☆40Updated 2 years ago
- ☆83Updated last year
- Simple baseline model for the HEAR benchmark☆23Updated last month
- A list of resources that can help in research for automated audio captioning☆34Updated 4 years ago
- A list of papers about audio captioning☆77Updated 2 years ago
- The Pytorch implementation of paper: Masked Spectrogram Prediction For Self-Supervised Audio Pre-Training☆42Updated 4 months ago
- audioLIME: Listenable Explanations Using Source Separation☆35Updated 3 years ago
- Baseline systems for the FSD50K dataset☆69Updated 3 years ago
- Improving Recording Device Generalization using Impulse Response Augmentation☆16Updated 2 weeks ago
- Official Pytorch implementation of PULSE: Positive–Unlabelled Learning for audio Signal Enhancement (Best Paper Award at ICASSP 2023)☆41Updated last year
- Unofficial implementation of FSD50k baselines for Sound Event Recognition☆26Updated last year
- A standardized toolkit of Kernel Audio Distance (KAD)—a distribution-free, unbiased, and computationally efficient metric for evaluating …☆67Updated last month
- Who calls the shots? Rethinking Few-Shot Learning for Audio (WASPAA 2021)☆43Updated 2 years ago
- Masked Spectrogram Modeling using Masked Autoencoders for Learning General-purpose Audio Representations☆90Updated 10 months ago