google-deepmind / slowfast_nfnets
☆30 · Updated 2 years ago
Alternatives and similar repositories for slowfast_nfnets:
Users who are interested in slowfast_nfnets are comparing it to the libraries listed below:
- Official implementation of "Learning Music Audio Representations Via Weak Language Supervision" (ICASSP 2022) ☆46 · Updated 3 months ago
- Codebase for the paper 'EncodecMAE: Leveraging neural codecs for universal audio representation learning' ☆95 · Updated 8 months ago
- Inference code for PaSST, using the HEAR API. ☆31 · Updated last year
- ☆32 · Updated 4 years ago
- EVAR ~ Evaluation package for Audio Representations ☆47 · Updated 4 months ago
- ARCH: Audio Representations benCHmark ☆43 · Updated 7 months ago
- PyTorch Dataset for Speech and Music audio ☆73 · Updated 8 months ago
- Simple baseline model for the HEAR benchmark ☆23 · Updated last week
- Asteroid's filterbanks ☆83 · Updated 2 months ago
- A list of resources that can help in research for automated audio captioning ☆34 · Updated 4 years ago
- Masked Modeling Duo: Towards a Universal Audio Pre-training Framework ☆89 · Updated 7 months ago
- A standardized toolkit of Kernel Audio Distance (KAD)—a distribution-free, unbiased, and computationally efficient metric for evaluating … ☆63 · Updated 2 weeks ago
- million song dataset split for extended clean tag & artist-level stratified ☆48 · Updated last year
- experiments about AudioSet ☆44 · Updated last year
- A library built for easier audio self-supervised training, downstream tasks evaluation ☆115 · Updated 7 months ago
- An invertible and differentiable implementation of the Constant-Q Transform (CQT). ☆58 · Updated 2 years ago
- Who calls the shots? Rethinking Few-Shot Learning for Audio (WASPAA 2021) ☆42 · Updated 2 years ago
- ☆58 · Updated 4 years ago
- A list of papers about audio captioning ☆77 · Updated 2 years ago
- SA-toolkit: Speaker speech anonymization toolkit in python ☆23 · Updated last week
- Baseline systems for the FSD50K dataset ☆68 · Updated 3 years ago
- Training code and trained checkpoints for ASGAN. ☆62 · Updated last year
- Unofficial implementation of FSD50k baselines for Sound Event Recognition ☆24 · Updated 11 months ago
- Evaluation kit for the HEAR Benchmark ☆58 · Updated this week
- Learning differentiable temporal resolution on time-series data. ☆36 · Updated 2 years ago
- Conditioned U-Net for Music Source Separation ☆20 · Updated 3 years ago
- UnivNet: A Neural Vocoder with Multi-Resolution Spectrogram Discriminators for High-Fidelity Waveform Generation ☆74 · Updated 3 years ago
- audioLIME: Listenable Explanations Using Source Separation ☆35 · Updated 3 years ago
- Source code for training models and using the hyperbolic interface proposed in our ICASSP 2023 paper, “Hyperbolic Audio Source Separation… ☆64 · Updated last year
- Source code for "MusCaps: Generating Captions for Music Audio" (IJCNN 2021) ☆80 · Updated 3 months ago