google-deepmind / slowfast_nfnetsLinks
☆30Updated 3 years ago
Alternatives and similar repositories for slowfast_nfnets
Users that are interested in slowfast_nfnets are comparing it to the libraries listed below
Sorting:
- A list of papers about audio captioning☆79Updated 3 years ago
- Codebase for the paper 'EncodecMAE: Leveraging neural codecs for universal audio representation learning'☆99Updated last year
- Unofficial implementation of FSD50k baselines for Sound Event Recognition☆27Updated last year
- audioLIME: Listenable Explanations Using Source Separation☆37Updated 4 years ago
- Asteroid's filterbanks☆88Updated 10 months ago
- Simple baseline model for the HEAR benchmark☆23Updated this week
- PyTorch Dataset for Speech and Music audio☆79Updated last year
- EVAR ~ Evaluation package for Audio Representations☆68Updated this week
- A list of resources that can help in research for automated audio captioning☆34Updated 4 years ago
- WaveNet auto-ancoders for ZeroSpeech challenge 2020☆37Updated 3 years ago
- Official implementation of "Learning Music Audio Representations Via Weak Language Supervision" (ICASSP 2022)☆47Updated last year
- Official Implementation of Mockingjay in Pytorch☆55Updated 2 years ago
- ☆14Updated 2 years ago
- Inference code for PaSST, using the HEAR API.☆32Updated last year
- Source code for training models and using the hyperbolic interface proposed in our ICASSP 2023 paper, “Hyperbolic Audio Source Separation…☆69Updated 2 years ago
- Embedded segmental K-means (ES-KMeans) in Python.☆15Updated last year
- SA-toolkit: Speaker speech anonymization toolkit in python☆28Updated 2 months ago
- Dataset and baseline code for the VocalSound dataset (ICASSP2022).☆154Updated 3 years ago
- ☆32Updated 4 years ago
- A library built for easier audio self-supervised training, downstream tasks evaluation☆133Updated 2 months ago
- ☆37Updated 4 years ago
- ☆163Updated 3 years ago
- Public Code for the paper MAE-AST: Masked Autoencoding Audio Spectrogram Transformer☆89Updated 3 years ago
- Learning differentiable temporal resolution on time-series data.☆36Updated 3 years ago
- Phoneme segmentation using pre-trained speech models☆55Updated 3 years ago
- A repository for benchmarking neural vocoders by their quality and speed.☆212Updated 6 months ago
- ☆41Updated 5 years ago
- Masked Modeling Duo: Towards a Universal Audio Pre-training Framework☆126Updated this week
- UnivNet: A Neural Vocoder with Multi-Resolution Spectrogram Discriminators for High-Fidelity Waveform Generation☆75Updated 4 years ago
- Vector-Quantized Contrastive Predictive Coding for Acoustic Unit Discovery and Voice Conversion☆143Updated 5 years ago