google-deepmind / slowfast_nfnetsLinks
☆30Updated 3 years ago
Alternatives and similar repositories for slowfast_nfnets
Users that are interested in slowfast_nfnets are comparing it to the libraries listed below
Sorting:
- Asteroid's filterbanks☆88Updated last year
- EVAR ~ Evaluation package for Audio Representations☆72Updated last month
- A list of papers about audio captioning☆79Updated 3 years ago
- Simple baseline model for the HEAR benchmark☆23Updated last month
- ☆37Updated 4 years ago
- A list of resources that can help in research for automated audio captioning☆34Updated 4 years ago
- Embedded segmental K-means (ES-KMeans) in Python.☆15Updated last year
- Codebase for the paper 'EncodecMAE: Leveraging neural codecs for universal audio representation learning'☆101Updated last year
- ☆32Updated 5 years ago
- ☆42Updated 5 years ago
- UnivNet: A Neural Vocoder with Multi-Resolution Spectrogram Discriminators for High-Fidelity Waveform Generation☆75Updated 4 years ago
- Who calls the shots? Rethinking Few-Shot Learning for Audio (WASPAA 2021)☆43Updated 3 years ago
- This repository contains the baseline system for CHiME-8 MMCSG challenge focusing on transcribing both sides of a conversation where one …☆39Updated last year
- Unofficial implementation of FSD50k baselines for Sound Event Recognition☆27Updated last year
- Evaluation kit for the HEAR Benchmark☆62Updated last month
- PyTorch Dataset for Speech and Music audio☆80Updated last year
- audioLIME: Listenable Explanations Using Source Separation☆37Updated 4 years ago
- Benchmark popular audio i/o packages☆151Updated 2 years ago
- A collection of papers related to speech model compression☆26Updated 2 years ago
- Source code for training models and using the hyperbolic interface proposed in our ICASSP 2023 paper, “Hyperbolic Audio Source Separation…☆69Updated 2 years ago
- Pypi installable TDNN and TDNN-F layers for PyTorch based acoustic model training☆41Updated 5 years ago
- WaveNet auto-ancoders for ZeroSpeech challenge 2020☆37Updated 3 years ago
- Dataset and baseline code for the VocalSound dataset (ICASSP2022).☆157Updated 3 years ago
- SA-toolkit: Speaker speech anonymization toolkit in python☆30Updated 4 months ago
- Conditioned U-Net for Music Source Separation☆20Updated 4 years ago
- Public Code for the paper MAE-AST: Masked Autoencoding Audio Spectrogram Transformer☆89Updated 3 years ago
- Word Discovery in Visually Grounded, Self-Supervised Speech Models☆26Updated 2 years ago
- BERT and LSTM baseline models of the ZeroSpeech Challenge 2021☆60Updated 3 years ago
- easy-to-use implementation of the ISMIR 2013 Audio Degradation Toolbox☆53Updated 6 years ago
- Training code and trained checkpoints for ASGAN.☆62Updated 2 years ago