google-deepmind / slowfast_nfnetsLinks
☆30Updated 3 years ago
Alternatives and similar repositories for slowfast_nfnets
Users that are interested in slowfast_nfnets are comparing it to the libraries listed below
Sorting:
- EVAR ~ Evaluation package for Audio Representations☆65Updated 3 weeks ago
- audioLIME: Listenable Explanations Using Source Separation☆37Updated 4 years ago
- Masked Modeling Duo: Towards a Universal Audio Pre-training Framework☆119Updated last month
- A list of resources that can help in research for automated audio captioning☆34Updated 4 years ago
- Codebase for the paper 'EncodecMAE: Leveraging neural codecs for universal audio representation learning'☆99Updated last year
- A list of papers about audio captioning☆79Updated 3 years ago
- Dataset and baseline code for the VocalSound dataset (ICASSP2022).☆154Updated 2 years ago
- Asteroid's filterbanks☆87Updated 9 months ago
- Who calls the shots? Rethinking Few-Shot Learning for Audio (WASPAA 2021)☆44Updated 3 years ago
- Benchmark popular audio i/o packages☆147Updated last year
- A library built for easier audio self-supervised training, downstream tasks evaluation☆131Updated last month
- Web-crawl for "Audio Retrieval with WavText5K and CLAP Training"☆50Updated 2 years ago
- UnivNet: A Neural Vocoder with Multi-Resolution Spectrogram Discriminators for High-Fidelity Waveform Generation☆75Updated 4 years ago
- SA-toolkit: Speaker speech anonymization toolkit in python☆28Updated last month
- Inference code for PaSST, using the HEAR API.☆32Updated last year
- Simple baseline model for the HEAR benchmark☆23Updated last month
- ☆163Updated 3 years ago
- Embedded segmental K-means (ES-KMeans) in Python.☆15Updated last year
- Learning differentiable temporal resolution on time-series data.☆36Updated 2 years ago
- A repository for benchmarking neural vocoders by their quality and speed.☆211Updated 5 months ago
- Masked Spectrogram Modeling using Masked Autoencoders for Learning General-purpose Audio Representations☆95Updated last year
- A collection of audio autoencoders, in PyTorch.☆43Updated 2 years ago
- ☆32Updated 4 years ago
- Vector-Quantized Contrastive Predictive Coding for Acoustic Unit Discovery and Voice Conversion☆143Updated 5 years ago
- Evaluation kit for the HEAR Benchmark☆61Updated last month
- DEPRECATED: Jamendo music dataset with time-aligned lyrics for lyrics alignment evaluation☆87Updated 6 months ago
- PyTorch Dataset for Speech and Music audio☆78Updated last year
- Unofficial implementation of FSD50k baselines for Sound Event Recognition☆26Updated last year
- WaveNet auto-ancoders for ZeroSpeech challenge 2020☆37Updated 3 years ago
- Code for the paper "Unsupervised Contrastive Learning of Sound Event Representations", ICASSP 2021.☆93Updated 2 years ago