google-deepmind / slowfast_nfnetsLinks
☆30Updated 3 years ago
Alternatives and similar repositories for slowfast_nfnets
Users that are interested in slowfast_nfnets are comparing it to the libraries listed below
Sorting:
- Codebase for the paper 'EncodecMAE: Leveraging neural codecs for universal audio representation learning'☆98Updated last year
- A list of papers about audio captioning☆79Updated 3 years ago
- Masked Modeling Duo: Towards a Universal Audio Pre-training Framework☆110Updated last year
- EVAR ~ Evaluation package for Audio Representations☆64Updated 2 months ago
- Dataset and baseline code for the VocalSound dataset (ICASSP2022).☆150Updated 2 years ago
- Asteroid's filterbanks☆86Updated 7 months ago
- PyTorch Dataset for Speech and Music audio☆78Updated last year
- Confidence interval computation for evaluation in machine learning using the bootstrapping approach☆87Updated last year
- Phoneme segmentation using pre-trained speech models☆55Updated 2 years ago
- A library built for easier audio self-supervised training, downstream tasks evaluation☆128Updated last year
- ☆163Updated 2 years ago
- Source code for training models and using the hyperbolic interface proposed in our ICASSP 2023 paper, “Hyperbolic Audio Source Separation…☆70Updated 2 years ago
- Transcribing Speech with Multinomial Diffusion, training code and models.☆78Updated last year
- A repository for benchmarking neural vocoders by their quality and speed.☆210Updated 3 months ago
- DEPRECATED: Jamendo music dataset with time-aligned lyrics for lyrics alignment evaluation☆87Updated 4 months ago
- Benchmark popular audio i/o packages☆146Updated last year
- Official implementation of "Learning Music Audio Representations Via Weak Language Supervision" (ICASSP 2022)☆48Updated 8 months ago
- A list of resources that can help in research for automated audio captioning☆34Updated 4 years ago
- Deep Articulatory Synthesis and Inversion☆52Updated last year
- Vector-Quantized Contrastive Predictive Coding for Acoustic Unit Discovery and Voice Conversion☆142Updated 4 years ago
- SA-toolkit: Speaker speech anonymization toolkit in python☆26Updated 5 months ago
- Learning differentiable temporal resolution on time-series data.☆36Updated 2 years ago
- Evaluation kit for the HEAR Benchmark☆59Updated last week
- Who calls the shots? Rethinking Few-Shot Learning for Audio (WASPAA 2021)☆43Updated 3 years ago
- Official code for Wav2Seq☆96Updated 3 years ago
- UnivNet: A Neural Vocoder with Multi-Resolution Spectrogram Discriminators for High-Fidelity Waveform Generation☆75Updated 4 years ago
- The official repository for Audio ALBERT☆67Updated 3 years ago
- BERT and LSTM baseline models of the ZeroSpeech Challenge 2021☆60Updated 2 years ago
- Unofficial implementation of FSD50k baselines for Sound Event Recognition☆26Updated last year
- Unofficial SoundStream implementation of Pytorch with training code and 16kHz pretrained checkpoint☆73Updated 2 years ago