google-deepmind / slowfast_nfnetsLinks
☆30Updated 3 years ago
Alternatives and similar repositories for slowfast_nfnets
Users that are interested in slowfast_nfnets are comparing it to the libraries listed below
Sorting:
- Official implementation of "Learning Music Audio Representations Via Weak Language Supervision" (ICASSP 2022)☆47Updated 6 months ago
- Source code for training models and using the hyperbolic interface proposed in our ICASSP 2023 paper, “Hyperbolic Audio Source Separation…☆68Updated 2 years ago
- PyTorch Dataset for Speech and Music audio☆76Updated 11 months ago
- ARCH: Audio Representations benCHmark☆46Updated 10 months ago
- Codebase for the paper 'EncodecMAE: Leveraging neural codecs for universal audio representation learning'☆97Updated 11 months ago
- Inference code for PaSST, using the HEAR API.☆33Updated last year
- A list of papers about audio captioning☆77Updated 2 years ago
- Evaluation kit for the HEAR Benchmark☆59Updated last month
- ☆83Updated 2 years ago
- A standardized toolkit of Kernel Audio Distance (KAD)—a distribution-free, unbiased, and computationally efficient metric for evaluating …☆74Updated 2 weeks ago
- EVAR ~ Evaluation package for Audio Representations☆58Updated last week
- ☆32Updated 4 years ago
- A collection of audio autoencoders, in PyTorch.☆42Updated 2 years ago
- Official repository for the paper "Chunked Autoregressive GAN for Conditional Waveform Synthesis"☆188Updated 2 years ago
- easy-to-use implementation of the ISMIR 2013 Audio Degradation Toolbox☆50Updated 5 years ago
- Who calls the shots? Rethinking Few-Shot Learning for Audio (WASPAA 2021)☆43Updated 3 years ago
- ☆40Updated 5 years ago
- An invertible and differentiable implementation of the Constant-Q Transform (CQT).☆60Updated 2 years ago
- SA-toolkit: Speaker speech anonymization toolkit in python☆23Updated 3 months ago
- PyTorch implementation of the ICASSP-24 paper: "Improving Audio Captioning Models with Fine-grained Audio Features, Text Embedding Superv…☆36Updated last year
- UnivNet: A Neural Vocoder with Multi-Resolution Spectrogram Discriminators for High-Fidelity Waveform Generation☆74Updated 3 years ago
- Audio activity detector based on per-channel energy normalization (PCEN)☆29Updated 6 years ago
- Masked Modeling Duo: Towards a Universal Audio Pre-training Framework☆103Updated 10 months ago
- Unsupervised Representation Learning for Singing Voice Separation☆22Updated 2 years ago
- Simple baseline model for the HEAR benchmark☆23Updated last week
- ☆32Updated 3 years ago
- CNN-based singing voice detection experiments☆37Updated 7 years ago
- Improving Recording Device Generalization using Impulse Response Augmentation☆17Updated 2 months ago
- Training code and trained checkpoints for ASGAN.☆62Updated last year
- Multipurpose Multi Speaker Mixture Signal Generator☆44Updated 4 months ago