google-deepmind / slowfast_nfnets
☆30Updated 2 years ago
Related projects ⓘ
Alternatives and complementary repositories for slowfast_nfnets
- ARCH: Audio Representations benCHmark☆35Updated 2 months ago
- Training code and trained checkpoints for ASGAN.☆60Updated 10 months ago
- Masked Modeling Duo: Towards a Universal Audio Pre-training Framework☆76Updated 3 months ago
- A library built for easier audio self-supervised training, downstream tasks evaluation☆105Updated 2 months ago
- ☆36Updated 3 years ago
- EVAR ~ Evaluation package for Audio Representations☆43Updated this week
- Learning differentiable temporal resolution on time-series data.☆32Updated last year
- PyTorch Dataset for Speech and Music audio☆73Updated 3 months ago
- ☆32Updated 3 years ago
- Who calls the shots? Rethinking Few-Shot Learning for Audio (WASPAA 2021)☆40Updated 2 years ago
- ☆32Updated 2 years ago
- Transcribing Speech with Multinomial Diffusion, training code and models.☆75Updated last year
- (Hybrid) BYOL-S feature extractor using serab-byols package in pytorch.☆26Updated 6 months ago
- Official implementation of "Learning Music Audio Representations Via Weak Language Supervision" (ICASSP 2022)☆46Updated 2 years ago
- Web-crawl for "Audio Retrieval with WavText5K and CLAP Training"☆49Updated 2 years ago
- Official Implementation of Mockingjay in Pytorch☆52Updated last year
- Source code for training models and using the hyperbolic interface proposed in our ICASSP 2023 paper, “Hyperbolic Audio Source Separation…☆57Updated last year
- experiments about AudioSet☆43Updated last year
- WaveNet auto-ancoders for ZeroSpeech challenge 2020☆36Updated 2 years ago
- UnivNet: A Neural Vocoder with Multi-Resolution Spectrogram Discriminators for High-Fidelity Waveform Generation☆70Updated 3 years ago
- The Pytorch implementation of paper: Masked Spectrogram Prediction For Self-Supervised Audio Pre-Training☆39Updated last year
- Typing to Listen at the Cocktail Party: Text-Guided Target Speaker Extraction (LLM-TSE)☆32Updated last year
- This is the official repository of the papers "Parameter-Efficient Transfer Learning of Audio Spectrogram Transformers" and "Efficient Fi…☆35Updated 3 months ago
- Accompanying code for our paper "Optimizing Short-Time Fourier Transform Parameters via Gradient Descent"☆31Updated 4 years ago
- Simple baseline model for the HEAR benchmark☆22Updated last week
- Inference code for PaSST, using the HEAR API.☆29Updated 10 months ago
- ☆33Updated 4 months ago
- Word Discovery in Visually Grounded, Self-Supervised Speech Models☆25Updated 11 months ago
- Conditioned U-Net for Music Source Separation☆20Updated 3 years ago
- ☆79Updated last year