BYOL for Audio: Self-Supervised Learning for General-Purpose Audio Representation
☆234Apr 26, 2023Updated 3 years ago
Alternatives and similar repositories for byol-a
Users that are interested in byol-a are comparing it to the libraries listed below.
- EVAR ~ Evaluation package for Audio Representations☆78Feb 19, 2026Updated 3 months ago
- Code for the paper "Unsupervised Contrastive Learning of Sound Event Representations", ICASSP 2021.☆93Dec 22, 2022Updated 3 years ago
- Official PyTorch implementation of Contrastive Learning of Musical Representations☆336Jul 25, 2024Updated last year
- (Hybrid) BYOL-S feature extractor using serab-byols package in pytorch.☆27Apr 20, 2024Updated 2 years ago
- COLA contrastive pre-training method implemented in PyTorch☆44Jan 27, 2021Updated 5 years ago
- SERAB: a multi-lingual benchmark for speech emotion recognition☆28Dec 16, 2022Updated 3 years ago
- Code for the AAAI 2022 paper "SSAST: Self-Supervised Audio Spectrogram Transformer".☆421Aug 14, 2022Updated 3 years ago
- Efficient Training of Audio Transformers with Patchout☆382Jan 12, 2024Updated 2 years ago
- Composing General Audio Representation by Fusing Multilayer Features of a Pre-trained Model☆26Apr 26, 2023Updated 3 years ago
- Masked Spectrogram Modeling using Masked Autoencoders for Learning General-purpose Audio Representations☆99Feb 20, 2026Updated 3 months ago
- ToyADMOS2: Another dataset of miniature-machine operating sounds for anomalous sound detection under domain shift conditions 🚗 🚃☆21Apr 16, 2024Updated 2 years ago
- Fast audio data augmentation in PyTorch. Inspired by audiomentations. Useful for deep learning.☆1,148Nov 24, 2025Updated 6 months ago
- A library built for easier audio self-supervised training, downstream tasks evaluation☆139Sep 25, 2025Updated 8 months ago
- Code for the Interspeech 2021 paper "AST: Audio Spectrogram Transformer".☆1,455May 21, 2023Updated 3 years ago
- A PyTorch implementation of the paper "SpecAugment++: A Hidden Space Data Augmentation Method for Acoustic Scene Classification"☆34Jun 25, 2021Updated 4 years ago
- Dataset and baseline code for the VocalSound dataset (ICASSP2022).☆164Nov 12, 2022Updated 3 years ago
- Official implementation of "Learning Music Audio Representations Via Weak Language Supervision" (ICASSP 2022)☆47Dec 3, 2024Updated last year
- Audio transformations library for PyTorch☆239Apr 19, 2022Updated 4 years ago
- ☆55Jun 3, 2020Updated 5 years ago
- A library for speech data augmentation in time-domain☆687Aug 30, 2021Updated 4 years ago
- Audio processing by using pytorch 1D convolution network☆1,124Updated this week
- This repository contains all the code necessary for running the multilingual distilwhisper from Ferraz et al. 2024 IEEE ICASSP paper.☆33Apr 22, 2026Updated last month
- This repo hosts the code and models of "Masked Autoencoders that Listen".☆663Apr 5, 2024Updated 2 years ago
- An implementation of the Contrast Predictive Coding (CPC) method to train audio features in an unsupervised fashion.☆370Oct 12, 2021Updated 4 years ago
- RWCP-SSD-Onomatopoeia☆24Jun 28, 2023Updated 2 years ago
- Masked Modeling Duo: Towards a Universal Audio Pre-training Framework☆152Feb 23, 2026Updated 3 months ago
- Word Discovery in Visually Grounded, Self-Supervised Speech Models☆27Dec 4, 2023Updated 2 years ago
- ☆13Jun 18, 2021Updated 4 years ago
- A GPU-optional modular synthesizer in pytorch, 16200x faster than realtime, for audio ML researchers.☆378Feb 16, 2026Updated 3 months ago
- COALA: Co-Aligned Autoencoders for Learning Semantically Enriched Audio Representations☆48Jul 25, 2024Updated last year
- The Pytorch implementation of paper: Masked Spectrogram Prediction For Self-Supervised Audio Pre-Training☆51Dec 17, 2024Updated last year
- ☆18Apr 12, 2021Updated 5 years ago
- Python library for downloading, loading & working with sound datasets☆356Sep 23, 2025Updated 8 months ago
- Public Code for the paper MAE-AST: Masked Autoencoding Audio Spectrogram Transformer☆93Jun 9, 2022Updated 3 years ago
- A differentiable version of SPTK☆200May 18, 2026Updated last week
- The demo for "Discretization and Re-synthesis: an alternative method to solve the Cocktail Party Problem".☆12Oct 25, 2021Updated 4 years ago
- An official repo for the paper "Adapting Language-Audio Models as Few-Shot Audio Learners"☆31May 31, 2023Updated 2 years ago
- Autoencoder Based Real-Time Timbre Interpolation Algorithm☆12Aug 17, 2020Updated 5 years ago
- Audio Captioning datasets for PyTorch.☆128Mar 25, 2026Updated 2 months ago