BYOL for Audio: Self-Supervised Learning for General-Purpose Audio Representation
☆234 · Apr 26, 2023 · Updated 3 years ago
Alternatives and similar repositories for byol-a
Users that are interested in byol-a are comparing it to the libraries listed below.
- EVAR ~ Evaluation package for Audio Representations ☆81 · Feb 19, 2026 · Updated 4 months ago
- Code for the paper "Unsupervised Contrastive Learning of Sound Event Representations", ICASSP 2021. ☆93 · Dec 22, 2022 · Updated 3 years ago
- Official PyTorch implementation of Contrastive Learning of Musical Representations ☆337 · Jul 25, 2024 · Updated last year
- (Hybrid) BYOL-S feature extractor using serab-byols package in pytorch. ☆27 · Apr 20, 2024 · Updated 2 years ago
- COLA contrastive pre-training method implemented in PyTorch ☆44 · Jan 27, 2021 · Updated 5 years ago
- SERAB: a multi-lingual benchmark for speech emotion recognition ☆28 · Dec 16, 2022 · Updated 3 years ago
- Code for the AAAI 2022 paper "SSAST: Self-Supervised Audio Spectrogram Transformer". ☆426 · Aug 14, 2022 · Updated 3 years ago
- Efficient Training of Audio Transformers with Patchout ☆385 · Jan 12, 2024 · Updated 2 years ago
- Composing General Audio Representation by Fusing Multilayer Features of a Pre-trained Model ☆26 · Apr 26, 2023 · Updated 3 years ago
- Masked Spectrogram Modeling using Masked Autoencoders for Learning General-purpose Audio Representations ☆99 · Feb 20, 2026 · Updated 4 months ago
- ToyADMOS2: Another dataset of miniature-machine operating sounds for anomalous sound detection under domain shift conditions 🚗 🚃 ☆21 · Apr 16, 2024 · Updated 2 years ago
- Fast audio data augmentation in PyTorch. Inspired by audiomentations. Useful for deep learning. ☆1,155 · Nov 24, 2025 · Updated 7 months ago
- A library built for easier audio self-supervised training, downstream tasks evaluation ☆141 · Sep 25, 2025 · Updated 9 months ago
- Code for the Interspeech 2021 paper "AST: Audio Spectrogram Transformer". ☆1,461 · May 21, 2023 · Updated 3 years ago
- A Pytorch implementation of the paper: SpecAugment++: A Hidden Space Data Augmentation Method for Acoustic Scene Classification ☆34 · Jun 25, 2021 · Updated 5 years ago
- Dataset and baseline code for the VocalSound dataset (ICASSP2022). ☆165 · Nov 12, 2022 · Updated 3 years ago
- Official implementation of "Learning Music Audio Representations Via Weak Language Supervision" (ICASSP 2022) ☆47 · Dec 3, 2024 · Updated last year
- Audio transformations library for PyTorch ☆239 · Apr 19, 2022 · Updated 4 years ago
- ☆55 · Jun 3, 2020 · Updated 6 years ago
- A library for speech data augmentation in time-domain ☆689 · Aug 30, 2021 · Updated 4 years ago
- Audio processing by using pytorch 1D convolution network ☆1,127 · May 21, 2026 · Updated last month
- This repository contains all the code necessary for running the multilingual distilwhisper from Ferraz et al. 2024 IEEE ICASSP paper. ☆34 · Apr 22, 2026 · Updated 2 months ago
- This repo hosts the code and models of "Masked Autoencoders that Listen". ☆669 · Apr 5, 2024 · Updated 2 years ago
- An implementation of the Contrast Predictive Coding (CPC) method to train audio features in an unsupervised fashion. ☆372 · Oct 12, 2021 · Updated 4 years ago
- RWCP-SSD-Onomatopoeia ☆24 · Jun 28, 2023 · Updated 3 years ago
- Masked Modeling Duo: Towards a Universal Audio Pre-training Framework ☆157 · Feb 23, 2026 · Updated 4 months ago
- Word Discovery in Visually Grounded, Self-Supervised Speech Models ☆27 · Dec 4, 2023 · Updated 2 years ago
- ☆12 · Jun 18, 2021 · Updated 5 years ago
- A GPU-optional modular synthesizer in pytorch, 16200x faster than realtime, for audio ML researchers. ☆381 · Feb 16, 2026 · Updated 4 months ago
- COALA: Co-Aligned Autoencoders for Learning Semantically Enriched Audio Representations ☆48 · Jul 25, 2024 · Updated last year
- The Pytorch implementation of paper: Masked Spectrogram Prediction For Self-Supervised Audio Pre-Training ☆51 · Dec 17, 2024 · Updated last year
- ☆18 · Apr 12, 2021 · Updated 5 years ago
- Python library for downloading, loading & working with sound datasets ☆356 · Jun 23, 2026 · Updated last week
- Public Code for the paper MAE-AST: Masked Autoencoding Audio Spectrogram Transformer ☆93 · Jun 9, 2022 · Updated 4 years ago
- A differentiable version of SPTK ☆201 · Jun 2, 2026 · Updated last month
- The demo for "Discretization and Re-synthesis: an alternative method to solve the Cocktail Party Problem". ☆12 · Oct 25, 2021 · Updated 4 years ago
- An official repo for the paper "Adapting Language-Audio Models as Few-Shot Audio Learners" ☆31 · May 31, 2023 · Updated 3 years ago
- Autoencoder Based Real-Time Timbre Interpolation Algorithm ☆12 · Aug 17, 2020 · Updated 5 years ago
- Audio Captioning datasets for PyTorch. ☆129 · Mar 25, 2026 · Updated 3 months ago