SiavashShams / ssamba
[SLT'24] The official implementation of SSAMBA: Self-Supervised Audio Representation Learning with Mamba State Space Model
☆130 Updated 2 weeks ago
Alternatives and similar repositories for ssamba
Users who are interested in ssamba are comparing it to the libraries listed below.
- Official implementation for our paper "Audio Mamba: Selective State Spaces for Self-Supervised Audio Representations" ☆41 Updated 3 months ago
- Masked Modeling Duo: Towards a Universal Audio Pre-training Framework ☆122 Updated last week
- A library built for easier audio self-supervised training, downstream tasks evaluation ☆132 Updated last month
- ☆101 Updated last year
- Official Implementation of the work "Audio Mamba: Bidirectional State Space Model for Audio Representation Learning" ☆159 Updated 11 months ago
- Learning differentiable temporal resolution on time-series data. ☆36 Updated 3 years ago
- ☆188 Updated 11 months ago
- ☆103 Updated 6 months ago
- ☆10 Updated 2 months ago
- [ICLR 2025] Enhancing Self-Supervised Models with Audio Mixtures for Polyphonic Soundscapes ☆52 Updated last month
- This is the official repository of the papers "Parameter-Efficient Transfer Learning of Audio Spectrogram Transformers" and "Efficient Fi… ☆38 Updated last year
- EVAR ~ Evaluation package for Audio Representations ☆68 Updated last month
- This repository aims to collect Transformer-based sound event detection (SED) algorithms. ☆78 Updated 2 weeks ago
- Public Code for the paper MAE-AST: Masked Autoencoding Audio Spectrogram Transformer ☆88 Updated 3 years ago
- ☆23 Updated 8 months ago
- AudioLDM training, finetuning, evaluation and inference. ☆14 Updated last year
- ASiT: Audio Spectrogram vIsion Transformer for General Audio Representation ☆28 Updated last year
- Open implementation of UNIVERSE and UNIVERSE++ diffusion-based speech enhancement models. ☆108 Updated last year
- Codebase for the paper 'EncodecMAE: Leveraging neural codecs for universal audio representation learning' ☆99 Updated last year
- This package aims at simplifying the download of the AudioSet dataset. ☆55 Updated 4 months ago
- TODO ☆43 Updated 2 years ago
- Single channel speech source separation by diffusion process (ICASSP 2023) ☆120 Updated last year
- Official data preparation and metric evaluation scripts for the Interspeech 2025 URGENT challenge. ☆75 Updated 6 months ago
- (ICASSP 2025) Learning Source Disentanglement in Neural Audio Codec ☆40 Updated 6 months ago
- A 6-million Audio-Caption Paired Dataset Built with a LLMs and ALMs-based Automatic Pipeline ☆190 Updated 11 months ago
- Prediction of sound event bounding boxes (SEBBs) ☆31 Updated last year
- ☆25 Updated last year
- Dataset and baseline code for the VocalSound dataset (ICASSP2022). ☆154 Updated 3 years ago
- ☆58 Updated 2 years ago
- Implementation of the paper "Self-supervised Learning with Random-projection Quantizer for Speech Recognition" in Pytorch. ☆87 Updated 2 years ago