SarthakYadav/audio-mamba-official

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/SarthakYadav/audio-mamba-official)

SarthakYadav / audio-mamba-official

Official implementation for our paper "Audio Mamba: Selective State Spaces for Self-Supervised Audio Representations"

☆44

Alternatives and similar repositories for audio-mamba-official

Users that are interested in audio-mamba-official are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

kaistmm / Audio-Mamba-AuM
View on GitHub
Official Implementation of the work "Audio Mamba: Bidirectional State Space Model for Audio Representation Learning"
☆173Nov 24, 2024Updated last year
HolgerBovbjerg / data2vec-KWS
View on GitHub
This repository contains code for applying Data2Vec to pretrain Keyword Transformer model as described in "Improving Label-Deficient Keyw…
☆32Mar 6, 2025Updated last year
SiavashShams / ssamba
View on GitHub
[SLT'24] The official implementation of SSAMBA: Self-Supervised Audio Representation Learning with Mamba State Space Model
☆140Nov 5, 2025Updated 8 months ago
ta012 / DTFAT
View on GitHub
[AAAI 2024] DTF-AT: Decoupled Time-Frequency Audio Transformer for Event Classification
☆12Mar 10, 2025Updated last year
xjchenGit / SingGraph
View on GitHub
Official repository for the paper Singing Voice Graph Modeling for SingFake Detection (Interspeech 2024).
☆24Sep 19, 2025Updated 10 months ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
Sara-Ahmed / ASiT
View on GitHub
ASiT: Audio Spectrogram vIsion Transformer for General Audio Representation
☆30Mar 10, 2024Updated 2 years ago
yzyouzhang / SASV_PR
View on GitHub
Official implementation of the Odyssey paper "A Probabilistic Fusion Framework for Spoofing Aware Speaker Verification"
☆18Jun 24, 2022Updated 4 years ago
AlanBaade / MAE-AST-Public
View on GitHub
Public Code for the paper MAE-AST: Masked Autoencoding Audio Spectrogram Transformer
☆93Jun 9, 2022Updated 4 years ago
Torabiy / HLS-CMDS
View on GitHub
Heart and Lung Sounds Dataset Recorded from a Clinical Manikin using Digital Stethoscope (HLS-CMDS)
☆19May 13, 2026Updated 2 months ago
kaen2891 / adversarial_fine-tuning_using_generated_respiratory_sound
View on GitHub
(NeurIPS 2023 Workshop on DGM4H) Official Implementation of "Adversarial Fine-tuning using Generated Respiratory Sound to Address Class I…
☆19Dec 5, 2024Updated last year
cyjie429 / RawBMamba
View on GitHub
☆31Jul 15, 2024Updated 2 years ago
kaen2891 / stethoscope-guided_supervised_contrastive_learning
View on GitHub
(ICASSP 2024) Official Implementation of "Stethoscope-guided Supervised Contrastive Learning for Cross-domin Adaptation on Respiratory So…
☆18Dec 5, 2024Updated last year
wngh1187 / Diff-SV
View on GitHub
Pytorch implementation of Diff-SV: A Unified Hierarchical Framework for Noise-Robust Speaker Verification Using Score-Based Diffusion Pro…
☆23Dec 14, 2023Updated 2 years ago
WangHelin1997 / LibriLightMix-WHAMR
View on GitHub
Python scripts to create noisy and reverberant 2-speaker mixture audio with Libri-Light and WHAM
☆17Nov 7, 2024Updated last year
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
talkingnow / HM-Conformer
View on GitHub
☆27Jan 17, 2024Updated 2 years ago
yzyouzhang / Audio_Research_in_US
View on GitHub
Audio Research in US. US-based professors who work on audio (music, speech, acoustics). For students who would like to apply for RA, PhD,…
☆27Feb 27, 2026Updated 5 months ago
wooseok-shin / MetricGAN-OKD
View on GitHub
Official PyTorch implementation of "Multi-Metric Optimization of MetricGAN via Online Knowledge Distillation for Speech Enhancement" (ICM…
☆30Mar 18, 2025Updated last year
diggerdu / AudioMamba
View on GitHub
☆12Jun 1, 2024Updated 2 years ago
AmphionTeam / SD-Eval
View on GitHub
[NeurIPS 2024] SD-Eval: A Benchmark Dataset for Spoken Dialogue Understanding Beyond Words
☆57Jun 25, 2024Updated 2 years ago
stanfordmlgroup / selfsupervised-lungandheartsounds
View on GitHub
☆19Nov 20, 2021Updated 4 years ago
deegy666 / ADD-RSC
View on GitHub
Code repository for ‘Adaptive Differential Denoising for Respiratory Sounds Classification’
☆23Dec 19, 2025Updated 7 months ago
umbertocappellazzo / PETL_AST
View on GitHub
This is the official repository of the papers "Parameter-Efficient Transfer Learning of Audio Spectrogram Transformers" [IEEE MLSP 2024] …
☆41Jul 31, 2024Updated last year
kyegomez / AudioMamba
View on GitHub
Implementation of the paper: "Audio Mamba: Bidirectional State Space Model for Audio Representation Learning" in pytorch
☆15Updated this week
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
tangjjbetsy / RHEPP-Transformer
View on GitHub
We propose a novel approach for reconstructing human expressiveness in piano performance with a multi-layer bi-directional Transformer. (…
☆21May 16, 2024Updated 2 years ago
Audio-AGI / dcase2024_task9_baseline
View on GitHub
Baseline for DCASE 2024 Task 9: "Language-Queried Audio Source Separation"
☆26Mar 27, 2024Updated 2 years ago
xiquan-li / TinyMU
View on GitHub
[ICASSP 2026] TinyMU: A Compact Audio Language Model for Music Understanding
☆37Apr 20, 2026Updated 3 months ago
ashi-ta / speechGLUE
View on GitHub
SpeechGLUE is a speech version of the GLUE benchmark, driven by text-to-speech.
☆13Jun 2, 2023Updated 3 years ago
YuanGongND / ssast
View on GitHub
Code for the AAAI 2022 paper "SSAST: Self-Supervised Audio Spectrogram Transformer".
☆428Aug 14, 2022Updated 3 years ago
huutuongtu / Lightvoc
View on GitHub
LIGHTVOC AN UPSAMPLING-FREE GAN VOCODER BASED ON CONFORMER AND INVERSE SHORT-TIME FOURIER TRANSFORM
☆18May 17, 2024Updated 2 years ago
midas-research / speechmix
View on GitHub
☆12Oct 2, 2020Updated 5 years ago
RoyChao19477 / SEMamba
View on GitHub
This is the official implementation of the SEMamba paper. (Accepted to IEEE SLT 2024)
☆273Dec 12, 2025Updated 7 months ago
wilkinghoff / DSpAST
View on GitHub
Code for the paper "DSpAST: Disentangled Representations for Spatial Audio Reasoning with Large Language Models"
☆17Oct 23, 2025Updated 9 months ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
yangdongchao / UniAudio2Demo
View on GitHub
☆26Feb 10, 2026Updated 5 months ago
kkoutini / passt_hear21
View on GitHub
Inference code for PaSST, using the HEAR API.
☆35Jan 2, 2024Updated 2 years ago
slp-rl / PAST
View on GitHub
☆48Jul 7, 2025Updated last year
haoheliu / SemantiCodec
View on GitHub
☆45Jun 11, 2024Updated 2 years ago
B06901052 / DeepSpeed
View on GitHub
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
☆13Oct 11, 2022Updated 3 years ago
JinhuaLiang / LaD-ProtoNet
View on GitHub
☆16Sep 14, 2023Updated 2 years ago
frankenliu / LOAE
View on GitHub
☆10Sep 25, 2024Updated last year