kaistmm/Audio-Mamba-AuM

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/kaistmm/Audio-Mamba-AuM)

kaistmm / Audio-Mamba-AuM

Official Implementation of the work "Audio Mamba: Bidirectional State Space Model for Audio Representation Learning"

☆173

Alternatives and similar repositories for Audio-Mamba-AuM

Users that are interested in Audio-Mamba-AuM are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

JiuFengSC / ElasticAST
View on GitHub
Official code of ElasticAST (Interspeech 2024 paper)
☆34Jul 30, 2024Updated last year
SiavashShams / ssamba
View on GitHub
[SLT'24] The official implementation of SSAMBA: Self-Supervised Audio Representation Learning with Mamba State Space Model
☆140Nov 5, 2025Updated 8 months ago
kaistmm / SSLalignment
View on GitHub
☆38May 28, 2025Updated last year
SarthakYadav / audio-mamba-official
View on GitHub
Official implementation for our paper "Audio Mamba: Selective State Spaces for Self-Supervised Audio Representations"
☆44Aug 14, 2025Updated 11 months ago
kyegomez / AudioMamba
View on GitHub
Implementation of the paper: "Audio Mamba: Bidirectional State Space Model for Audio Representation Learning" in pytorch
☆15Updated this week
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
signofthefour / fregrad
View on GitHub
Code repository for FreGrad
☆52May 19, 2024Updated 2 years ago
yongyizang / music-source-restoration
View on GitHub
Official Repository for "Music Source Restoration"
☆31Jun 1, 2025Updated last year
kaistmm / fregrad
View on GitHub
[ICASSP 2024] Official code for FreGrad
☆35May 13, 2024Updated 2 years ago
Sreyan88 / ReCLAP
View on GitHub
☆33Dec 23, 2025Updated 7 months ago
xi-j / Mamba-ASR
View on GitHub
ConMamba for Automatic Speech Recognition
☆106Aug 12, 2024Updated last year
kaist-ami / AVHBench
View on GitHub
[ICLR'25] Official repository for "AVHBench: A Cross-Modal Hallucination Evaluation for Audio-Visual Large Language Models"
☆25Mar 8, 2026Updated 4 months ago
diggerdu / AudioMamba
View on GitHub
☆12Jun 1, 2024Updated 2 years ago
JishengBai / AudioSetCaps
View on GitHub
A 6-million Audio-Caption Paired Dataset Built with a LLMs and ALMs-based Automatic Pipeline
☆208Dec 13, 2024Updated last year
art-jang / LiTFiC
View on GitHub
[CVPR2025] Official code for Lost in Translation Found in Context
☆24Jan 14, 2026Updated 6 months ago
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
shincling / discreteSeparation
View on GitHub
The demo for "Discretization and Re-synthesis: an alternative method to solve the Cocktail Party Problem".
☆12Oct 25, 2021Updated 4 years ago
Stability-AI / stable-codec
View on GitHub
A family of state-of-the-art Transformer-based audio codecs for low-bitrate high-quality audio coding.
☆437Jul 17, 2026Updated last week
adefossez / audio_mod_idessai
View on GitHub
Repo for the IDESSAI 2024 course on modeling audio with discrete tokens.
☆13Sep 13, 2024Updated last year
glory20h / VoiceLDM
View on GitHub
VoiceLDM: Text-to-Speech with Environmental Context
☆194Aug 9, 2024Updated last year
Torabiy / HLS-CMDS
View on GitHub
Heart and Lung Sounds Dataset Recorded from a Clinical Manikin using Digital Stethoscope (HLS-CMDS)
☆19May 13, 2026Updated 2 months ago
RoyChao19477 / SEMamba
View on GitHub
This is the official implementation of the SEMamba paper. (Accepted to IEEE SLT 2024)
☆273Dec 12, 2025Updated 7 months ago
denfed / wave-spec-fusion
View on GitHub
Code for the submitted 2021 DCASE Workshop paper: "Waveforms and Spectrograms: Enhancing Acoustic Scene Classification Using Multimodal F…
☆16Aug 9, 2021Updated 4 years ago
JusperLee / SPMamba
View on GitHub
☆227Dec 5, 2024Updated last year
ytaek-oh / vl_compo
View on GitHub
☆10Jul 5, 2024Updated 2 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
xi-j / Mamba-TasNet
View on GitHub
☆116Oct 1, 2024Updated last year
kaistmm / VoiceDiT
View on GitHub
[ICASSP2025] Official code for VoiceDiT: Dual-Condition Diffusion Transformer for Environment-Aware Speech Synthesis
☆52Apr 9, 2025Updated last year
ZacharyNovack / Lead-AE
View on GitHub
Official Repository of Unsupervised Lead Sheet Generation via Semantic Compression
☆22Oct 23, 2023Updated 2 years ago
evelyn0414 / OPERA
View on GitHub
This is the official code release for OPERA: OPEn Respiratory Acoustic foundation models
☆83Mar 11, 2025Updated last year
kaist-ami / Sound2Scene
View on GitHub
☆43Apr 14, 2025Updated last year
yzGuu830 / efficient-speech-codec
View on GitHub
[EMNLP 2024] ESC: Efficient Speech Coding with Cross-Scale Residual Vector Quantized Transformers
☆126Mar 20, 2025Updated last year
swagshaw / Rainbow-Keywords
View on GitHub
Rainbow Keywords - Official PyTorch Implementation
☆14Jun 27, 2024Updated 2 years ago
sony / sampleid
View on GitHub
Code for the paper “Automatic Music Sample Identification with Multi-Track Contrastive Learning”.
☆25May 22, 2026Updated 2 months ago
aeromamba-super-resolution / aeromamba
View on GitHub
Official implementation of "AEROMamba: An efficient architecture for audio super-resolution using generative adversarial networks and sta…
☆50Nov 11, 2025Updated 8 months ago
Open source password manager - Proton Pass • Ad
Securely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
lysanderism / TimeAudio
View on GitHub
The official repository TimeAudio, a comprehensive framework that incorporates fine-grained acoustic cues into LALMs with enhanced module…
☆30Nov 18, 2025Updated 8 months ago
JinhuaLiang / LaD-ProtoNet
View on GitHub
☆16Sep 14, 2023Updated 2 years ago
seungheondoh / msu-benchmark
View on GitHub
music semantic understanding evaluation benchmark
☆24Aug 12, 2023Updated 2 years ago
RanaCM / DSU-AVO
View on GitHub
Source code and speech samples for the DSU-AVO paper accepted to INTERSPEECH 2023
☆12May 13, 2024Updated 2 years ago
NilsDem / control-transfer-diffusion
View on GitHub
Repository for the paper "Combining audio control and style transfer using latent diffusion", accepted at ISMIR 2024
☆67Feb 19, 2025Updated last year
slSeanWU / beats-conformer-bart-audio-captioner
View on GitHub
PyTorch implementation of the ICASSP-24 paper: "Improving Audio Captioning Models with Fine-grained Audio Features, Text Embedding Superv…
☆41Jan 6, 2024Updated 2 years ago
swagshaw / WildDESED
View on GitHub
WildDESED: A LLM-Powered Dataset for Wild Domestic Environment Sound Event Detection
☆18Nov 19, 2024Updated last year