[SLT'24] The official implementation of SSAMBA: Self-Supervised Audio Representation Learning with Mamba State Space Model
☆136Nov 5, 2025Updated 4 months ago
Alternatives and similar repositories for ssamba
Users that are interested in ssamba are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆113Oct 1, 2024Updated last year
- ConMamba for Automatic Speech Recognition☆103Aug 12, 2024Updated last year
- This is the official implementation of the SEMamba paper. (Accepted to IEEE SLT 2024)☆256Dec 12, 2025Updated 3 months ago
- ☆211Dec 5, 2024Updated last year
- Official Implementation of the work "Audio Mamba: Bidirectional State Space Model for Audio Representation Learning"☆168Nov 24, 2024Updated last year
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Codebase for the paper 'EncodecMAE: Leveraging neural codecs for universal audio representation learning'☆101Jul 24, 2024Updated last year
- ☆33Dec 23, 2025Updated 3 months ago
- This is a list of speech tasks and datasets, which can provide training data for Generative AI, AIGC, AI model training, intelligent spee…☆82Jun 7, 2024Updated last year
- Aty-TTS: Improving fairness for spoken language understanding in atypical speech with Text-to-Speech☆11May 14, 2025Updated 10 months ago
- Official implementation for our paper "Audio Mamba: Selective State Spaces for Self-Supervised Audio Representations"☆41Aug 14, 2025Updated 7 months ago
- [EMNLP 2024] ESC: Efficient Speech Coding with Cross-Scale Residual Vector Quantized Transformers☆125Mar 20, 2025Updated last year
- ☆12Mar 11, 2025Updated last year
- Official code release for "RTFS-Net: Recurrent time-frequency modelling for efficient audio-visual speech separation", accepted ICLR 2024☆50Oct 14, 2025Updated 5 months ago
- Official code of ElasticAST (Interspeech 2024 paper)☆34Jul 30, 2024Updated last year
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- ☆67Aug 16, 2023Updated 2 years ago
- Audio Codec Speech processing Universal PERformance Benchmark☆300Jan 8, 2026Updated 2 months ago
- Official Implementation and Dataset of paper - DFADD: The Diffusion and Flow-matching based Audio Deepfake Dataset☆15Apr 7, 2025Updated 11 months ago
- ☆14Oct 3, 2025Updated 5 months ago
- ☆25Sep 10, 2025Updated 6 months ago
- SoloAudio: Target Sound Extraction with Language-oriented Audio Diffusion Transformer.☆114Jan 28, 2026Updated 2 months ago
- PyTorch implementation of Audio Flamingo: Series of Advanced Audio Understanding Language Models☆1,023Dec 15, 2025Updated 3 months ago
- Conformer block with Rotary Position Embedding, modified from lucidrains' implement☆18Sep 13, 2024Updated last year
- Explicit Estimation of Magnitude and Phase Spectra in Parallel for High-Quality Speech Enhancement☆478May 19, 2025Updated 10 months ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- A collection of audio signals accompanied by corresponding subjective scores of perceived quality. Everything under permissive licenses.☆47Feb 24, 2026Updated last month
- Self-Supervised Speech/Sound Pre-training and Representation Learning Toolkit☆13Nov 18, 2022Updated 3 years ago
- A unified framework for Low-resource Audio Processing and Evaluation (SSL Pre-training and Downstream Fine-tuning)☆29Jul 9, 2024Updated last year
- The Open Source Code of UniAudio☆606Jul 22, 2024Updated last year
- Official Pytorch Implementation for "DDDM-VC: Decoupled Denoising Diffusion Models with Disentangled Representation and Prior Mixup for V…☆243Jul 31, 2024Updated last year
- MSP-Podcast Challenge Baseline Code for Interspeech 2025☆28Dec 4, 2024Updated last year
- Masked Modeling Duo: Towards a Universal Audio Pre-training Framework☆140Feb 23, 2026Updated last month
- Models and codes for INTERSPEECH 2023 paper DistilXLSR: A Light Weight Cross-Lingual Speech Representation Model☆13Mar 30, 2025Updated 11 months ago
- A toolkit to calculate speech audio quality. Not affiliated with the original authors☆70Aug 13, 2024Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Implementation of the paper: "Audio Mamba: Bidirectional State Space Model for Audio Representation Learning" in pytorch☆14Updated this week
- Generative Expressive Conversational Speech Synthesis (Accepted by MM'2024)☆78Nov 1, 2024Updated last year
- Train no-reference speech quality estimators with multiple datasets via learned, per-dataset alignments.☆18Aug 1, 2025Updated 7 months ago
- The official implementation of TokenSynth (ICASSP 2025)☆81Oct 27, 2025Updated 5 months ago
- Code for vec2wav 2.0, a speech token vocoder for VC. Paper: https://arxiv.org/abs/2409.01995☆79Dec 3, 2024Updated last year
- Official Repository for "Training-Free Multi-Step Audio Source Separation"☆54May 26, 2025Updated 10 months ago
- VAE modified from Descript Audio Codec, which replaces the RVQ with VAE☆90Apr 2, 2024Updated last year