Tonyyouyou / Mamba-in-Speech
☆54 Jul 1, 2024 Updated last year
Alternatives and similar repositories for Mamba-in-Speech
Users who are interested in Mamba-in-Speech are comparing it to the repositories listed below.
- ☆10 Dec 22, 2023 Updated 2 years ago
- Data manipulation and transformation for audio signal processing, powered by PyTorch ☆10 Sep 30, 2024 Updated last year
- ☆16 Nov 9, 2023 Updated 2 years ago
- Target speaker automatic speech recognition (TS-ASR) ☆12 Oct 14, 2023 Updated 2 years ago
- ConMamba for Automatic Speech Recognition ☆102 Aug 12, 2024 Updated last year
- ☆109 Oct 1, 2024 Updated last year
- Models and codes for INTERSPEECH 2023 paper DistilXLSR: A Light Weight Cross-Lingual Speech Representation Model ☆13 Mar 30, 2025 Updated 10 months ago
- Official implementation of the paper "Distilling a Pretrained Language Model to a Multilingual ASR Model" (Interspeech 2022) ☆12 Mar 12, 2024 Updated last year
- [INTERSPEECH 2024] Official pytorch code for the paper "Disentangled Representation Learning for Environment-agnostic Speaker Recognition… ☆18 Jul 23, 2024 Updated last year
- This repository is the official implementation of unimodal aggregation (UMA) for automatic speech recognition (ASR). ☆36 Dec 17, 2024 Updated last year
- ☆18 Mar 13, 2024 Updated last year
- Train no-reference speech quality estimators with multiple datasets via learned, per-dataset alignments. ☆18 Aug 1, 2025 Updated 6 months ago
- The official implementation of DMEL the method presented in the paper "DMEL: The differentiable log-Mel spectrogram as a trainable layer … ☆22 Dec 21, 2024 Updated last year
- ☆16 Dec 18, 2023 Updated 2 years ago
- speaker-disentangled speech linguistic content quantizer ☆24 Mar 19, 2025 Updated 10 months ago
- Code for paper "Gradient Remedy for Multi-Task Learning in End-to-End Noise-Robust Speech Recognition" ☆20 May 24, 2023 Updated 2 years ago
- Demo for DART, Audio Imagination workshop submission in NeurIPS 2024 ☆12 Apr 15, 2025 Updated 10 months ago
- ☆54 Jul 16, 2025 Updated 7 months ago
- Flow control nodes for comfyUI, allowing for more diverse workflows ☆12 Apr 3, 2025 Updated 10 months ago
- [INTERSPEECH 2024] Official code for VoxSim: A perceptual voice similarity dataset ☆12 Sep 29, 2025 Updated 4 months ago
- 🎵 muse: Music Separation ☆11 Feb 14, 2024 Updated 2 years ago
- Implementation of the paper "Variable Bitrate Residual Vector Quantization for Audio Coding" ☆11 Apr 10, 2025 Updated 10 months ago
- The project for speech translation ☆12 Sep 28, 2023 Updated 2 years ago
- One command to start a streaming ASR server. ☆12 Oct 2, 2024 Updated last year
- Domain Adaptation with Adversarial Training on Penultimate Activations (AAAI 2023) ☆11 Aug 1, 2023 Updated 2 years ago
- Code for paper Audio Visual Speaker Localization from EgoCentric Views ☆11 Jul 3, 2024 Updated last year
- ☆11 Jun 14, 2024 Updated last year
- A neural speech codec based on discrete WavLM representations ☆24 Aug 28, 2024 Updated last year
- Official code for MUSE: Flexible Voiceprint Receptive Fields and Multi-Path Fusion Enhanced Taylor Transformer for U-Net-based Speech Enh… ☆54 Mar 5, 2025 Updated 11 months ago
- ☆50 Aug 16, 2023 Updated 2 years ago
- Once more Diarization: Improving meeting transcription systems through segment-level speaker reassignment ☆12 Feb 5, 2025 Updated last year
- official implementation of paper ExPO: Explainable Phonetic Trait-Oriented Network for Speaker Verification ☆14 Mar 14, 2025 Updated 11 months ago
- Adaptive Global-Local Representation Learning and Selection for Cross-Domain Facial Expression Recognition (TMM 2024) ☆16 Aug 13, 2024 Updated last year
- ☆14 Nov 26, 2024 Updated last year
- Cross-Speaker Encoding Network for Multi-talker Speech Recognition ☆11 Mar 14, 2025 Updated 11 months ago
- ☆13 Oct 11, 2024 Updated last year
- ☆15 Nov 11, 2024 Updated last year
- Hpyformer base FunASR ☆30 Nov 5, 2024 Updated last year
- ☆11 May 7, 2022 Updated 3 years ago