xi-j / Mamba-ASR
ConMamba for Automatic Speech Recognition
☆77 · Updated 10 months ago
Alternatives and similar repositories for Mamba-ASR
Users who are interested in Mamba-ASR are comparing it to the repositories listed below.
- Official repository of NeXt-TDNN for speaker verification ☆73 · Updated 8 months ago
- Clustering-based methods for overlapping diarization ☆80 · Updated last year
- ☆82 · Updated 8 months ago
- The project is associated with the recently-launched INTERSPEECH 2025 Workshop on Multilingual Conversational Speech Language Model (MLC-… ☆38 · Updated last month
- Official implementation of the paper "BigCodec: Pushing the Limits of Low-Bitrate Neural Speech Codec" ☆170 · Updated 9 months ago
- LLaSE-G1: Incentivizing Generalization Capability for LLaMA-based Speech Enhancement ☆74 · Updated 2 months ago
- Implementation of the paper "Self-supervised Learning with Random-projection Quantizer for Speech Recognition" in PyTorch. ☆81 · Updated 2 years ago
- ☆43 · Updated 2 years ago
- wav2vec2 audio classification for prosodic boundary detection and other tasks ☆42 · Updated last year
- ☆56 · Updated 2 months ago
- ☆71 · Updated last year
- Unofficial PyTorch implementation of SoundStream with training code and a 16 kHz pretrained checkpoint ☆69 · Updated 2 years ago
- Code for the Interspeech 2024 paper "MM-KWS: Multi-modal Prompts for Multilingual User-defined Keyword Spotting" ☆32 · Updated last month
- [AAAI 2024] Code for CTX-vec2wav in UniCATS ☆129 · Updated last year
- Reference-aware automatic speech evaluation toolkit ☆155 · Updated 6 months ago
- CHIME-7/8 diarization champion system: neural speaker diarization using memory-aware multi-speaker embedding with sequence-to-sequence ar… ☆78 · Updated last week
- Official repository for Mamba-based Segmentation Model for Speaker Diarization ☆37 · Updated last month
- Official data preparation scripts for the URGENT 2024 Challenge ☆80 · Updated last month
- Official implementation for Fast-HuBERT: An Efficient Training Framework for Self-Supervised Speech Representation Learning ☆93 · Updated 7 months ago
- SLT 2024 Challenge: Post-ASR-Speaker-Tagging ☆14 · Updated last year
- LLaSE-G1: Incentivizing Generalization Capability for LLaMA-based Speech Enhancement ☆42 · Updated 3 months ago
- A list of papers for child ASR ☆42 · Updated 8 months ago
- PyTorch implementation of "CleanMel: Mel-Spectrogram Enhancement for Improving Both Speech Quality and ASR". ☆60 · Updated last week
- Source for the Interspeech 2024 Paper "Scaling up masked audio encoder learning for general audio classification" ☆68 · Updated 2 months ago
- This is an evolving repo for the paper "Towards Controllable Speech Synthesis in the Era of Large Language Models: A Survey". ☆146 · Updated this week
- Audio-FLAN ☆157 · Updated 3 months ago
- Apply Score diffusion to improve speech signals recorded under various adverse conditions and distortions, including noise, reverberation… ☆63 · Updated 10 months ago
- A simple package for Guided source separation (GSS) ☆124 · Updated last year
- Open implementation of UNIVERSE and UNIVERSE++ diffusion-based speech enhancement models. ☆98 · Updated 9 months ago
- Adapting Self-Supervised Models to Multi-Talker Speech Recognition Using Speaker Embeddings ☆28 · Updated 2 years ago