ighodgao / mamba-speech-synthesisLinks
Jupyter Notebook running Mamba speech synthesis example on Determined AI. Based on https://2084.substack.com/p/2084-marcrandbot-speech-synthesis
☆20Updated last year
Alternatives and similar repositories for mamba-speech-synthesis
Users that are interested in mamba-speech-synthesis are comparing it to the libraries listed below
Sorting:
- ☆88Updated 10 months ago
- [ICASSP 2025] "FLowHigh: Towards efficient and high-quality audio super-resolution with single-step flow matching"☆75Updated 6 months ago
- [SLT'24] The official implementation of SSAMBA: Self-Supervised Audio Representation Learning with Mamba State Space Model☆125Updated 9 months ago
- Open implementation of UNIVERSE and UNIVERSE++ diffusion-based speech enhancement models.☆101Updated 11 months ago
- This is the official implementation of our multi-channel multi-speaker multi-spatial neural audio codec architecture.☆50Updated 4 months ago
- ConMamba for Automatic Speech Recognition☆80Updated 11 months ago
- ☆63Updated last year
- PyTorch implementation of the ICASSP-24 paper: "Improving Audio Captioning Models with Fine-grained Audio Features, Text Embedding Superv…☆37Updated last year
- ☆61Updated 2 years ago
- ☆35Updated 7 months ago
- Viterbi decoding in PyTorch☆36Updated 2 months ago
- ☆44Updated last year
- A lightweight audio codec based on a single quantizer☆66Updated 4 months ago
- Contains the code associated with the ICLR submission for our text-to-speech diffusion model☆54Updated last year
- ☆41Updated 10 months ago
- SCOREQ: Speech COntrastive REgression for Quality Assessment (NeurIPS 2024)☆79Updated last week
- ☆48Updated 4 months ago
- A low-bitrate single-codebook 16 kHz speech codec based on focal modulation☆93Updated 5 months ago
- ☆104Updated 3 months ago
- [Findings of NAACL 2024] Source code of paper CM-TTS: Enhancing Real Time Text-to-Speech Synthesis Efficiency through Weighted Samplers a…☆67Updated last year
- Simplistic Implementation of Zipformer:A faster and better encoder for automatic speech recognition in PyTorch☆10Updated last year
- Official repo of ICASSP 2024 paper - Generative De-Quantization for Neural Speech Codec via Latent Diffusion.☆55Updated last month
- (ICASSP 2025) Learning Source Disentanglement in Neural Audio Codec☆36Updated 2 months ago
- A toolkit to calculate speech audio quality. Not affiliated with the original authors☆53Updated 11 months ago
- SoloAudio: Target Sound Extraction with Language-oriented Audio Diffusion Transformer.☆96Updated 7 months ago
- Official Implementation of EnCLAP (ICASSP 2024)☆92Updated last year
- ☆32Updated 8 months ago
- Official implementation for FlowSep☆58Updated 7 months ago
- Inference codebase for "Cacophony: An Improved Contrastive Audio-Text Model". Preprint: https://arxiv.org/abs/2402.06986☆47Updated 9 months ago
- Streaming Audiotransformers for online Audio tagging☆46Updated last year