ighodgao / mamba-speech-synthesisView external linksLinks
Jupyter Notebook running Mamba speech synthesis example on Determined AI. Based on https://2084.substack.com/p/2084-marcrandbot-speech-synthesis
☆23Feb 8, 2024Updated 2 years ago
Alternatives and similar repositories for mamba-speech-synthesis
Users that are interested in mamba-speech-synthesis are comparing it to the libraries listed below
Sorting:
- FluentTTS: Text-dependent Fine-grained Style Control for Multi-style TTS☆20Nov 15, 2022Updated 3 years ago
- Obsidian plugin to switch Input Method when "InsertLeave" and "InsertEnter". Supports macOS, Windows, and Linux.☆19Dec 19, 2023Updated 2 years ago
- ☆68Dec 30, 2025Updated last month
- ☆70Sep 3, 2024Updated last year
- ☆68Jul 29, 2023Updated 2 years ago
- Repository for speech paper reading☆33Aug 19, 2021Updated 4 years ago
- Binaural impulse responses captured in real rooms.☆37Mar 9, 2016Updated 9 years ago
- Code for paper "Using Phonetic Posteriorgram Based Frame Pairing for Segmental Accent Conversion"☆36Jan 15, 2020Updated 6 years ago
- VITS with phoneme-level prosody modeling based on MaskGIT☆85Aug 31, 2024Updated last year
- ACM MM 2024 FlashSpeech: Efficient Zero-Shot Speech Synthesis☆154Sep 20, 2024Updated last year
- [ACMMM'2024] Generative Expressive Conversational Speech Synthesis☆44Oct 28, 2024Updated last year
- Audio-FLAN☆160Sep 23, 2025Updated 4 months ago
- Demo for DART, Audio Imagination workshop submission in NeurIPS 2024☆12Apr 15, 2025Updated 10 months ago
- A beginner-friendly inference to finetune & run inference on open TTS models 🗣️☆26Feb 4, 2026Updated 2 weeks ago
- ☆11Aug 20, 2025Updated 5 months ago
- Official implementation of "AEROMamba: An efficient architecture for audio super-resolution using generative adversarial networks and sta…☆49Nov 11, 2025Updated 3 months ago
- pytorch CTC implementation for ASR. Use eesen's fst decoder framework☆10Feb 27, 2020Updated 5 years ago
- Visual Leak Detector for Visual C++ 2008-2022☆12Dec 27, 2022Updated 3 years ago
- FaceFusion Windows Installer (unofficial)☆15Aug 5, 2024Updated last year
- Repository for my studies of Causal Inference☆10Dec 1, 2019Updated 6 years ago
- A library of speech gadgets.☆14Oct 15, 2022Updated 3 years ago
- Official Repository of Six Dragons Fly Again (ISMIR 2024)☆13Nov 13, 2025Updated 3 months ago
- 🎵 muse: Music Separation☆11Feb 14, 2024Updated 2 years ago
- Sound Separation, Omni modal☆28Sep 15, 2025Updated 5 months ago
- extending laughbot project to encoder-based transformer model finetuned on same dataset for humor classification☆10Jan 4, 2023Updated 3 years ago
- ☆11Nov 11, 2022Updated 3 years ago
- A fourier-based audio-synthesiser wrote in MATLAB as a university project.☆12Jan 19, 2019Updated 7 years ago
- textgrid.hpp - a C++ TextGrid parser / writer☆10Aug 4, 2021Updated 4 years ago
- Jax implementation of a flexible audio loudness meter in Python with implementation of ITU-R BS.1770-4 loudness algorithm☆12Jan 29, 2025Updated last year
- ☆11Aug 28, 2017Updated 8 years ago
- Target speaker automatic speech recognition (TS-ASR)☆12Oct 14, 2023Updated 2 years ago
- Model for selecting perceptually relevant early reflections for parametric spatial sound rendering☆13Oct 26, 2023Updated 2 years ago
- ☆23Nov 3, 2025Updated 3 months ago
- PyTorch implementation of Retriever: Learning Content-Style Representation☆12Jan 27, 2023Updated 3 years ago
- Simple Telegram bot to annotate and varify automatic speech recognition datasets☆12Mar 30, 2021Updated 4 years ago
- A Python app that converts vocal recordings into MIDI files. Transform your singing into digital music!☆17Nov 1, 2025Updated 3 months ago
- ☆44Sep 19, 2024Updated last year
- ☆54Jul 1, 2024Updated last year
- An unofficial (PyTorch) implementation for the paper Deep Lip Reading: A comparison of models and an online application.☆10May 13, 2020Updated 5 years ago