Real-time, lightweight software for generating non-linguistic behaviors (turn-taking, backchanneling, and head nodding) in conversational AIs
☆81 · Feb 20, 2026 · Updated last week
Alternatives and similar repositories for MaAI
Users who are interested in MaAI compare it to the libraries listed below:
- A real-time implementation of Voice Activity Projection (VAP) is aimed at controlling behaviors of spoken dialogue systems, such as turn-… ☆93 · Jul 24, 2025 · Updated 7 months ago
- Fine-tuning Moshi/J-Moshi on your own spoken dialogue data ☆88 · Jan 5, 2026 · Updated last month
- Speech Resynthesis and Language Modeling ☆27 · Jun 11, 2025 · Updated 8 months ago
- ☆15 · Nov 10, 2025 · Updated 3 months ago
- JATTS: A modern, research-oriented Japanese Text-to-speech Open-sourced Toolkit ☆44 · May 26, 2025 · Updated 9 months ago
- ☆13 · Oct 27, 2021 · Updated 4 years ago
- A neural speech codec based on discrete WavLM representations ☆24 · Aug 28, 2024 · Updated last year
- Repository for sharing the data in the Tamasheq language, one of the target languages for the low-resource speech translation track at IW… ☆18 · Nov 30, 2022 · Updated 3 years ago
- ☆19 · Updated this week
- Deploy Kaldi models using grpc for bidirectional streaming. ☆17 · Sep 30, 2024 · Updated last year
- An official implementation of Style-Talker for Spoken Dialogue Generation ☆23 · Jan 12, 2025 · Updated last year
- wake word spotting with kaldi ☆19 · Dec 3, 2020 · Updated 5 years ago
- 5Hz Deep-Compression Speech VAE for AR-Diffusion and CALMs ☆57 · Nov 19, 2025 · Updated 3 months ago
- AI based singing voice synthesis ☆37 · Jun 10, 2024 · Updated last year
- Official code of SenSE. ☆74 · Oct 30, 2025 · Updated 4 months ago
- [ACL 2025] OZSpeech: One-step Zero-shot Speech Synthesis with Learned-Prior-Conditioned Flow Matching ☆45 · Feb 9, 2025 · Updated last year
- Temporary anonymous version ☆22 · Mar 20, 2024 · Updated last year
- Coco-Nut (Corpus of connecting NIHONGO utterance and text) corpus ☆21 · Jun 12, 2024 · Updated last year
- ☆54 · Jul 16, 2025 · Updated 7 months ago
- ☆13 · Sep 25, 2024 · Updated last year
- Implementation of the paper "Variable Bitrate Residual Vector Quantization for Audio Coding" ☆11 · Apr 10, 2025 · Updated 10 months ago
- ☆13 · Apr 14, 2024 · Updated last year
- ☆13 · Oct 25, 2024 · Updated last year
- Provides simple decoding and encoding of audio codecs for Unity. ☆16 · Mar 21, 2023 · Updated 2 years ago
- ☆31 · Oct 29, 2024 · Updated last year
- A TTS Trained on Universal Audio. ☆41 · Jun 6, 2025 · Updated 8 months ago
- Non Parallel Voice Conversion based on VITS ☆24 · Mar 31, 2023 · Updated 2 years ago
- ☆25 · Mar 12, 2022 · Updated 3 years ago
- ☆25 · Jun 14, 2022 · Updated 3 years ago
- EfficientNet-Absolute Zero for Continuous Speech Keyword Spotting ☆23 · Jun 16, 2022 · Updated 3 years ago
- PyTorch implementation of "Source Separation by Flow Matching (FLOSS)" by Google DeepMind ☆92 · Nov 24, 2025 · Updated 3 months ago
- Java Bindings for the C++ library DeepSpeech ☆10 · Jun 4, 2020 · Updated 5 years ago
- Using OpenVINO to speed up MeloTTS inference ☆15 · Nov 1, 2024 · Updated last year
- ☆22 · Jul 30, 2025 · Updated 7 months ago
- This is a subset of the DALI set consisting of 240 polyphonic recordings that is used to benchmark lyrics transcription evaluation. ☆12 · Nov 30, 2021 · Updated 4 years ago
- Enable RNNLM lattice rescoring with Pytorch [kaldi] ☆12 · Jun 5, 2020 · Updated 5 years ago
- ☆13 · Oct 11, 2024 · Updated last year
- Implementation of different noise embeddings for noise aware training of Kaldi acoustic models. ☆13 · Feb 13, 2021 · Updated 5 years ago
- Speech recognition module for Python, supporting several engines and APIs, online and offline. ☆13 · Mar 9, 2022 · Updated 3 years ago