sipeed / Maix-SpeechView external linksLinks
Maix Speech AI lib, a fast and small speech lib running on embedded devices, including ASR, chat, TTS etc.
☆360Sep 28, 2022Updated 3 years ago
Alternatives and similar repositories for Maix-Speech
Users that are interested in Maix-Speech are comparing it to the libraries listed below
Sorting:
- New MaixCDK will replace this repo: https://github.com/sipeed/MaixCDK☆68Aug 4, 2023Updated 2 years ago
- Open Source Speech/Text Data on AI☆19Sep 13, 2022Updated 3 years ago
- A repository comprising of code for generation of noisy speech data from clean data using deep learning methods☆15Jul 12, 2021Updated 4 years ago
- ICASSP2022 TTS&VC Summary☆14Jun 9, 2022Updated 3 years ago
- Production First and Production Ready End-to-End Keyword Spotting Toolkit☆690Sep 17, 2025Updated 4 months ago
- Neural network-based forced alignment with bidirectional attention mechanism☆78Jan 17, 2025Updated last year
- 这是一个用C++实现ASR推理的项目,它依赖很少,安装也很简单,推理速度很快,在树莓派4B等ARM平台也可以流畅的运行。 支持的模型是由Google的Transformer模型中优化而来,数据集是开源wenetspeech(10000+小时)或阿里私有数据集(60000+小…☆545Mar 19, 2023Updated 2 years ago
- a lightweight speech processing toolkit based on Pytorch and (Py)Kaldi☆344Dec 25, 2020Updated 5 years ago
- Python sdk for Sipeed Maix-II-Dock(v831). Other board please use https://github.com/sipeed/MaixPy☆175Jan 20, 2024Updated 2 years ago
- PyTorch implementation of Retriever: Learning Content-Style Representation☆12Jan 27, 2023Updated 3 years ago
- Towards hot directions in industrial end to end speech recognition☆331Nov 30, 2021Updated 4 years ago
- magicspeech competition recipe☆18Jun 29, 2020Updated 5 years ago
- simple energy vad☆19Jun 3, 2017Updated 8 years ago
- PPSpeech: Phrase based Parallel End-to-End TTS System☆35Aug 31, 2020Updated 5 years ago
- C/C++ development kit for Sipeed Maix ecosystem boards☆108Feb 2, 2026Updated last week
- ☆45Oct 24, 2020Updated 5 years ago
- E2E system with LF-MMI; word N-gram for Mandarin☆166Apr 29, 2022Updated 3 years ago
- Linear Prediction Coefficients estimation from mel-spectrogram implemented in Python based on Levinson-Durbin algorithm.☆71Mar 19, 2021Updated 4 years ago
- Production First and Production Ready End-to-End Speech Recognition Toolkit☆5,026Dec 19, 2025Updated last month
- Official implementation of EdiTTS: Score-based Editing for Controllable Text-to-Speech (INTERSPEECH 2022)☆121Jan 24, 2023Updated 3 years ago
- Based on https://github.com/fatchord/WaveRNN☆24May 3, 2020Updated 5 years ago
- Pronunciation lexicon covering both English and Chinese languages for Automatic Speech Recognition.☆262Oct 11, 2019Updated 6 years ago
- ☆276Jan 15, 2021Updated 5 years ago
- ☆147Aug 2, 2020Updated 5 years ago
- DaCiDian is an open-sourced chinese mandarin lexicon for automatic speech recognition(ASR)☆301Jun 15, 2020Updated 5 years ago
- ☆25Mar 12, 2022Updated 3 years ago
- Python wrapper for kaldi's arpa2fst☆37Aug 27, 2025Updated 5 months ago
- MicroPython for K210 RISC-V, let's play with edge AI easier☆1,713Jun 17, 2024Updated last year
- Interface for Controllable Expressive Talking Machine☆40Sep 20, 2025Updated 4 months ago
- video cut powered by AI☆24Nov 15, 2022Updated 3 years ago
- Labels for kiritan_singing data with extra resources for DNN-based singing voice synthesis (SVS) systems.☆29Dec 31, 2023Updated 2 years ago
- Multipurpose Multi Speaker Mixture Signal Generator☆46Feb 6, 2025Updated last year
- ☆15Oct 11, 2019Updated 6 years ago
- Code for "Distribution-based Emotion Recognition in Conversation"☆19Feb 6, 2023Updated 3 years ago
- ☆40Aug 15, 2021Updated 4 years ago
- ☆43May 16, 2022Updated 3 years ago
- ☆25Apr 24, 2019Updated 6 years ago
- Large, modern dataset for speech recognition☆719Feb 26, 2024Updated last year
- Standalone implementation of the CUDA-accelerated WFST Decoder available in Riva☆91Feb 18, 2025Updated 11 months ago