k2-fsa / sherpa-onnx
Speech-to-text, text-to-speech, speaker diarization, speech enhancement, source separation, and VAD using next-gen Kaldi with onnxruntime, without an Internet connection. Supports embedded systems, Android, iOS, HarmonyOS, Raspberry Pi, RISC-V, RK NPU, Axera NPU, Ascend NPU, x86_64 servers, and websocket server/client, with support for 12 programming languages.
☆9,720 · Updated this week
Alternatives and similar repositories for sherpa-onnx
Users who are interested in sherpa-onnx are comparing it to the libraries listed below.
- Real-time speech recognition and voice activity detection (VAD) using next-gen Kaldi with ncnn without Internet connection. Support iOS, … ☆1,604 · Updated 2 months ago
- Multilingual Voice Understanding Model ☆7,361 · Updated 2 weeks ago
- A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity… ☆14,486 · Updated last week
- High-quality multi-lingual text-to-speech library by MyShell.ai. Support English, Spanish, French, Chinese, Japanese and Korean. ☆7,133 · Updated last year
- https://hf.co/hexgrad/Kokoro-82M ☆5,336 · Updated 5 months ago
- Use Microsoft Edge's online text-to-speech service from Python WITHOUT needing Microsoft Edge or Windows or an API key ☆9,805 · Updated last month
- Silero VAD: pre-trained enterprise-grade Voice Activity Detector ☆7,912 · Updated 2 weeks ago
- Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability. ☆18,989 · Updated this week
- TTS with kokoro and onnx runtime ☆2,335 · Updated 3 weeks ago
- Whisper realtime streaming for long speech-to-text transcription and translation ☆3,519 · Updated 2 months ago
- SOTA Open Source TTS ☆24,602 · Updated last week
- Zero-shot voice conversion & singing voice conversion, with real-time support ☆3,525 · Updated 8 months ago