muggle-stack / e2e_voiceLinks
基于ONNXRuntime以及LLama.cpp推理引擎实现的高性能C++语音推理框架,在性能极差的边缘设备上都能做到RTF<0.7实时对话。
☆38Updated last month
Alternatives and similar repositories for e2e_voice
Users that are interested in e2e_voice are comparing it to the libraries listed below
Sorting:
- A enterprise-grade Voice Activity Detector from modelscope and funasr.☆125Updated 2 years ago
- 端到端语音唤醒工具箱,从模型训练到模型推理。☆152Updated 5 months ago
- Fun-ASR is an end-to-end speech recognition large model launched by Tongyi Lab.☆69Updated 3 weeks ago
- Utilizes ONNX Runtime to transcribe audio into text.☆78Updated last week
- IndexTTS Fine-tuning notebooks☆132Updated 7 months ago
- SummerAsr 是一个基于C++的可独立编译且几乎没有额外依赖库的本地中文语音识别器。 Summer Asr is a Chinese automatic speech recognize project written with C++ that can be eas…☆101Updated last year
- Port of Funasr's Paraformer model in C/C++☆39Updated last year
- SenseVoice-python: A enterprise-grade open source multi-language asr system from funasr opensource with onnxruntime☆108Updated 4 months ago
- A lightweight pure C++ Text-to-Speech (TTS) pipeline with OpenVINO, supporting multiple languages.☆93Updated 4 months ago
- Utilizes ONNX Runtime for audio denoising.☆113Updated last month
- paraformer(chinense asr) online onnx runtime for python☆53Updated last year
- stt websockect server using sherpa-onnx☆46Updated 6 months ago
- Python Wrapper of Silero VAD☆64Updated 9 months ago
- ☆149Updated 2 years ago
- Utilizes ONNX Runtime for speech activity detection.☆41Updated last month
- We Speech Transcript based on LLM, in 300 lines of code.☆183Updated 7 months ago
- 语音识别模型pytorch转ONNX转MNN,C++实现部署☆83Updated 3 years ago
- Grapheme-to-Phoneme for Mixed Chinese (Mandarin or Cantonese) and English.☆114Updated 2 months ago
- Kaldi-compatible online fbank extractor without external dependencies☆141Updated 3 months ago
- Efficient audio understanding with general audio captions☆397Updated 3 months ago
- FlashCosyVoice: A lightweight vLLM implementation built from scratch for CosyVoice.☆241Updated 2 months ago
- paraformer web server build with sanic☆28Updated 2 years ago
- MooER: Moore-threads Open Omni model for speech-to-speech intERaction. MooER-omni includes a series of end-to-end speech interaction mode…☆219Updated last year
- 基于FunASR实现语音识别,包含常规版和ONNX版(推荐)。☆48Updated last year
- Baichuan-Audio: A Unified Framework for End-to-End Speech Interaction☆216Updated 11 months ago
- 修复funasr中seaco-paraformer导出onnx后没有时间戳的bug☆24Updated last year
- A Large-scale Cantonese Speech Corpus with Multi-dimensional Annotation☆264Updated 2 months ago
- Converting Chinese sentences into pinyin sequences, implemented in C++, very fast and easy to deploy.☆19Updated last month
- A cross platform implementation of Text-to-Speech based on ONNXRuntime.☆32Updated 2 years ago
- Causal streaming adaptation of OpenAI Whisper for real-time transcription on small audio chunks.☆62Updated 4 months ago