muggle-stack/e2e_voice

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/muggle-stack/e2e_voice)

muggle-stack / e2e_voice

High-performance C++ voice interaction framework powered by ONNXRuntime and LLaMA.cpp. Features AEC, VAD, ASR, TTS, LLM, and MCP integration with real-time conversation (RTF < 0.7) even on low-end edge devices.

☆51

Alternatives and similar repositories for e2e_voice

Users that are interested in e2e_voice are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

muggle-stack / sensevoice_cpp
View on GitHub
☆25Mar 8, 2026Updated 4 months ago
RapidAI / RapidSpeech.cpp
View on GitHub
On-device speech AI runtime for ASR, TTS, VAD, and voice cloning. Python-simple, C++-native, GGUF-powered.
☆22Jul 15, 2026Updated 2 weeks ago
L6-NLP / Generative-Annotation-NEC
View on GitHub
Generative_Annotation_NEC: A novel NEC method that utilizes speech sound features to retrieve candidate entities and a generative method …
☆17Dec 2, 2025Updated 7 months ago
wangzhaode / tokenizer.cpp
View on GitHub
A lightweight, production-ready C++ library for LLM tokenization, fully compatible with HuggingFace tokenizer.json.
☆33Jan 4, 2026Updated 6 months ago
k2-fsa / kaldifst
View on GitHub
Python wrapper for OpenFST and its extensions from Kaldi. Also support reading/writing ark/scp files
☆56Apr 9, 2026Updated 3 months ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
Wasser1462 / FunASR-nano-onnx
View on GitHub
A lightweight demo of FunASR-Nano using ONNX runtime.
☆83Feb 25, 2026Updated 5 months ago
thu-spmi / CTC-TTS
View on GitHub
Code for CTC-TTS: LLM-based dual-streaming text-to-speech with CTC alignment, Interspeech 2026.
☆20Jun 9, 2026Updated last month
Huiyicc / gpt_sovits_cpp
View on GitHub
GPT-Sovits的c++实现版本
☆22Jan 9, 2026Updated 6 months ago
DakeQQ / Text-to-Speech-TTS-ONNX
View on GitHub
Utilizes ONNX Runtime for TTS model.
☆65Updated this week
8b-is / IndexTTS-Rust
View on GitHub
☆20May 2, 2026Updated 2 months ago
pkufool / cppinyin
View on GitHub
Converting Chinese sentences into pinyin sequences, implemented in C++, very fast and easy to deploy.
☆23Jan 5, 2026Updated 6 months ago
bluryar / VoxCPM-ONNX
View on GitHub
☆49Mar 18, 2026Updated 4 months ago
carsonpo / safetensors.cpp
View on GitHub
Zero Dependency LibTorch Safetensors Loading and Storing in C++
☆23Jul 12, 2024Updated 2 years ago
ailia-ai / ailia-models-cpp
View on GitHub
C++ version of ailia models repository
☆26May 14, 2026Updated 2 months ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
William1617 / REAL_TIME_NKF_AEC
View on GitHub
☆24Jul 29, 2024Updated 2 years ago
huangcanan / Awesome-Large-Speech-Model
View on GitHub
A repository used to organize content related to Large Speech(Audio) Model, including paper, data, applications, tools and so on.
☆28Nov 8, 2025Updated 8 months ago
lovemefan / paraformer.cpp
View on GitHub
Port of Funasr's Paraformer model in C/C++
☆43Jun 19, 2024Updated 2 years ago
wenet-e2e / wesignal
View on GitHub
Production first, nn-based on-device signal processing toolkit.
☆63May 30, 2023Updated 3 years ago
DakeQQ / Audio-Denoiser-ONNX
View on GitHub
Utilizes ONNX Runtime for audio denoising.
☆134Updated this week
lovemefan / SenseVoice.cpp
View on GitHub
Port of Funasr's Sense-voice model in C/C++
☆569Dec 19, 2025Updated 7 months ago
yuekaizhang / Fun-ASR-vllm
View on GitHub
Fun-ASR is an end-to-end speech recognition large model launched by Tongyi Lab.
☆108Jul 7, 2026Updated 3 weeks ago
ZhouSiChuan08 / CosyVoiceCpp
View on GitHub
The cpp-based deployment of CosyVoice2
☆20Sep 7, 2025Updated 10 months ago
manyeyes / ManySpeech
View on GitHub
AI Speech Solutions for Tasks such as ASR, Vocal Extraction, Accompaniment Extraction, Audio Denoising, and Enhancement, Support models s…
☆85Jun 16, 2026Updated last month
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
k2-fsa / kaldi-decoder
View on GitHub
Decoders from Kaldi using OpenFst
☆35Apr 10, 2026Updated 3 months ago
Xiaobin-Rong / deepvqe
View on GitHub
An unofficial implementation of DeepVQE proposed by Microsoft Corp.
☆148Mar 24, 2025Updated last year
iliasam / ROS_simple_nav
View on GitHub
Simple Navigation Node for the ROS
☆11Feb 26, 2018Updated 8 years ago
DakeQQ / Voice-Activity-Detection-VAD-ONNX
View on GitHub
Utilizes ONNX Runtime for speech activity detection.
☆46Updated this week
v1nh1shungry / structopt
View on GitHub
A lovely structopt library for C++! Parse command line arguments by defining a struct! ❤️
☆12Apr 24, 2023Updated 3 years ago
aispeech-lab / TinyWASE
View on GitHub
PyTorch implementation of TinyWASE described in our paper "Compressing Speaker Extraction Model with Ultra-low Precision Quantization and…
☆11Jun 28, 2021Updated 5 years ago
wythers / zeus
View on GitHub
This is part of the zeus library, just for sharing and funny.
☆35Apr 5, 2023Updated 3 years ago
NiniAndy / Paraformer-V2
View on GitHub
来自于文章Paraformer-v2: An improved non-autoregressive transformer for noise-robust speech recognition
☆29Nov 20, 2024Updated last year
neuralps3d / neuralps3d
View on GitHub
Neural Reflectance Field from Shading and Shadow under a Fixed Viewpoint
☆16Aug 8, 2022Updated 3 years ago
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
jark006 / SummerTTS
View on GitHub
SummerTTS 是一个基于C++的独立编译的中文和英文语音合成项目，可以本地运行不需要网络，而且没有额外的依赖，一键编译完成即可用于中文和英文的语音合成。SummerTTS is a standalone Chinese and English speech synt…
☆25Aug 17, 2024Updated last year
kimsunwiub / BLOOM-Net
View on GitHub
Source code for "BLOOM-Net: Blockwise Optimization for Masking Networks Toward Scalable and Efficient Speech Enhancement"
☆14Feb 13, 2022Updated 4 years ago
coreeey / OCT2Former
View on GitHub
The official code of OCT2Former for Retinal OCT-Angiography vessel segmentation
☆17May 8, 2023Updated 3 years ago
wxqwinner / silero-vad-ncnn
View on GitHub
Silero VAD(ncnn): pre-trained enterprise-grade Voice Activity Detector.
☆26Aug 21, 2024Updated last year
manyeyes / AliFsmnVad
View on GitHub
C# library for decoding Fsmn Vad model , used in speech activity detection
☆23Aug 11, 2025Updated 11 months ago
for-geeks / geek-car
View on GitHub
Geek Car, An Autonomous Application Based On Cyber RT http://for-geeks.com
☆13Feb 5, 2026Updated 5 months ago
Cambricon / easydk
View on GitHub
easy development kit
☆12Apr 18, 2025Updated last year