wwbin2017/bailing

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/wwbin2017/bailing)

wwbin2017 / bailing

百聆是一个类似GPT-4o的语音对话机器人，通过ASR+LLM+TTS实现，集成DeepSeek R1等优秀大模型，接入openClaw，真正的个人语音助手，时延低至800ms，Mac等低配置也可运行，支持打断

☆1,740

Alternatives and similar repositories for bailing

Users that are interested in bailing are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

xinnan-tech / xiaozhi-esp32-server
View on GitHub
本项目为xiaozhi-esp32提供后端服务，帮助您快速搭建ESP32设备控制服务器。Backend service for xiaozhi-esp32, helps you quickly build an ESP32 device control server.
☆10,105Updated this week
TOM88812 / xiaozhi-web-client
View on GitHub
如果想体验小智项目，或者开发server端测试的同志，可以使用这个web端damo 体验下。语音端已经完成，文字端完成，可以语音加文字输出。等迭代慢慢完善。欢迎PR
☆183Jun 7, 2025Updated last year
huangjunsen0406 / py-xiaozhi
View on GitHub
Open-source AI assistant ecosystem with MCP integrations, multimodal workflows, IoT support, and cross-platform voice interaction.
☆3,419Updated this week
ABexit / ASR-LLM-TTS
View on GitHub
This is a speech interaction system built on an open-source model, integrating ASR, LLM, and TTS in sequence. The ASR model is SenceVoice…
☆1,260Jun 3, 2026Updated last month
TOM88812 / xiaozhi-android-client
View on GitHub
一个基于小智、xiaozhi-server的Android、IOS语音对话应用,支持实时语音交互和文字对话。现在是flutter版本，打通IOS、Android端。请同志们动动小手，点点小星星，予以鼓励。
☆1,545May 29, 2026Updated last month
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
78 / xiaozhi-esp32
View on GitHub
An MCP-based chatbot | 一个基于MCP的聊天机器人
☆28,253Updated this week
78 / xiaozhi
View on GitHub
Build your own AI friend
☆776Jun 7, 2025Updated last year
modelscope / FunASR
View on GitHub
Open-source speech recognition toolkit for training, inference, streaming ASR, VAD, punctuation, speaker diarization pipelines, and OpenA…
☆19,387Updated this week
FunAudioLLM / SenseVoice
View on GitHub
Open-source SenseVoiceSmall model for Mandarin, Cantonese, English, Japanese, and Korean ASR, language ID, emotion recognition, and audio…
☆8,911Updated this week
Henry-23 / VideoChat
View on GitHub
实时交互数字人，可自定义形象与音色，支持音色克隆，对话延迟低至3s。Real-time voice interactive digital human, customizable appearance and voice, supporting voice cloning,…
☆1,295Dec 18, 2025Updated 7 months ago
joey-zhou / xiaozhi-esp32-server-java
View on GitHub
小智ESP32的Java企业级管理平台，提供设备监控、音色定制、角色切换和对话记录管理的前后端及服务端一体化解决方案
☆1,314May 21, 2026Updated 2 months ago
HonestQiao / xiaozhi-py
View on GitHub
小智同学测试工具(websocket)
☆45Feb 20, 2025Updated last year
FunAudioLLM / CosyVoice
View on GitHub
Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.
☆22,323May 25, 2026Updated last month
FireRedTeam / FireRedASR
View on GitHub
Open-source industrial-grade ASR models supporting Mandarin, Chinese dialects and English, achieving a new SOTA on public Mandarin ASR be…
☆1,937Feb 25, 2026Updated 4 months ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
zhh827 / py-xiaozhi
View on GitHub
☆243Nov 25, 2025Updated 7 months ago
modelscope / 3D-Speaker
View on GitHub
A Repository for Single- and Multi-modal Speaker Verification, Speaker Recognition and Speaker Diarization
☆3,060Dec 8, 2025Updated 7 months ago
lipku / LiveTalking
View on GitHub
Real time interactive streaming digital human
☆8,467Updated this week
zai-org / GLM-4-Voice
View on GitHub
GLM-4-Voice | 端到端中英语音对话模型
☆3,206Dec 5, 2024Updated last year
TEN-framework / ten-framework
View on GitHub
Open-source framework for conversational voice AI agents
☆10,935Updated this week
Ikaros-521 / RealtimeSTT_LLM_TTS
View on GitHub
实时STT，连接OpenAI接口/智谱AI（流式LLM）和GPT-SOVITS/Edge-TTS，通过网页的方式，进行跨网络的服务调用，实现实时对话的效果
☆433Dec 31, 2024Updated last year
big-mouth-cn / talkx
View on GitHub
TalkX，一个开源的AI大模型聊天平台，支持编程插件、小智设备连接使用。
☆98Oct 16, 2025Updated 9 months ago
0x5446 / api4sensevoice
View on GitHub
API and websocket server for sensevoice. It has inherited some enhanced features, such as VAD detection, real-time streaming recognition,…
☆538Oct 23, 2024Updated last year
stepfun-ai / Step-Audio
View on GitHub
☆34Mar 16, 2026Updated 4 months ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
k2-fsa / sherpa-onnx
View on GitHub
Speech-to-text, text-to-speech, speaker diarization, speech enhancement, source separation, and VAD using next-gen Kaldi with onnxruntime…
☆13,696Updated this week
jianchang512 / cosyvoice-api
View on GitHub
一个用于CosyVoice的api接口项目
☆333Aug 31, 2025Updated 10 months ago
snakers4 / silero-vad
View on GitHub
Silero VAD: pre-trained enterprise-grade Voice Activity Detector
☆9,636Updated this week
xinhecuican / QSmartAssistant
View on GitHub
一个模块化，全过程可离线，低占用率的对话机器人/智能音箱
☆159Mar 25, 2026Updated 3 months ago
2noise / ChatTTS
View on GitHub
A generative speech model for daily dialogue.
☆39,652Apr 10, 2026Updated 3 months ago
xinchen-ai / Westlake-Omni
View on GitHub
☆203Sep 24, 2024Updated last year
rany2 / edge-tts
View on GitHub
Use Microsoft Edge's online text-to-speech service from Python WITHOUT needing Microsoft Edge or Windows or an API key
☆11,553Mar 22, 2026Updated 3 months ago
pengzhendong / streaming-sensevoice
View on GitHub
Pseudo Streaming SenseVoice with Hotwords
☆466Jun 15, 2026Updated last month
espressif / esp-sr
View on GitHub
Speech recognition
☆1,441Updated this week
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
Kedreamix / Linly-Talker
View on GitHub
Digital Avatar Conversational System - Linly-Talker. 😄✨ Linly-Talker is an intelligent AI system that combines large language models (LL…
☆3,394Feb 10, 2026Updated 5 months ago
morettt / SenseAI
View on GitHub
一个结合了ASR+LLM+TTS+监控的多功能AI机器人。支持所有以open ai为API调用格式的模型。支持LLM模型流式输出，以及对话打断、视频对话
☆29Apr 15, 2025Updated last year
kleinlee / DH_live
View on GitHub
每个人都能用的数字人
☆2,074May 21, 2026Updated 2 months ago
fishaudio / fish-speech
View on GitHub
SOTA Open Source TTS
☆31,346Jun 9, 2026Updated last month
HumanAIGC-Engineering / OpenAvatarChat
View on GitHub
☆3,636Jun 9, 2026Updated last month
labring / FastGPT
View on GitHub
FastGPT is a knowledge-based platform built on the LLMs, offers a comprehensive suite of out-of-the-box capabilities such as data process…
☆29,059Updated this week
ruzhila / voiceapi
View on GitHub
Streaming ASR and TTS based on FastAPI+ sherpa-onnx
☆221Nov 2, 2025Updated 8 months ago