Baidu-AIP/speech_realtime_api

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/Baidu-AIP/speech_realtime_api)

Baidu-AIP / speech_realtime_api

实时语音识别API WebSocket

☆161

Alternatives and similar repositories for speech_realtime_api

Users that are interested in speech_realtime_api are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

Baidu-AIP / speech-demo
View on GitHub
语音api示例
☆710Jul 25, 2024Updated last year
zmeet-ai / asr_demo
View on GitHub
语音识别API，分实时语音和长语音离线上传识别，支持中英文等多达100个国家的语言实时转写和同声传译
☆85Dec 30, 2024Updated last year
aliyun / alibabacloud-nls-python-sdk
View on GitHub
“alibabacloud-nls-python-sdk提供使用阿里云智能语音服务的能力，包括语音识别、语音合成、文件转写等。”
☆81Aug 22, 2025Updated 10 months ago
wenet-e2e / wecut
View on GitHub
video cut powered by AI
☆24Nov 15, 2022Updated 3 years ago
Ryuk17 / ten-vad-edge
View on GitHub
Light-weight vad model(ten-vad) on edge device
☆43Jan 25, 2026Updated 5 months ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
lewangdev / CosyVoice
View on GitHub
Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.
☆13Jul 15, 2024Updated last year
liu2548253579 / text_voice_transfer_ifly
View on GitHub
基于科大讯飞平台的语音转文字和文字转语音程序
☆27Apr 23, 2023Updated 3 years ago
MeowHardware / esp32-chatmeow-server
View on GitHub
这是有vits的🐱
☆20Oct 3, 2023Updated 2 years ago
cartesia-ai / cartesia-livekit-voice-agent
View on GitHub
Voice agent using LiveKit (orchestration), Cartesia (STT + TTS), and OpenAI (LLM)
☆24Jun 3, 2026Updated 3 weeks ago
hedgeli / SimpleOffLineASR_Demo_sw
View on GitHub
离线命令词语音识别 Offline Simple Word ASR
☆16Jun 18, 2020Updated 6 years ago
ClimbSnail / RobotGeneralController
View on GitHub
通用机器人控制器上位机
☆11Feb 10, 2021Updated 5 years ago
csukuangfj / kaldilm
View on GitHub
Python wrapper for kaldi's arpa2fst
☆38Aug 27, 2025Updated 10 months ago
HonestQiao / xiaozhi-py
View on GitHub
小智同学测试工具(websocket)
☆45Feb 20, 2025Updated last year
Eric0308 / xiaozhi-client
View on GitHub
这是一个用于连接小智AI服务的Python客户端库。它提供了简单的接口来进行语音对话和文本交互。
☆27Mar 14, 2025Updated last year
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
zhou19830318 / doubao_ai_agent_esp32
View on GitHub
基于micropython的esp32s3+豆包语音智能体实时语音对话智能助手
☆29Jun 29, 2025Updated last year
ruzhila / voiceapi
View on GitHub
Streaming ASR and TTS based on FastAPI+ sherpa-onnx
☆218Nov 2, 2025Updated 7 months ago
laoyin / freeswitch_docker_file
View on GitHub
☆36Oct 12, 2023Updated 2 years ago
miuser00 / BLEComm
View on GitHub
BLEComm based on new API of Windows10 OS. The tool could perform BLE device search, service and characteristics read/write and general BL…
☆16Nov 10, 2023Updated 2 years ago
ahh666 / vue-node-mysql
View on GitHub
【全栈初体验】Vue+Node+MySQL 实现前后端分离开发
☆15Feb 28, 2023Updated 3 years ago
fengfeng0328 / esp32_speech-vad-demo
View on GitHub
vad algorithm based on esp32 for mute detection
☆14Dec 9, 2018Updated 7 years ago
laoyin / freeswitch_admin_ui
View on GitHub
freeswitch_admin_ui
☆122Jan 7, 2024Updated 2 years ago
cornerfarmer / ctc_segmentation
View on GitHub
Segment a given audio into utterances using a trained end-to-end ASR model.
☆75Oct 9, 2020Updated 5 years ago
csukuangfj / kaldi_native_io
View on GitHub
python wrapper for kaldi's native I/O
☆27Jan 9, 2025Updated last year
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
wilinz / flutter_edge_tts
View on GitHub
Flutter tts，使用 edge 大声朗读接口
☆17Jul 2, 2023Updated 2 years ago
Baidu-AIP / sdk-demo
View on GitHub
百度AI平台RESTful API SDK调用的示例
☆29Sep 3, 2019Updated 6 years ago
winlinvip / srs-k2
View on GitHub
Apply https://github.com/k2-fsa/sherpa-ncnn in live streaming and WebRTC
☆20Apr 16, 2023Updated 3 years ago
0x5446 / api4sensevoice
View on GitHub
API and websocket server for sensevoice. It has inherited some enhanced features, such as VAD detection, real-time streaming recognition,…
☆540Oct 23, 2024Updated last year
jgpeiro / esp32_ai_assistant
View on GitHub
☆16Nov 20, 2022Updated 3 years ago
TochusC / OpenAI-MicrosoftTTS-QQ-Robot-by-YiriMirai
View on GitHub
通过调用OpenAI和MicrosoftTTS提供的API，而实现的支持语音聊天的ChatGPT QQ机器人。 A QQ chat robot implemented using YiriMirai with OpenAI and Microsoft TTS API
☆17Sep 17, 2024Updated last year
Gager-Git-life / llm_sts
View on GitHub
A real-time voice conversation system based on WebSocket and LLM, integrating Automatic Speech Recognition (ASR), Large Language Model co…
☆20Feb 11, 2025Updated last year
Gyanano / LLM_Assistant
View on GitHub
A voice assistant that calls a large language model made using the Lichuang ESP32-C3 development board.
☆18May 19, 2024Updated 2 years ago
319374267 / alipay-node
View on GitHub
前段时间自己的网站涉及到支付功能（自己网站后台是node.js开发的），在阅读了官方文档之后，打算在git上找一下开源的支付接口，没想到一个都不能使用，最后无赖自己根据官网资料，自己写了这个接口，原理比较简单，其实没有那么复杂，希望对初学者有帮助，如果有错误还望指出（本接…
☆12Aug 1, 2017Updated 8 years ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
pengzhendong / audio-pipeline
View on GitHub
☆23Oct 17, 2024Updated last year
afei-cn / JniDemo
View on GitHub
☆22Feb 13, 2020Updated 6 years ago
noahvelasco / AVA-Advanced-Virtual-Agent
View on GitHub
Flutter: A mobile assistant app utilizing OpenAI GPT and ElevenLabs Voice Text-To-Speech API's.
☆24Aug 25, 2023Updated 2 years ago
ashuangweiwang / xiaozhi_huanmengAI
View on GitHub
本项目基于虾哥小智开源代码进行自研二次开发，主要加入物联网控制部分，控制舵机，灯光；控制小狗完整代码，控制机器人等，持续更新
☆17Mar 31, 2025Updated last year
zhu260824 / face_mnn
View on GitHub
Android，MNN，人脸检测
☆24Mar 13, 2023Updated 3 years ago
cambridgeltl / ECNMT
View on GitHub
Emergent Communication Pretraining for Few-Shot Machine Translation
☆13Dec 3, 2020Updated 5 years ago
Gager-Git-life / ESP32-INMP441-VOICE-INPUT
View on GitHub
此为树莓派语音机器人音频输入终端节点，INMP441利用I2S协议采集音频数据，ESP32利用TCP把数据流传输至树莓派服务器。
☆19Apr 4, 2019Updated 7 years ago