Real-time speech recognition API (WebSocket)
☆158Jul 16, 2024Updated last year
Alternatives and similar repositories for speech_realtime_api
Users that are interested in speech_realtime_api are comparing it to the libraries listed below
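- Content moderation and rate-limiting service☆26May 18, 2025Updated 9 months ago
- Speech recognition API with real-time recognition and offline recognition of uploaded long audio; supports real-time transcription and simultaneous interpretation for Chinese, English, and the languages of up to 100 countries☆81Dec 30, 2024Updated last year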
- Lightweight VAD model (ten-vad) for edge devices☆40Jan 25, 2026Updated last month
- ☆12Nov 12, 2020Updated 5 years ago
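- Host-computer software for a general-purpose robot controller☆11Feb 10, 2021Updated 5 years ago
- Test tool for Xiaozhi (小智同学) over WebSocket☆47Feb 20, 2025Updated last year
- “alibabacloud-nls-python-sdk provides access to Alibaba Cloud's intelligent speech services, including speech recognition, speech synthesis, and file transcription.”☆79Aug 22, 2025Updated 6 months ago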
- An AI chat bot based on volcengine's webRTC protocol.☆35Apr 27, 2025Updated 10 months ago
- A talking clock in Chinese for esp32 s3 Box with mp3 player and temperature reading☆12May 7, 2023Updated 2 years ago
- Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.☆13Jul 15, 2024Updated last year
- Voice agent using LiveKit (orchestration), Cartesia (TTS), OpenAI (LLM), and Deepgram (STT)☆20Oct 28, 2025Updated 4 months ago
- vad algorithm based on esp32 for mute detection☆13Dec 9, 2018Updated 7 years ago
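- In-house secondary development based on the open-source Xiaozhi (小智) code by 虾哥, mainly adding IoT control: servos and lights, complete code for controlling a puppy (小狗), robot control, and more; continuously updated☆17Mar 31, 2025Updated 11 months ago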
- ☆16Nov 20, 2022Updated 3 years ago
- Moatless Testbeds allows you to create isolated testbed environments in a Kubernetes cluster where you can apply code changes through git…☆14Apr 9, 2025Updated 10 months ago
- Real-time voice-conversation smart assistant built on MicroPython, esp32s3, and the Doubao (豆包) voice agent☆24Jun 29, 2025Updated 8 months ago
- Memory efficient transducer loss computation☆69Jun 10, 2022Updated 3 years ago
- Python wrapper for kaldi's arpa2fst☆38Aug 27, 2025Updated 6 months ago
- YunDo is an open-source intelligent dialogue system based on large models.☆21Aug 9, 2024Updated last year
- Simple C++ wrapper of the SRT protocol for building Server/Client transport solutions☆20May 7, 2024Updated last year
- Implemented a script that automatically adjusts Qwen3's inference and non-inference capabilities, based on an OpenAI-like API. The infere…☆22May 9, 2025Updated 9 months ago
- Apply https://github.com/k2-fsa/sherpa-ncnn in live streaming and WebRTC☆20Apr 16, 2023Updated 2 years ago
- Segment a given audio into utterances using a trained end-to-end ASR model.☆74Oct 9, 2020Updated 5 years ago
- This is a 🐱 with vits☆20Oct 3, 2023Updated 2 years ago
- HMM, CTC, RNN-Transducer, forward-backward algorithm☆20Sep 5, 2023Updated 2 years ago
- ☆23Oct 17, 2024Updated last year
- Voice chatbot based on esp32-c3 and the Volcano Engine (火山引擎) streaming API☆27Oct 15, 2024Updated last year
- Flutter: A mobile assistant app utilizing the OpenAI GPT and ElevenLabs Voice Text-To-Speech APIs.☆25Aug 25, 2023Updated 2 years ago
- API and websocket server for sensevoice. It has inherited some enhanced features, such as VAD detection, real-time streaming recognition,…☆538Oct 23, 2024Updated last year
- Decoders from Kaldi using OpenFst☆34Jan 29, 2026Updated last month
- python wrapper for kaldi's native I/O☆27Jan 9, 2025Updated last year
- Integrates UniMRCP, iFlytek (讯飞) speech recognition, and FreeSWITCH. Uses the WebRTC VAD module to improve UniMRCP's VAD functionality☆55Oct 24, 2019Updated 6 years ago
- 🛍 A full e-commerce app with a nice UI, consisting of onboarding, login, sign-up, home, product details, cart, and user profile screens.☆10Sep 8, 2024Updated last year
- FreeSWITCH's ESL SWIG wrapper for Python packaged with setuptools☆25Aug 26, 2022Updated 3 years ago
- Implementation of Imputer: Sequence Modelling via Imputation and Dynamic Programming in PyTorch☆58May 3, 2020Updated 5 years ago
- Visual dialogue for Xiaozhi (小智)☆32Apr 25, 2025Updated 10 months ago
- xiaozhi based on MicroPython☆38Apr 19, 2025Updated 10 months ago
- ☆68Jan 15, 2026Updated last month
- Xiaozhi (小智) recreation of the Doubao (豆包) phone, based on Zhipu's (智谱) Phone Agent☆65Dec 18, 2025Updated 2 months ago