openai / openai-realtime-embedded

Instructions on how to use the Realtime API on Microcontrollers and Embedded Platforms

☆1,568

Alternatives and similar repositories for openai-realtime-embedded

Users who are interested in openai-realtime-embedded compare it to the libraries listed below.

akdeb / ElatoAI
Realtime AI speech with OpenAI Realtime API and Gemini Live API on Arduino ESP32 with Secure Websockets and Deno Edge Functions with >15 …
☆1,106 Updated this week
espressif / esp-brookesia
ESP-Brookesia is a human-machine interaction development framework designed for AIoT devices.
☆359 Updated 2 weeks ago
FoloToy / folotoy-server-self-hosting
Config files for self-hosting the FoloToy Community Server. Documents: https://docs.folotoy.com
☆555 Updated 8 months ago
wwbin2017 / bailing
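Bailing (百聆) is a GPT-4o-style voice conversation bot built from ASR + LLM + TTS. It integrates strong large models such as DeepSeek R1, achieves latency as low as 800 ms, runs on low-spec machines such as Macs, and supports interruption.
☆1,370 Updated last week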
ideamark / desk-emoji
Desk-Emoji is a truly open-source AI desktop robot featuring an emoji screen, a two-axis console, and LLM capabilities for voice chat.
☆475 Updated last week
wangzongming / esp-ai
The simplest and lowest-cost AI integration solution. If you like this project, please give it a Star…
☆714 Updated last month
TEN-framework / ten-vad
Voice Activity Detector (VAD) from TEN: low-latency, high-performance and lightweight
☆1,085 Updated 2 weeks ago
dsa / fast-voice-assistant
⚡ Insanely fast AI voice assistant with <500ms response times
☆412 Updated 8 months ago
TEN-framework / ten_framework.bak
The world’s first real-time, distributed, cloud-edge collaborative multimodal AI Agent Framework that simultaneously supports C/C++/Go/Py…
☆5 Updated last month
mcp2everything / mcp2mqtt
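This project converts the MCP protocol to the MQTT protocol, so that powerful large language models (LLMs) can easily control your smart home, robots, or other hardware devices.
☆280 Updated 7 months ago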
lhl / voicechat2
Local SRT/LLM/TTS Voicechat
☆706 Updated 9 months ago
pipecat-ai / rtvi-web-demo
Example UI implementing the RTVI web client
☆477 Updated 8 months ago
realtime-ai / realtime-ai
A real-time Agent framework for audio and video.
☆148 Updated last month
CerebriumAI / examples
Examples for Cerebrium Serverless GPUs
☆508 Updated last week
78 / xiaozhi
Build your own AI friend
☆652 Updated 2 months ago
openai / openai-fm
Code for openai.fm, a demo for the OpenAI Speech API
☆465 Updated 3 months ago
FabrikappAgency / esp32-realtime-voice-assistant
☆72 Updated 8 months ago
78 / mcp-calculator
Xiaozhi MCP sample program
☆169 Updated last month
Standard-Intelligence / hertz-dev
first base model for full-duplex conversational audio
☆1,747 Updated 7 months ago
facebookresearch / spiritlm
Inference code for the paper "Spirit-LM Interleaved Spoken and Written Language Model".
☆916 Updated 9 months ago
chatmcp / mcp-server-chatsum
Query and Summarize your chat messages.
☆1,009 Updated 8 months ago
ictnlp / StreamSpeech
StreamSpeech is an “All in One” seamless model for offline and simultaneous speech recognition, speech translation and speech synthesis.
☆1,127 Updated last month
volcengine / rtc-aigc-demo
RTC AIGC Demo
☆181 Updated 2 weeks ago
espressif / esp-box
The ESP-BOX is a new generation AIoT development platform released by Espressif Systems.
☆1,063 Updated 2 weeks ago
thewh1teagle / kokoro-onnx
TTS with kokoro and onnx runtime
☆2,129 Updated last month
FoloToy / folotoy-doc
All Documents for FoloToys
☆176 Updated 5 months ago
moonshine-ai / moonshine
Fast and accurate automatic speech recognition (ASR) for edge devices
☆2,805 Updated 2 months ago
NullMagic2 / SoftWhisper
SoftWhisper simplifies audio and video transcription using the powerful Whisper model. Easily select custom models, languages, and tasks,…
☆397 Updated this week
Explorerlowi / ESP32_AI_LLM
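This project uses the ESP32 and ESP32-S3 to connect to 15 large models, including ChatGPT, Claude, iFlytek Spark, and Doubao, for voice conversation. It supports voice wake-up, continuous conversation, and music playback, and drives an external display that shows the conversation content in real time.
☆439 Updated 8 months ago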
kyutai-labs / hibiki
Hibiki is a model for streaming speech translation (also known as simultaneous translation). Unlike offline translation—where one waits f…
☆1,255 Updated 3 months ago