78 / xiaozhi-esp32
Build your own AI friend
☆10,068 Updated this week
Alternatives and similar repositories for xiaozhi-esp32:
Users that are interested in xiaozhi-esp32 are comparing it to the libraries listed below
- Backend service for xiaozhi-esp32 that helps you quickly build an ESP32 device control server. ☆2,859 Updated this week
- A Python version of Xiaozhi AI, mainly for people who have no hardware but want to try Xiaozhi's features. ☆599 Updated this week
- Build your own AI friend ☆499 Updated 2 months ago
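- Bailing (百聆) is a GPT-4o-like voice conversation robot built from ASR + LLM + TTS. It integrates strong large models such as DeepSeek R1, has latency as low as 800 ms, runs on low-spec machines such as Macs, and supports interruption. ☆997 Updated 2 weeks ago
- 🚀🚀 Train a 26M-parameter GPT completely from scratch in just 2h! 🌏 ☆17,134 Updated last month
- An Android voice conversation app based on xiaozhi-server, supporting real-time voice interaction and text chat. Development is now fully focused on a Flutter version covering both iOS and Android. Please give it a star as encouragement. ☆294 Updated this week
- A simple local web interface that uses ChatTTS to synthesize text into speech and also exposes an API for external use. ☆6,893 Updated 3 months ago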
- A generative speech model for daily dialogue. ☆35,409 Updated 2 weeks ago
- This project connects an esp32 or esp32s3 to 15 large models, including ChatGPT, Claude, iFlytek Spark, and Doubao, for voice chat. It supports voice wake-up, continuous conversation, and music playback, and drives an external display that shows the conversation in real time. ☆378 Updated 3 months ago
- The simplest and lowest-cost AI integration solution. If you like this project, please give it a Star~ ☆942 Updated 2 weeks ago
- 🍒 Cherry Studio is a desktop client that supports multiple LLM providers. Supports deepseek-r1. ☆21,019 Updated this week
- Netflix-level subtitle cutting, translation, alignment, and even dubbing: a one-click, fully automated AI video subtitle team ☆12,079 Updated last week
- ☆202 Updated last month
- Use LLMs to dig out what you care about from massive amounts of information and a variety of sources daily. ☆7,210 Updated this week
- esp32 based device, mainly used for voice chat with large language models ☆752 Updated last year
- Elegant reading of real-time and hottest news ☆5,465 Updated last week
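- PDF scientific paper translation with preserved formats: AI-based full-text bilingual translation of PDF documents with the layout fully preserved, supporting services such as Google/DeepL/Ollama/OpenAI, with CLI/GUI/Docker/… ☆19,414 Updated this week
- ESP32C3 AI conversation mini speaker, "Hello Xiaozhi" (你好小智) ☆172 Updated 3 months ago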
- Multilingual Voice Understanding Model ☆5,121 Updated last week
- Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability. ☆12,413 Updated this week
- ☆4,087 Updated 2 weeks ago
- Desk-Emoji is a truly open-source AI desktop robot featuring an emoji screen, a two-axis console, and LLM capabilities for voice chat. ☆396 Updated 3 months ago
- Speech-to-text, text-to-speech, speaker diarization, speech enhancement, and VAD using next-gen Kaldi with onnxruntime without Internet c… ☆5,357 Updated last week
- Generate high-definition short videos with one click using large AI models. ☆25,776 Updated last week
- SOTA Open Source TTS ☆20,268 Updated last week
- A simple screen parsing tool towards pure vision based GUI agent ☆21,127 Updated this week
- A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity… ☆9,169 Updated this week
- The world's smallest desktop two-wheeled-legged robot! ☆1,314 Updated 3 months ago
- 💬 Ready-to-use & flexible RAG Chatbot, supporting mainstream large language models (LLMs) such as DeepSeek-R1, Llama 3.3, Qwen2, OpenAI … ☆15,031 Updated this week
- Like Manus, Computer Use Agent (CUA) and Omniparser, we are computer-using agents. AI-driven local automation assistant that uses natural l… ☆2,807 Updated this week