ai-bot-pro / achatbot
An open source chat bot architecture for voice/vision (and multimodal) assistants, local(CPU/GPU bound) and remote(I/O bound) to run.
☆33Updated this week
Alternatives and similar repositories for achatbot:
Users that are interested in achatbot are comparing it to the libraries listed below
- Real time faster whisper gradio☆26Updated 5 months ago
- Speech To Speech: an effort for an open-sourced and modular GPT4-o☆51Updated 5 months ago
- A WebRTC server that allows you to interact with an LLM using your speech and responds back with generated audio.☆126Updated 9 months ago
- Have a natural voice conversation with an LLM☆246Updated 3 months ago
- An JS web client for connecting to Pipecat bots with voice and vision☆44Updated 3 months ago
- A lightweight end-to-end text-to-speech model☆111Updated last month
- Open Sourced NoteBookLM☆58Updated 6 months ago
- SpeechAgents: Human-Communication Simulation with Multi-Modal Multi-Agent Systems☆81Updated last year
- SenseVoice-python: A enterprise-grade open source multi-language asr system from funasr opensource with onnxruntime☆86Updated 6 months ago
- An agentic workflow for story book generation☆29Updated 2 weeks ago
- 🎥➡️📝 Hermes: Blazing-fast video transcription powered by AI gods! Transcribe 6.5 minutes of video in just 1 second using Groq's LPU. Ch…☆77Updated 7 months ago
- BUD-E (Buddy) is an open-source voice assistant framework that facilitates seamless interaction with AI models and APIs, enabling the cre…☆18Updated 5 months ago
- A basic voice agent built with Python agents framework☆34Updated 2 weeks ago
- SadTalker gradio_demo.py file with code section that allows you to set the eye blink and pose reference videos for the software to use wh…☆11Updated last year
- Jina DeepSearch UI☆85Updated this week
- We Speech Transcript based on LLM, in 300 lines of code.☆156Updated last month
- Bambo is a new proxy framework. Compared with mainstream frameworks, it is more lightweight and flexible and can handle various load task…☆35Updated last month
- Deepseek R1 Agent powered by LMStudio and Smolagents☆30Updated 2 months ago
- Streaming ASR and TTS based on FastAPI+ sherpa-onnx☆89Updated 6 months ago
- ☆173Updated last year
- flow mirror models from JZX AI Labs☆43Updated 6 months ago
- Generate video stories with AI ✨☆32Updated 7 months ago
- Simulates talk with an AI that can express emotions☆63Updated 8 months ago
- An open-source chat text to control actions agentic workflow framework/showcase powered by Agently AI application development framework.☆27Updated 6 months ago
- Agent Studio is an AI agent application designed to handle real-time interactions through phone calls, web-based voice user interfaces (V…☆30Updated 4 months ago
- FastAPI + Streamlit interface for OpenAI Whisper-large-v3 with youtube-to-mp3☆23Updated last year
- Local11Labs allows generating high-quality text-to-speech and podcast content using the fast and tiny Kokoro-82M.☆46Updated 2 months ago
- Cross Platform Open Sourced Chinese NoteBookLM app based on Electron, Use DeepSeek + Reecho.ai☆68Updated 4 months ago
- Agentic RAG to help you build a startup🚀☆16Updated 3 weeks ago
- A Full-Duplex Open-Domain Dialogue Agent with Continuous Turn-Taking Behavior☆28Updated last year