ai-bot-pro / achatbot
An open source chat bot architecture for voice/vision (and multimodal) assistants, local(CPU/GPU bound) and remote(I/O bound) to run; if u run achatbot by yourself, u can learn more, star and fork to contribute~
☆20Updated this week
Alternatives and similar repositories for achatbot:
Users that are interested in achatbot are comparing it to the libraries listed below
- A WebRTC server that allows you to interact with an LLM using your speech and responds back with generated audio.☆117Updated 6 months ago
- Bambo is a new proxy framework. Compared with mainstream frameworks, it is more lightweight and flexible and can handle various load task…☆34Updated 3 weeks ago
- SenseVoice-python: A enterprise-grade open source multi-language asr system from funasr opensource with onnxruntime☆77Updated 3 months ago
- A gradio webui for Andrewyng translation-agent☆27Updated last month
- We Speech Transcript based on LLM, in 300 lines of code.☆136Updated this week
- Real time faster whisper gradio☆26Updated 3 months ago
- SadTalker gradio_demo.py file with code section that allows you to set the eye blink and pose reference videos for the software to use wh…☆11Updated last year
- SpeechAgents: Human-Communication Simulation with Multi-Modal Multi-Agent Systems☆78Updated last year
- A lightweight pure C++ Text-to-Speech (TTS) pipeline with OpenVINO, supporting multiple languages.☆30Updated this week
- flow mirror models from JZX AI Labs☆43Updated 3 months ago
- 用于SenseVoice的api项目,输出带时间戳字幕☆18Updated 2 months ago
- Have a natural voice conversation with an LLM☆235Updated last month
- 百聆 是一个类似GPT-4o的语音对话机器人,通过ASR+LLM+TTS实现,时延低至800ms,低配置也可运行,支持打断☆63Updated 3 weeks ago
- An open-source chat text to control actions agentic workflow framework/showcase powered by Agently AI application development framework.☆26Updated 3 months ago
- Speech Diarization for scrum automation☆101Updated last year
- ☆12Updated last month
- Example agents I've built using the LiveKit Agents (https://github.com/livekit/agents) framework☆17Updated 7 months ago
- Realtime Video and Audio Streaming with WebRTC and Gradio☆151Updated this week
- ☆13Updated 10 months ago
- Streaming ASR and TTS based on FastAPI+ sherpa-onnx☆57Updated 3 months ago
- This is a repository that collects common audio noise reduction models, using Gradio to demonstrate the use of each model, which is very …☆30Updated last month
- 🎥➡️📝 Hermes: Blazing-fast video transcription powered by AI gods! Transcribe 6.5 minutes of video in just 1 second using Groq's LPU. Ch…☆68Updated 4 months ago
- 🍳 AyaMCooking is a Voice-to-Voice Mutli-lingual RAG Agent that makes a perfect sous chef for your kitchen, in upto 10 Languages 🤌🧑🍳☆21Updated 2 months ago
- Official implementations for paper: DreamTalk: When Expressive Talking Head Generation Meets Diffusion Probabilistic Models☆14Updated last year
- Tracking the hot Github repos and update daily 每天自动追踪Github热门项目☆44Updated this week
- Agent Studio is an AI agent application designed to handle real-time interactions through phone calls, web-based voice user interfaces (V…☆24Updated 2 months ago
- FastAPI service on top of WhisperX☆61Updated this week
- Efficient approach to speaker diarization using voice characteristics extraction☆75Updated 8 months ago
- The YouTube Text-To-Speech dataset is comprised of waveform audio extracted from YouTube videos alongside their English transcriptions☆51Updated 3 years ago
- ☆76Updated 8 months ago