ai-bot-pro / achatbotLinks
An open source chat bot architecture for voice/vision (and multimodal) assistants, local(CPU/GPU bound) and remote(I/O bound) to run.
☆53Updated this week
Alternatives and similar repositories for achatbot
Users that are interested in achatbot are comparing it to the libraries listed below
Sorting:
- Real time faster whisper gradio☆26Updated 7 months ago
- flow mirror models from JZX AI Labs☆45Updated 8 months ago
- A basic voice agent built with Python agents framework☆45Updated last month
- Speech To Speech: an effort for an open-sourced and modular GPT4-o☆62Updated 7 months ago
- A WebRTC server that allows you to interact with an LLM using your speech and responds back with generated audio.☆132Updated 11 months ago
- SenseVoice-python: A enterprise-grade open source multi-language asr system from funasr opensource with onnxruntime☆94Updated 8 months ago
- SpeechAgents: Human-Communication Simulation with Multi-Modal Multi-Agent Systems☆82Updated last year
- Bambo is a new proxy framework. Compared with mainstream frameworks, it is more lightweight and flexible and can handle various load task…☆35Updated 3 months ago
- 基于通义千问 Qwen2.5-Omni 的实时语音对话系统,使用在线API服务,支持实时语音交互、动态语音活动检测和流式音频处理。A real-time voice conversation system based on Qwen2.5-Omni Online-API, …☆54Updated 3 weeks ago
- We Speech Transcript based on LLM, in 300 lines of code.☆162Updated this week
- SadTalker gradio_demo.py file with code section that allows you to set the eye blink and pose reference videos for the software to use wh…☆11Updated last year
- This project provides a Flask-based API for generating high-quality text-to-speech (TTS) audio using F5-TTS, a flexible and powerful TTS …☆12Updated 2 months ago
- 用于SenseVoice的api项目,输出带时间戳字幕☆34Updated 7 months ago
- Multilingual extension of the SesameAILabs Conversational Speech Generation Model☆26Updated 2 months ago
- ☆31Updated this week
- RAGLight is a lightweight and modular Python library for implementing Retrieval-Augmented Generation (RAG), Agentic RAG and RAT (Retrieva…☆25Updated 2 months ago
- BUD-E (Buddy) is an open-source voice assistant framework that facilitates seamless interaction with AI models and APIs, enabling the cre…☆19Updated 7 months ago
- A Full-Duplex Open-Domain Dialogue Agent with Continuous Turn-Taking Behavior☆31Updated last year
- Open TTS models, built for streaming on the edge☆43Updated 2 months ago
- ☆198Updated 8 months ago
- Example agents I've built using the LiveKit Agents (https://github.com/livekit/agents) framework☆19Updated last year
- An JS web client for connecting to Pipecat bots with voice and vision☆44Updated 5 months ago
- ☆74Updated last year
- A library for real-time Speech to Text (STT), and Text to Speech (TTS) capability☆40Updated last year
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines☆94Updated last year
- A minimalistic streamlit chatbot UI to combine and customize tools for langchain llm agents☆13Updated last year
- Lightweight continuous batching OpenAI compatibility using HuggingFace Transformers include T5 and Whisper.☆23Updated 2 months ago
- Dynamic Voice Actor Assignment and Emotional Narration for Realistic Story Play☆40Updated 2 months ago
- xllamacpp - a Python wrapper of llama.cpp☆40Updated last week
- A enterprise-grade Voice Activity Detector from modelscope and funasr.☆100Updated 2 years ago