ai-bot-pro / achatbot
An open-source chatbot architecture for voice/vision (and multimodal) assistants that can run locally (CPU/GPU bound) or remotely (I/O bound).
☆49Updated this week
Alternatives and similar repositories for achatbot
Users that are interested in achatbot are comparing it to the libraries listed below
- flow mirror models from JZX AI Labs☆45Updated 7 months ago
- A real-time voice conversation system based on Tongyi Qianwen Qwen2.5-Omni that uses the online API service and supports real-time voice interaction, dynamic voice activity detection, and streaming audio processing.☆49Updated this week
- SpeechAgents: Human-Communication Simulation with Multi-Modal Multi-Agent Systems☆81Updated last year
- Speech To Speech: an effort for an open-sourced and modular GPT4-o☆59Updated 7 months ago
- We Speech Transcript based on LLM, in 300 lines of code.☆160Updated 3 weeks ago
- A basic voice agent built with Python agents framework☆44Updated last week
- A WebRTC server that lets you interact with an LLM using speech and responds with generated audio.☆130Updated 10 months ago
- This is a repository that collects common audio noise reduction models, using Gradio to demonstrate the use of each model, which is very …☆37Updated 5 months ago
- SenseVoice-python: An enterprise-grade open-source multi-language ASR system from FunASR, run with onnxruntime☆91Updated 7 months ago
- This project provides a Flask-based API for generating high-quality text-to-speech (TTS) audio using F5-TTS, a flexible and powerful TTS …☆12Updated last month
- Open TTS models, built for streaming on the edge☆41Updated 2 months ago
- Voice agent using LiveKit (orchestration), Cartesia (TTS), OpenAI (LLM), and Deepgram (STT)☆16Updated 4 months ago
- Real time faster whisper gradio☆26Updated 7 months ago
- The official repo of Qwen2-Audio chat & pretrained large audio language model proposed by Alibaba Cloud.☆34Updated 8 months ago
- Speech recognition built on FunASR, with a standard version and an ONNX version (recommended).☆39Updated 7 months ago
- A lightweight end-to-end text-to-speech model☆113Updated 2 months ago
- Dynamic Voice Actor Assignment and Emotional Narration for Realistic Story Play☆40Updated last month
- Bambo is a new proxy framework. Compared with mainstream frameworks, it is more lightweight and flexible and can handle various load task…☆35Updated 3 months ago
- An API project for SenseVoice that outputs subtitles with timestamps☆34Updated 6 months ago
- A minimalistic streamlit chatbot UI to combine and customize tools for langchain llm agents☆13Updated last year
- Added vLLM support to IndexTTS for faster inference.☆87Updated this week
- An open-source agentic workflow framework/showcase that turns chat text into control actions, powered by the Agently AI application development framework.☆27Updated 7 months ago
- ☆195Updated 7 months ago
- Have a natural voice conversation with an LLM☆247Updated 5 months ago
- A JS web client for connecting to Pipecat bots with voice and vision☆44Updated 4 months ago
- Multilingual extension of the SesameAILabs Conversational Speech Generation Model☆25Updated last month
- TinyClick: Single-Turn Agent for Empowering GUI Automation☆33Updated 6 months ago
- xllamacpp - a Python wrapper of llama.cpp☆36Updated this week
- FastAPI service on top of WhisperX☆95Updated this week
- LlamaVoice is a llama-based large voice generation model, providing inference and training ability.☆233Updated 8 months ago