ai-bot-pro / achatbotLinks
An open source chat bot architecture for voice/vision (and multimodal) assistants, local(CPU/GPU bound) and remote(I/O bound) to run.
☆55Updated this week
Alternatives and similar repositories for achatbot
Users that are interested in achatbot are comparing it to the libraries listed below
Sorting:
- This project provides a Flask-based API for generating high-quality text-to-speech (TTS) audio using F5-TTS, a flexible and powerful TTS …☆12Updated 3 months ago
- SpeechAgents: Human-Communication Simulation with Multi-Modal Multi-Agent Systems☆82Updated last year
- flow mirror models from JZX AI Labs☆44Updated 8 months ago
- Real time faster whisper gradio☆26Updated 8 months ago
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster.☆62Updated 3 weeks ago
- This is a repository that collects common audio noise reduction models, using Gradio to demonstrate the use of each model, which is very …☆40Updated 6 months ago
- SenseVoice-python: A enterprise-grade open source multi-language asr system from funasr opensource with onnxruntime☆95Updated 9 months ago
- A WebRTC server that allows you to interact with an LLM using your speech and responds back with generated audio.☆132Updated last year
- We Speech Transcript based on LLM, in 300 lines of code.☆164Updated this week
- Service for testing out the new Qwen2.5 omni model☆52Updated last month
- Bambo is a new proxy framework. Compared with mainstream frameworks, it is more lightweight and flexible and can handle various load task…☆35Updated 4 months ago
- Open TTS models, built for streaming on the edge☆43Updated 3 months ago
- Speech To Speech: an effort for an open-sourced and modular GPT4-o☆62Updated 8 months ago
- Multilingual extension of the SesameAILabs Conversational Speech Generation Model☆26Updated 3 months ago
- Dynamic Voice Actor Assignment and Emotional Narration for Realistic Story Play☆40Updated 2 months ago
- Lightweight continuous batching OpenAI compatibility using HuggingFace Transformers include T5 and Whisper.☆24Updated 3 months ago
- 🎙️ Automatically transcribe audio/video into high-quality, speaker-specific Text-To-Speech datasets ✨☆90Updated last month
- A basic voice agent built with Python agents framework☆49Updated last month
- A lightweight, efficient variation of the StyleTTS 2 text‐to‐speech model.☆24Updated last month
- ☆188Updated last month
- Real-time processing and delivery of sentences from a continuous stream of characters or text chunks.☆63Updated last week
- A lightweight end-to-end text-to-speech model☆114Updated 4 months ago
- An JS web client for connecting to Pipecat bots with voice and vision☆45Updated 6 months ago
- ☆200Updated 9 months ago
- mnn asr demo.☆20Updated 3 months ago
- Implementation of Sesame's Conversational Speech Model for Hugging Face Transformers☆56Updated last month
- The official repo of Qwen2-Audio chat & pretrained large audio language model proposed by Alibaba Cloud.☆35Updated 9 months ago
- AI tool for auto-research, TTS, and Graphical assembly into a completed Podcast☆70Updated last week
- 基于通义千问 Qwen2.5-Omni 的实时语音对话系统,使用在线API服务,支持实时语音交互、动态语音活动检测和流式音频处理。A real-time voice conversation system based on Qwen2.5-Omni Online-API, …☆55Updated last month
- FastAPI + Streamlit interface for OpenAI Whisper-large-v3 with youtube-to-mp3☆25Updated last year