ai-bot-pro / achatbotLinks
An open source chat bot architecture for voice/vision (and multimodal) assistants, local(CPU/GPU bound) and remote(I/O bound) to run.
☆83Updated last week
Alternatives and similar repositories for achatbot
Users that are interested in achatbot are comparing it to the libraries listed below
Sorting:
- SenseVoice-python: A enterprise-grade open source multi-language asr system from funasr opensource with onnxruntime☆104Updated 3 weeks ago
- Have a natural voice conversation with an LLM☆260Updated 3 weeks ago
- We Speech Transcript based on LLM, in 300 lines of code.☆177Updated 4 months ago
- A WebRTC server that allows you to interact with an LLM using your speech and responds back with generated audio.☆138Updated last year
- ☆461Updated 5 months ago
- ☆234Updated 5 months ago
- ☆204Updated last year
- Streaming ASR and TTS based on FastAPI+ sherpa-onnx☆156Updated 6 months ago
- A lightweight end-to-end text-to-speech model☆123Updated 8 months ago
- Running the F5-TTS by ONNX Runtime☆179Updated last month
- Kyutai with an "eye"☆222Updated 7 months ago
- Real-time processing and delivery of sentences from a continuous stream of characters or text chunks.☆70Updated 3 months ago
- Service for testing out the new Qwen2.5 omni model☆61Updated 6 months ago
- Utilizes ONNX Runtime to transcribe audio into text.☆57Updated last month
- Ming-UniAudio: Speech LLM for Joint Understanding, Generation and Editing with Unified Representation☆280Updated 2 weeks ago
- A text-to-speech and speech-to-text server compatible with the OpenAI API, supporting Whisper, FunASR, Bark, and CosyVoice backends.☆165Updated 3 months ago
- Real time faster whisper gradio☆26Updated 2 months ago
- Speech To Speech: an effort for an open-sourced and modular GPT4-o☆72Updated last year
- ☆174Updated last year
- FastAPI service on top of WhisperX☆141Updated last week
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster.☆68Updated last week
- This is a repository that collects common audio noise reduction models, using Gradio to demonstrate the use of each model, which is very …☆48Updated 10 months ago
- A basic voice agent built with Python agents framework☆50Updated 3 weeks ago
- F5-TTS 推理加速,速度提升约4倍!☆115Updated 9 months ago
- ☆348Updated last year
- A Full-Duplex Open-Domain Dialogue Agent with Continuous Turn-Taking Behavior☆33Updated 2 years ago
- ☆466Updated 5 months ago
- A enterprise-grade Voice Activity Detector from modelscope and funasr.☆114Updated 2 years ago
- ☆280Updated 2 months ago
- Efficient approach to speaker diarization using voice characteristics extraction☆104Updated 4 months ago