ai-bot-pro / achatbotLinks

An open source chat bot architecture for voice/vision (and multimodal) assistants, local(CPU/GPU bound) and remote(I/O bound) to run.

☆64

Alternatives and similar repositories for achatbot

Users that are interested in achatbot are comparing it to the libraries listed below

Sorting:

Finity-Alpha / OpenVoiceChat
Have a natural voice conversation with an LLM
☆251Updated 7 months ago
lovemefan / SenseVoice-python
SenseVoice-python: A enterprise-grade open source multi-language asr system from funasr opensource with onnxruntime
☆97Updated 10 months ago
wenet-e2e / wesr
We Speech Transcript based on LLM, in 300 lines of code.
☆174Updated last month
mush42 / optispeech
A lightweight end-to-end text-to-speech model
☆117Updated 5 months ago
lalanikarim / webrtc-ai-voice-chat
A WebRTC server that allows you to interact with an LLM using your speech and responds back with generated audio.
☆135Updated last year
maitrix-org / Voila
☆429Updated 2 months ago
eustlb / speech-to-speech
Speech To Speech: an effort for an open-sourced and modular GPT4-o
☆66Updated 9 months ago
xinchen-ai / Westlake-Omni
☆201Updated 10 months ago
ictnlp / LLaMA-Omni2
☆207Updated 2 months ago
fishaudio / fish-audio-python
☆123Updated 2 months ago
Ninot1Quyi / Qwen2.5-Omni-multimodal-chat
基于通义千问 Qwen2.5-Omni 的实时语音对话系统，使用在线API服务，支持实时语音交互、动态语音活动检测和流式音频处理。A real-time voice conversation system based on Qwen2.5-Omni Online-API, …
☆60Updated 2 months ago
KoljaB / stream2sentence
Real-time processing and delivery of sentences from a continuous stream of characters or text chunks.
☆68Updated 3 weeks ago
liu-qingyuan / faster_whisper_gradio
Real time faster whisper gradio
☆26Updated 9 months ago
ruzhila / voiceapi
Streaming ASR and TTS based on FastAPI+ sherpa-onnx
☆134Updated 3 months ago
DakeQQ / F5-TTS-ONNX
Running the F5-TTS by ONNX Runtime
☆170Updated this week
KoljaB / WhoSpeaks
Efficient approach to speaker diarization using voice characteristics extraction
☆97Updated last month
gpustack / vox-box
A text-to-speech and speech-to-text server compatible with the OpenAI API, supporting Whisper, FunASR, Bark, and CosyVoice backends.
☆146Updated 2 weeks ago
pavelzbornik / whisperX-FastAPI
FastAPI service on top of WhisperX
☆120Updated this week
livekit-examples / voice-pipeline-agent-python
A basic voice agent built with Python agents framework
☆50Updated 2 weeks ago
warmshao / ChatTTSPlus
Extension of ChatTTS, 3x Faster on Windows, Support Voice Cloning and Mobile Deployment
☆169Updated 5 months ago
0nutation / SpeechAgents
SpeechAgents: Human-Communication Simulation with Multi-Modal Multi-Agent Systems
☆82Updated last year
coqui-ai / xtts-streaming-server
☆336Updated last year
jingzhunxue / flow_mirror
flow mirror models from JZX AI Labs
☆44Updated 10 months ago
WeberJulian / AI-voice-chat
☆175Updated last year
kyutai-labs / moshivis
Kyutai with an "eye"
☆212Updated 4 months ago
WGS-note / F5_TTS_Faster
F5-TTS 推理加速，速度提升约4倍！
☆102Updated 6 months ago
jianchang512 / sense-api
用于SenseVoice的api项目，输出带时间戳字幕
☆38Updated 9 months ago
xinliu9451 / awesome-denoiser
This is a repository that collects common audio noise reduction models, using Gradio to demonstrate the use of each model, which is very …
☆40Updated 8 months ago
mallahyari / RealtimeSTT-TTS
A library for real-time Speech to Text (STT), and Text to Speech (TTS) capability
☆41Updated last year
byteresearchcla / RealSI
RealSI: Open Benchmark for Simultaneous Interpretation in Real-world Scenarios
☆62Updated last month