ai-bot-pro / achatbotLinks
An open source chat bot architecture for voice/vision (and multimodal) assistants, local(CPU/GPU bound) and remote(I/O bound) to run.
☆87Updated last week
Alternatives and similar repositories for achatbot
Users that are interested in achatbot are comparing it to the libraries listed below
Sorting:
- Have a natural voice conversation with an LLM☆259Updated 2 months ago
- We Speech Transcript based on LLM, in 300 lines of code.☆181Updated 6 months ago
- SenseVoice-python: A enterprise-grade open source multi-language asr system from funasr opensource with onnxruntime☆108Updated 2 months ago
- ☆204Updated last year
- ☆251Updated 7 months ago
- Running the F5-TTS by ONNX Runtime☆188Updated last month
- Streaming ASR and TTS based on FastAPI+ sherpa-onnx☆177Updated last month
- A WebRTC server that allows you to interact with an LLM using your speech and responds back with generated audio.☆139Updated last year
- ☆482Updated 7 months ago
- Kyutai with an "eye"☆232Updated 9 months ago
- GLM-ASR-Nano: A robust, open-source speech recognition model with 1.5B parameters☆613Updated 2 weeks ago
- ☆472Updated 7 months ago
- Fun-Audio-Chat is a Large Audio Language Model built for natural, low-latency voice interactions.☆297Updated last week
- Fun-ASR is an end-to-end speech recognition large model launched by Tongyi Lab.☆583Updated this week
- Extension of ChatTTS, 3x Faster on Windows, Support Voice Cloning and Mobile Deployment☆174Updated 10 months ago
- A text-to-speech and speech-to-text server compatible with the OpenAI API, supporting Whisper, FunASR, Bark, and CosyVoice backends.☆186Updated last week
- Real time faster whisper gradio☆25Updated 4 months ago
- A lightweight end-to-end text-to-speech model☆125Updated 10 months ago
- Utilizes ONNX Runtime to transcribe audio into text.☆67Updated this week
- Real-time processing and delivery of sentences from a continuous stream of characters or text chunks.☆72Updated 5 months ago
- ☆175Updated 2 years ago
- A basic voice agent built with Python agents framework☆50Updated 2 months ago
- 基于通义千问 Qwen2.5-Omni 的实时语音对话系统,使用在线API服务,支持实时语音交互、动态语音活动检测和流式音频处理。A real-time voice conversation system based on Qwen2.5-Omni Online-API, …☆83Updated 7 months ago
- Open source inference code for Rev's model☆435Updated 8 months ago
- F5-TTS 推理加速,速度提升约4倍!☆121Updated 11 months ago
- The official Python library for the Fish Audio API.☆136Updated this week
- This project provides a Flask-based API for generating high-quality text-to-speech (TTS) audio using F5-TTS, a flexible and powerful TTS …☆14Updated 4 months ago
- flow mirror models from JZX AI Labs☆43Updated last year
- Ming-UniAudio: Speech LLM for Joint Understanding, Generation and Editing with Unified Representation☆413Updated last month
- A collection of optimized utilities for text-to-audio processing, enhancing both training and inference workflows. This repository contai…☆42Updated 8 months ago