An open-source chatbot architecture for voice, vision, and multimodal assistants that can run locally (CPU/GPU bound) or remotely (I/O bound).
☆87Dec 28, 2025Updated 2 months ago
Alternatives and similar repositories for achatbot
Users that are interested in achatbot are comparing it to the libraries listed below
- This demo showcases different approaches to handling the delay during RAG (Retrieval-Augmented Generation) lookups in a voice-enabled AI …☆20Jan 23, 2025Updated last year
- ☆23Oct 30, 2024Updated last year
- Generate music videos starring yourself.☆11Apr 3, 2025Updated 11 months ago
- Personal AI assistant platform, rewritten in Rust from OpenClaw☆34Feb 28, 2026Updated last week
- ☆24Oct 8, 2021Updated 4 years ago
- Launch your speech synthesis within one minute.☆12May 6, 2024Updated last year
- This project implements a Hybrid Retrieval-Augmented Generation (RAG) approach with knowledge graphs for interacting with PDF documents. …☆13Oct 12, 2025Updated 4 months ago
- This project provides a Flask-based API for generating high-quality text-to-speech (TTS) audio using F5-TTS, a flexible and powerful TTS …☆14Aug 8, 2025Updated 6 months ago
- Textless Speech-to-Music Retrieval Using Emotion Similarity [ICASSP23]☆17Aug 16, 2023Updated 2 years ago
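- Ultra-fast inference for wav2lip, using wav2lip, gfpgan, yolov5, and other models with RT acceleration. Tested at 0.03 seconds per frame on a 2070 GPU, which is fast enough for real-time inference.☆31Sep 23, 2025Updated 5 months ago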
- A framework for creating voice-based agents. Integrates LLMs with speech recognition and text-to-speech☆34May 1, 2024Updated last year
- ☆16Updated this week
- The UnisonAI Multi-Agent Framework, built on a custom workflow that allows AI agents to talk together and provides a flexible and extensibl…☆23Feb 24, 2026Updated last week
- LiveKit + Next.js AI voice agent interface☆16Feb 21, 2025Updated last year
- Daily Bots Web Demo showcasing how to build real-time voice AI agents☆248Sep 10, 2025Updated 5 months ago
- ☆30Oct 21, 2025Updated 4 months ago
- ☆13Mar 7, 2024Updated 2 years ago
- Audio tools for Python☆16Dec 5, 2025Updated 3 months ago
- [WACV 2026] LASER: Lip Landmark Assisted Speaker Detection for Robustness, official implementation☆22Feb 26, 2026Updated last week
- Adapting Vercel's AI chatbot to use LiveKit as the transport☆20Mar 26, 2025Updated 11 months ago
- PyTorch reimplementation of audio-driven face mesh or blendshape models, including Audio2Mesh, VOCA, etc.☆17Sep 6, 2024Updated last year
- Minimal ecommerce store built with Next.js, inspired by yeezy.com.☆22Dec 30, 2024Updated last year
- A DINet-based inference service for video streams and videos☆17Nov 8, 2023Updated 2 years ago
- audiolm-pytorch training code☆15Jul 31, 2023Updated 2 years ago
- ☆38Nov 10, 2024Updated last year
- Heteronym to Phoneme Parser☆19Nov 4, 2023Updated 2 years ago
- An open-source project that uses cutting-edge NLP models and real-time web search to provide dynamic voice query responses. Features incl…☆20May 24, 2024Updated last year
- Splitting the ASR probability distribution results into Chinese pinyin, so as to extract more effective features for the Chinese speech…☆21Mar 16, 2023Updated 2 years ago
- Chinese and English bilingual G2P☆22Jul 16, 2023Updated 2 years ago
- Multivoice: Enhance your foreign-language movie and TV show experience with personalized dubbed versions. Our project uses voice cloning …☆27Aug 1, 2023Updated 2 years ago
- Utilizes ONNX Runtime for TTS model.☆50Feb 21, 2026Updated 2 weeks ago
- ☆20Oct 24, 2025Updated 4 months ago
- A WebRTC server that allows you to interact with an LLM using your speech and responds back with generated audio.☆143Jun 18, 2024Updated last year
- [ICIAP 2023] Learning Landmarks Motion from Speech for Speaker-Agnostic 3D Talking Heads Generation☆62Dec 12, 2023Updated 2 years ago
- Generating dance steps for given music with deep learning☆21Dec 21, 2018Updated 7 years ago
- Sky LiveKit Agent Perplexica is a local, free solution integrating LiveKit with advanced internet search. It uses a local Perplexica inst…☆28Feb 6, 2025Updated last year
- A sample repo showing how to extend Vapi functionality and deploy it on Vercel Edge Functions.☆24Jul 8, 2024Updated last year
- [EMNLP 2024] Ask-before-Plan: Proactive Language Agents for Real-World Planning☆21Jul 28, 2025Updated 7 months ago
- Experimental JSON to ffmpeg filter complex converter☆62Aug 1, 2024Updated last year