AbrahamSanders / realtime-chatbot
A Full-Duplex Open-Domain Dialogue Agent with Continuous Turn-Taking Behavior
☆36Updated 2 years ago
Alternatives and similar repositories for realtime-chatbot
Users that are interested in realtime-chatbot are comparing it to the libraries listed below
- An open-source chatbot architecture for voice/vision (and multimodal) assistants that can run locally (CPU/GPU bound) or remotely (I/O bound).☆87Updated last month
- ☆175Updated 2 years ago
- SpeechAgents: Human-Communication Simulation with Multi-Modal Multi-Agent Systems☆85Updated 2 years ago
- A collection of optimized utilities for text-to-audio processing, enhancing both training and inference workflows. This repository contai…☆44Updated 10 months ago
- Awesome TTS☆62Updated 4 years ago
- A WebRTC server that allows you to interact with an LLM using your speech and responds back with generated audio.☆142Updated last year
- LiveKit agent plugins☆36Updated this week
- ASR + diarization model server with speculative decoding☆64Updated last year
- ☆204Updated last year
- ☆258Updated last year
- ☆486Updated 9 months ago
- An open-source LLM based on an MoE structure.☆58Updated last year
- Official implementation of the paper "DreamTalk: When Expressive Talking Head Generation Meets Diffusion Probabilistic Models"☆15Updated 2 years ago
- flow mirror models from JZX AI Labs☆43Updated last year
- Your customized AI assistant - Personal assistants on any hardware! With llama.cpp, whisper.cpp, ggml, LLaMA-v2.☆120Updated 2 years ago
- ☆360Updated last year
- Have a natural voice conversation with an LLM☆262Updated 3 weeks ago
- Multimodal Open Source Framework for Conversational Agent Research and Development.☆22Updated 11 months ago
- LSLM implements full duplex modeling in interactive speech language models, based on research by Ma et al. (2024). This project advances …☆85Updated 7 months ago
- ☆261Updated 8 months ago
- RealSI: Open Benchmark for Simultaneous Interpretation in Real-world Scenarios☆79Updated 7 months ago
- Kyutai with an "eye"☆236Updated 10 months ago
- A library for real-time Speech to Text (STT), and Text to Speech (TTS) capability☆44Updated 2 years ago
- EMNLP 23 - Integrating Whisper Encoder to LLaMA Decoder for Generative ASR Error Correction☆267Updated last year
- A lightweight end-of-utterance detection model fine-tuned on SmolLM2-135M, optimized for Raspberry Pi and low-power devices.☆45Updated 3 months ago
- ☆13Updated last year
- Incredibly descriptive audiovisual summaries for videos☆41Updated last year
- The YouTube Text-To-Speech dataset is comprised of waveform audio extracted from YouTube videos alongside their English transcriptions☆52Updated 4 years ago
- An improved version of Real-time-voice-cloning☆52Updated last year
- A real-time voice conversation system based on Tongyi Qianwen Qwen2.5-Omni, using the online API service, with support for real-time voice interaction, dynamic voice activity detection, and streaming audio processing.☆82Updated 9 months ago