DaveDeCaprio / voice-stream
A framework for creating voice-based agents. Integrates LLMs with speech recognition and text-to-speech.
☆34Updated last year
Alternatives and similar repositories for voice-stream
Users who are interested in voice-stream are comparing it to the libraries listed below.
- A WebRTC server that allows you to interact with an LLM using your speech and responds back with generated audio.☆141Updated last year
- Data Questionnaire Agent Chatbot☆69Updated last month
- A basic voice agent built with Python agents framework☆50Updated 2 months ago
- ASR + diarization model server with speculative decoding☆63Updated last year
- Talking head video AI generator☆81Updated last year
- A full stack app for interruptible, low-latency and near-human quality AI phone calls built from stitching LLMs, speech understanding too…☆157Updated 7 months ago
- A JS web client for connecting to Pipecat bots with voice and vision☆44Updated 11 months ago
- VideoDB Python SDK☆84Updated this week
- Talk to GPT-4 and create a story together.☆91Updated 2 years ago
- ☆175Updated 2 years ago
- An open source chatbot architecture for voice/vision (and multimodal) assistants that can run locally (CPU/GPU bound) or remotely (I/O bound).☆88Updated last week
- WIP exploration using Twilio Media Streams and Generative AI☆40Updated last year
- Docs for Ultravox☆43Updated this week
- Build a phone-calling voice agent fully powered by open source models.☆62Updated 7 months ago
- Agent Studio is an AI agent application designed to handle real-time interactions through phone calls, web-based voice user interfaces (V…☆48Updated last year
- Whisper realtime streaming for long speech-to-text transcription and translation☆121Updated last year
- Real-Time Voice Inference Web SDK☆292Updated last week
- A simple system for 2-way interruptible voice interactions between human and LLM☆30Updated last year
- ☆36Updated 2 years ago
- AI Lip Syncing application, deployed on Streamlit☆43Updated last year
- A library for real-time Speech to Text (STT) and Text to Speech (TTS) capabilities☆44Updated 2 years ago
- A project that brings the power of Large Language Models (LLM) and Retrieval-Augmented Generation (RAG) within reach of everyone, particu…☆37Updated last year
- Daily Bots Web Demo showcasing how to build real-time voice AI agents☆246Updated 3 months ago
- A lightweight Python library for running TTS models with a unified API.☆21Updated 9 months ago
- ☆37Updated 2 years ago
- Daily Client SDK for Python☆64Updated 3 weeks ago
- 🎥➡️📝 Hermes: Blazing-fast video transcription powered by AI gods! Transcribe 6.5 minutes of video in just 1 second using Groq's LPU. Ch…☆79Updated last year
- All public LiveKit repos as a common repo to make searching and LLM inference easier.☆19Updated last week
- PlayHT Python SDK - AI Text-to-Speech Streaming & Voice Cloning API☆217Updated 3 months ago
- BUD-E (Buddy) is an open-source voice assistant framework that facilitates seamless interaction with AI models and APIs, enabling the cre…☆23Updated last year