DaveDeCaprio / voice-streamLinks

A framework for creating voice based agents. Integrations LLMs with speech recognition and text-to-speech

☆33

Alternatives and similar repositories for voice-stream

Users that are interested in voice-stream are comparing it to the libraries listed below

Sorting:

bentoml / BentoVoiceAgent
Build Phone Calling Voice Agent fully powered by open source models.
☆46Updated 2 months ago
livekit-examples / voice-pipeline-agent-python
A basic voice agent built with Python agents framework
☆49Updated last month
pipecat-ai / web-client-ui
An JS web client for connecting to Pipecat bots with voice and vision
☆45Updated 6 months ago
onepointconsulting / data-questionnaire-agent
Data Questionnaire Agent Chatbot
☆65Updated last month
lalanikarim / webrtc-ai-voice-chat
A WebRTC server that allows you to interact with an LLM using your speech and responds back with generated audio.
☆133Updated last year
tarzain / crosstalk
a simple system for 2-way interruptible voice interactions between human and LLM
☆29Updated last year
ai-bot-pro / achatbot
An open source chat bot architecture for voice/vision (and multimodal) assistants, local(CPU/GPU bound) and remote(I/O bound) to run.
☆55Updated this week
augmentedstartups / Roomey_AI_Voice_Agent
Roomey is a multi-purpose Voice Agent designed to run your personal and business life.
☆27Updated 2 weeks ago
simliai / simli-ai-agent-demo
Simli WebRTC AI Agent demo
☆22Updated 6 months ago
akiani / aidialer
A full stack app for interruptible, low-latency and near-human quality AI phone calls built from stitching LLMs, speech understanding too…
☆137Updated 2 months ago
NidumAI-Inc / agent-studio
Agent Studio is an AI agent application designed to handle real-time interactions through phone calls, web-based voice user interfaces (V…
☆37Updated 7 months ago
Portkey-AI / portkey-python-sdk
Build reliable, secure, and production-ready AI apps easily.
☆73Updated this week
plaggy / fast-whisper-server
ASR + diarization model server with speculative decoding
☆60Updated last year
pipecat-ai / open-sesame
Open Source multi-modal LLM environment. Host your own web and mobile chat interface, powered by real-time bots and voice AI functionalit…
☆43Updated 6 months ago
video-db / videodb-python
VideoDB Python SDK
☆73Updated this week
AIAnytime / On-device-LLM-Inference-using-Mediapipe
On-device LLM Inference using Mediapipe LLM Inference API.
☆21Updated last year
mendableai / gen-ui-firecrawl
☆46Updated last year
bklieger-groq / NotebookLlama-Groq
NotebookLlama powered by Groq - Create podcasts on any topic lightning fast
☆73Updated 8 months ago
truemagic-coder / nemo-agent
Your Python AI Coder!
☆34Updated last month
deepgram-starters / flask-live-transcription
Get started using Deepgram's Live Transcription with this Flask demo app
☆35Updated last week
daily-demos / llm-talk
Talk to GPT-4 and create a story together.
☆90Updated last year
deepgram-devs / deepgram-twilio-streaming-python
a Demo of Deepgram & Twilio that allows multiple client subscribers to watch live transcripts from ongoing Twilio calls.
☆18Updated last year
Anil-matcha / AI-Voice-Agent
Self-hosted AI voice agent
☆109Updated 10 months ago
mallahyari / RealtimeSTT-TTS
A library for real-time Speech to Text (STT), and Text to Speech (TTS) capability
☆40Updated last year
mckaywrigley / realtime-ai-livekit-playground
Play with OpenAI's new Realtime API in your browser
☆25Updated 8 months ago
craigsdennis / genai-phone-call
WIP exploration using Twilio Media Streams and Generative AI
☆40Updated last year
catid / aiwebcam2
Second attempt at AI webcam, this time with OpenAI API
☆39Updated last year
pipecat-ai / pipecat-client-web
Real-Time Voice Inference Web SDK
☆251Updated this week
RetellAI / retell-custom-llm-python-demo
☆63Updated 8 months ago
rooms-solutions / csm-multilingual
Multilingual extension of the SesameAILabs Conversational Speech Generation Model
☆26Updated 3 months ago