DaveDeCaprio / voice-streamLinks
A framework for creating voice based agents. Integrations LLMs with speech recognition and text-to-speech
☆34Updated last year
Alternatives and similar repositories for voice-stream
Users that are interested in voice-stream are comparing it to the libraries listed below
Sorting:
- A WebRTC server that allows you to interact with an LLM using your speech and responds back with generated audio.☆141Updated last year
- A basic voice agent built with Python agents framework☆50Updated 4 months ago
- An open source chat bot architecture for voice/vision (and multimodal) assistants, local(CPU/GPU bound) and remote(I/O bound) to run.☆87Updated last month
- Talk to GPT-4 and create a story together.☆91Updated 2 years ago
- Real-Time Voice Inference Web SDK☆300Updated last week
- A full stack app for interruptible, low-latency and near-human quality AI phone calls built from stitching LLMs, speech understanding too…☆159Updated 9 months ago
- An JS web client for connecting to Pipecat bots with voice and vision☆45Updated last year
- ☆175Updated 2 years ago
- Talking head video AI generator☆82Updated 2 years ago
- a simple system for 2-way interruptible voice interactions between human and LLM☆30Updated last year
- Data Questionnaire Agent Chatbot☆71Updated this week
- Self-hosted AI voice agent☆125Updated last year
- A streaming whisper server for on-prem transcription☆23Updated last year
- WIP exploration using Twilio Media Streams and Generative AI☆40Updated 2 years ago
- Real-time processing and delivery of sentences from a continuous stream of characters or text chunks.☆74Updated 6 months ago
- Whisper realtime streaming for long speech-to-text transcription and translation☆121Updated 2 years ago
- VideoDB Python SDK☆87Updated this week
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines☆100Updated last year
- A Flask extension to manage Langchain chat memory and document stores in Flaask apps.☆71Updated 2 years ago
- Use ChatGPT over Twilio to create an AI phone agent (works for incoming or outgoing calls).☆117Updated 2 years ago
- ASR + diarization model server with speculative decoding☆64Updated last year
- The official Cartesia client for Python.☆119Updated this week
- Roomey is a multi-purpose Voice Agent designed to run your personal and business life.☆60Updated 7 months ago
- Daily Client SDK for Python☆64Updated 2 weeks ago
- Speaker diarization service☆26Updated last week
- Agent Studio is an AI agent application designed to handle real-time interactions through phone calls, web-based voice user interfaces (V…☆48Updated last year
- Docs for Ultravox☆43Updated this week
- A library for real-time Speech to Text (STT), and Text to Speech (TTS) capability☆44Updated 2 years ago
- ProfitPilot closes deals for you effortlessly 24/7, just provide a list of customer and ProfitPilot will reach out on your behalf and clo…☆21Updated 2 years ago
- PlayHT Python SDK - AI Text-to-Speech Streaming & Voice Cloning API☆220Updated this week