DaveDeCaprio / voice-stream
A framework for creating voice based agents. Integrations LLMs with speech recognition and text-to-speech
☆33Updated 10 months ago
Alternatives and similar repositories for voice-stream:
Users that are interested in voice-stream are comparing it to the libraries listed below
- Data Questionnaire Agent Chatbot☆64Updated 3 weeks ago
- A WebRTC server that allows you to interact with an LLM using your speech and responds back with generated audio.☆126Updated 9 months ago
- An JS web client for connecting to Pipecat bots with voice and vision☆43Updated 3 months ago
- Agent Studio is an AI agent application designed to handle real-time interactions through phone calls, web-based voice user interfaces (V…☆30Updated 4 months ago
- Theraxus AI: A modular conversational AI platform ⚙️ blending STT 🎙️, TTS 🗣️, and RAG 📚 for seamless, context-aware dialogues and huma…☆25Updated 4 months ago
- A full stack app for interruptible, low-latency and near-human quality AI phone calls built from stitching LLMs, speech understanding too…☆124Updated 7 months ago
- MeetNote2 - Zoom Auto-Recording & Transcription App☆14Updated 2 months ago
- Whisper realtime streaming for long speech-to-text transcription and translation☆113Updated last year
- Docs for Ultravox☆30Updated last week
- Choose a topic, a music genre and wait for the agents to generate a song☆53Updated 9 months ago
- FastAPI service on top of WhisperX☆77Updated this week
- Build Phone Calling Voice Agent fully powered by open source models.☆31Updated last week
- Serving CrewAI Agent as REST API with BentoML, optionally with self-host open-source LLMs☆16Updated 3 months ago
- WIP exploration using Twilio Media Streams and Generative AI☆39Updated last year
- Talk to GPT-4 and create a story together.☆88Updated last year
- a simple system for 2-way interruptible voice interactions between human and LLM☆23Updated last year
- A library for real-time Speech to Text (STT), and Text to Speech (TTS) capability☆36Updated last year
- A project that brings the power of Large Language Models (LLM) and Retrieval-Augmented Generation (RAG) within reach of everyone, particu…☆34Updated last year
- An open source chat bot architecture for voice/vision (and multimodal) assistants, local(CPU/GPU bound) and remote(I/O bound) to run.☆33Updated this week
- ProfitPilot closes deals for you effortlessly 24/7, just provide a list of customer and ProfitPilot will reach out on your behalf and clo…☆22Updated last year
- ASR + diarization model server with speculative decoding☆59Updated 10 months ago
- A seamless matchmaking application that is programmed with Cohere Command R+, Stanford NLP DSPy framework, Weaviate Vector store and Crew…☆59Updated 11 months ago
- ☆21Updated 11 months ago
- Build reliable, secure, and production-ready AI apps easily.☆70Updated this week
- Conduct consumer interviews with synthetic focus groups using LLMs and LangChain☆43Updated last year
- ☆59Updated last year
- Self-hosted AI voice agent☆94Updated 7 months ago
- VideoDB Python SDK☆64Updated this week
- Joint speech-language model - respond directly to audio!☆30Updated 10 months ago
- ☆61Updated 5 months ago