DaveDeCaprio / voice-stream
A framework for creating voice-based agents. Integrates LLMs with speech recognition and text-to-speech.
☆33 · Updated last year
Alternatives and similar repositories for voice-stream
Users that are interested in voice-stream are comparing it to the libraries listed below
- A WebRTC server that allows you to interact with an LLM using your speech and responds back with generated audio. ☆136 · Updated last year
- A simple system for 2-way interruptible voice interactions between human and LLM ☆30 · Updated last year
- Data Questionnaire Agent Chatbot ☆69 · Updated 3 months ago
- A JS web client for connecting to Pipecat bots with voice and vision ☆45 · Updated 8 months ago
- Build a phone-calling voice agent fully powered by open-source models. ☆54 · Updated 4 months ago
- A library for real-time Speech to Text (STT) and Text to Speech (TTS) capability ☆43 · Updated last year
- A basic voice agent built with Python agents framework ☆48 · Updated last month
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines ☆97 · Updated last year
- VideoDB Python SDK ☆80 · Updated this week
- A full stack app for interruptible, low-latency and near-human quality AI phone calls built from stitching LLMs, speech understanding too… ☆149 · Updated 4 months ago
- Mission to create a Hebrew TTS model as powerful and user-friendly as WaveNet ☆35 · Updated 8 months ago
- An open source chat bot architecture for voice/vision (and multimodal) assistants, local (CPU/GPU bound) and remote (I/O bound) to run. ☆68 · Updated this week
- Roomey is a multi-purpose Voice Agent designed to run your personal and business life. ☆49 · Updated 2 months ago
- Self-hosted AI voice agent ☆113 · Updated last year
- Real-Time Voice Inference Web SDK ☆280 · Updated last week
- Have a natural voice conversation with an LLM ☆255 · Updated 8 months ago
- Transcribe audio and video files with speaker diarization and logically grouped timestamps using Gemini Flash ☆38 · Updated 2 months ago
- Daily Client SDK for Python ☆63 · Updated last week
- ProfitPilot closes deals for you effortlessly 24/7; just provide a list of customers and ProfitPilot will reach out on your behalf and clo… ☆21 · Updated last year
- Real-time Voice Activity Detection (VAD) with example use cases such as a simple voice bot and live (real-time) transcription ☆95 · Updated 2 weeks ago
- Whisper realtime streaming for long speech-to-text transcription and translation ☆121 · Updated last year
- Babylon.cpp is a C and C++ library for grapheme to phoneme conversion and text to speech synthesis. For phonemization an ONNX runtime port… ☆23 · Updated this week
- Talk to GPT-4 and create a story together. ☆91 · Updated last year
- Docs for Ultravox ☆42 · Updated last week
- A streaming whisper server for on-prem transcription ☆21 · Updated last year
- A simple client and utils for interacting with OpenAI's Realtime API in Python ☆239 · Updated 3 months ago
- WebRTC-based real-time audio streaming with Faster Whisper ASR integration for live speech-to-text transcription. ☆12 · Updated 11 months ago
- ASR + diarization model server with speculative decoding ☆62 · Updated last year