TwilioDevEd / mediastreams-consume-websockets-flask
Tutorial for using Twilio Media Streams
★23 · Updated 2 months ago
Alternatives and similar repositories for mediastreams-consume-websockets-flask:
Users interested in mediastreams-consume-websockets-flask are comparing it to the libraries listed below.
- On-device speaker recognition engine powered by deep learning ★32 · Updated 3 weeks ago
- Build your Documentation AI with Nemo Guardrails ★13 · Updated 4 months ago
- Real-time processing and delivery of sentences from a continuous stream of characters or text chunks. ★44 · Updated 3 weeks ago
- WIP exploration using Twilio Media Streams and Generative AI ★39 · Updated last year
- Demo FastAPI WebSocket Audio ★39 · Updated 4 years ago
- Talk to GPT-4 and create a story together. ★88 · Updated last year
- This package is the Python implementation of Deepgram's WebVTT and SRT formatting. Given a transcription, this package can return a valid… ★19 · Updated 5 months ago
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines ★94 · Updated 10 months ago
- A JS web client for connecting to Pipecat bots with voice and vision ★43 · Updated 2 months ago
- RunPod worker of the faster-whisper model for Serverless Endpoint. ★87 · Updated last month
- Runpod WhisperX Docker Container Repo ★13 · Updated last year
- Powered by OpenAI Whisper & Gradio ★30 · Updated 2 years ago
- A WebRTC server that allows you to interact with an LLM using your speech and responds back with generated audio. ★124 · Updated 8 months ago
- Convert your PDFs and EPUBs into audiobooks effortlessly. Features intelligent text extraction, customizable text-to-speech settings, and… ★51 · Updated this week
- ★35 · Updated 5 months ago
- ★99 · Updated last year
- Get started using Deepgram's Live Transcription with this Flask demo app ★29 · Updated this week
- Self hosted high quality voice recognition for de-googled Android using whisper. Like Siri or OK Google. ★60 · Updated last year
- ★12 · Updated last year
- Use Deepgram to transcribe tab (and mic) audio ★21 · Updated 2 years ago
- ASR FastAPI server using faster-whisper and Multi-Scale Auto-Tuning Spectral Clustering for diarization. ★205 · Updated 4 months ago
- Efficient approach to speaker diarization using voice characteristics extraction ★91 · Updated 10 months ago
- A streaming whisper server for on-prem transcription ★20 · Updated 6 months ago
- ★33 · Updated 8 months ago
- An open source chat bot architecture for voice/vision (and multimodal) assistants that runs locally (CPU/GPU bound) or remotely (I/O bound). ★32 · Updated this week
- On-device streaming text-to-speech engine powered by deep learning ★71 · Updated last week
- ★22 · Updated 10 months ago
- Second attempt at AI webcam, this time with OpenAI API ★37 · Updated last year
- RealVoiceGPT is a web application that lets you have voice conversations with ChatGPT. The project uses ElevenLabs AI text to speech to g… ★30 · Updated last year