gladiaio / gladia-samplesLinks
☆59Updated last month
Alternatives and similar repositories for gladia-samples
Users that are interested in gladia-samples are comparing it to the libraries listed below
Sorting:
- faster-whisper as serverless endpoint☆125Updated 6 months ago
- PlayHT Python SDK - AI Text-to-Speech Streaming & Voice Cloning API☆217Updated 3 months ago
- Cog implementation of transcribing + diarization pipeline with Whisper & Pyannote☆232Updated 9 months ago
- A WebRTC server that allows you to interact with an LLM using your speech and responds back with generated audio.☆139Updated last year
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines☆97Updated last year
- 💬 ASR FastAPI server using faster-whisper and Multi-Scale Auto-Tuning Spectral Clustering for diarization.☆217Updated last year
- Efficient approach to speaker diarization using voice characteristics extraction☆104Updated 5 months ago
- A curated list of awesome OpenAI's Whisper☆98Updated 2 years ago
- streaming speech to text server using Whisper☆96Updated 2 years ago
- Official Python SDK for Deepgram.☆363Updated this week
- A bash script using OpenAI Whisper API for continuous audio transcription with automatic silence detection☆115Updated last year
- This package is the Python implementation of Deepgram's WebVTT and SRT formatting. Given a transcription, this package can return a valid…☆21Updated last year
- Whisper realtime streaming for long speech-to-text transcription and translation☆121Updated last year
- WIP exploration using Twilio Media Streams and Generative AI☆40Updated last year
- Low latency ai companion voice talk in 60 lines of code using faster_whisper and elevenlabs input streaming☆309Updated 5 months ago
- Use ChatGPT over Twilio to create an AI phone agent (works for incoming or outgoing calls).☆116Updated 2 years ago
- Transcription with speaker diarization pipeline☆97Updated 2 years ago
- A full stack app for interruptible, low-latency and near-human quality AI phone calls built from stitching LLMs, speech understanding too…☆155Updated 6 months ago
- Joint speech-language model - respond directly to audio!☆371Updated last year
- Get started using Deepgram's Live Transcription with this Flask demo app☆40Updated last week
- Thin wrapper around OpenAI Whisper API with streaming support☆89Updated 10 months ago
- Real-Time Whisper Voice Recognition with vosk model feedback.☆119Updated 2 years ago
- Transcription and diarization (speaker identification)☆34Updated 2 years ago
- Real-time Voice Activity Detection (VAD) with some example use case like simple voice bot and live transcription (realtime transcription)☆102Updated 3 months ago
- Talk to GPT-4 and create a story together.☆91Updated last year
- ☆36Updated last year
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster.☆68Updated 3 weeks ago
- Demo FastAPI WebSocket Audio☆41Updated 5 years ago
- Example projects built with the Hume AI APIs☆229Updated last week
- A testing repo to share code and thoughts on diarisation☆56Updated last year