voxos-ai / streaming-whisper-server
A streaming whisper server for on-prem transcription
☆20Updated 7 months ago
Alternatives and similar repositories for streaming-whisper-server:
Users that are interested in streaming-whisper-server are comparing it to the libraries listed below
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster.☆60Updated last week
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines☆94Updated 10 months ago
- Real-time Voice Activity Detection (VAD) with some example use case like simple voice bot and live transcription (realtime transcription)☆71Updated 9 months ago
- Speech to Speech conversation using the OpenAI RealTime API in Python 🐍☆23Updated 4 months ago
- 🍳 AyaMCooking is a Voice-to-Voice Mutli-lingual RAG Agent that makes a perfect sous chef for your kitchen, in upto 10 Languages 🤌🧑🍳☆21Updated 4 months ago
- ☆155Updated last year
- Joint speech-language model - respond directly to audio!☆30Updated 10 months ago
- This public GitHub repository contains code for a fully self-hosted, on-premise transcription solution.☆51Updated 3 months ago
- 🚀 Scale your RAG pipeline using Ragswift: A scalable centralized embeddings management platform☆37Updated last year
- 🎧 | RunPod worker of the faster-whisper model for Serverless Endpoint.☆89Updated last month
- Whisper realtime streaming for long speech-to-text transcription and translation☆113Updated last year
- A collection of notebooks for the Hugging Face blog series (https://huggingface.co/blog).☆44Updated 7 months ago
- This package is the Python implementation of Deepgram's WebVTT and SRT formatting. Given a transcription, this package can return a valid…☆19Updated 5 months ago
- ☆1Updated 8 months ago
- Speaker diarization service☆21Updated last month
- Speaker Diarization with Transformers☆64Updated 10 months ago
- Efficient approach to speaker diarization using voice characteristics extraction☆92Updated 11 months ago
- FastAPI service on top of WhisperX☆72Updated this week
- ☆38Updated last year
- ☆201Updated 9 months ago
- ☆47Updated last year
- Repository featuring fine-tuning code for various LLMs, complemented by occasional explanations, deep dives.☆39Updated 6 months ago
- Pandas-LLM☆39Updated last year
- A WebRTC server that allows you to interact with an LLM using your speech and responds back with generated audio.☆126Updated 9 months ago
- An JS web client for connecting to Pipecat bots with voice and vision☆43Updated 3 months ago
- Identifying individual speakers in an audio stream based on the unique characteristics found in individual voices using Python☆18Updated last year
- ☆31Updated last year
- Cog wrapper for collabora/WhisperSpeech☆25Updated last year