Honghe / demo_fastapi_websocket
Demo FastAPI WebSocket Audio
☆40 Updated 4 years ago
Alternatives and similar repositories for demo_fastapi_websocket
Users who are interested in demo_fastapi_websocket are comparing it to the libraries listed below.
- A streaming whisper server for on-prem transcription ☆20 Updated 10 months ago
- Speech to Speech conversation using the OpenAI RealTime API in Python ☆26 Updated 7 months ago
- Ichigo Whisper is a compact (22M parameters), open-source speech tokenizer for the Whisper-medium, designed to enhance performance on mul… ☆15 Updated 5 months ago
- This package is the Python implementation of Deepgram's WebVTT and SRT formatting. Given a transcription, this package can return a valid… ☆20 Updated 8 months ago
- Speaker diarization service ☆23 Updated 2 months ago
- A curated list of awesome voice activity detection ☆57 Updated 7 months ago
- A library for real-time Speech to Text (STT), and Text to Speech (TTS) capability ☆40 Updated last year
- ☆55 Updated 2 years ago
- Speaker diarization model ☆27 Updated 2 years ago
- Video chat apps with computer vision filters built on top of Streamlit ☆50 Updated 2 years ago
- Real-time Voice Activity Detection (VAD) with some example use case like simple voice bot and live transcription (realtime transcription) ☆82 Updated last year
- Real-Time Whisper Voice Recognition with vosk model feedback. ☆112 Updated last year
- A python library to find differences between audio and transcriptions ☆20 Updated last year
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines ☆95 Updated last year
- Mirror of hf.co/pyannote/speaker-diarization-3.1 ☆23 Updated last year
- Zero-shot Audio Classification using Whisper ☆79 Updated 2 years ago
- A lightweight Python library for running TTS models with a unified API. ☆20 Updated 4 months ago
- A curated list of awesome OpenAI's Whisper ☆101 Updated last year
- Whisper finetuned on VinBigdata-VLSP2020-100h + KenLM ☆38 Updated last year
- ☆26 Updated 2 years ago
- A collection of notebooks for the Hugging Face blog series (https://huggingface.co/blog). ☆45 Updated 10 months ago
- Repo for hosting tutorial code associated with the "AssemblyAI and Python in 5 Minutes" blog by AssemblyAI ☆12 Updated last year
- ☆40 Updated 2 months ago
- ☆49 Updated 2 years ago
- Identifying individual speakers in an audio stream based on the unique characteristics found in individual voices using Python ☆18 Updated 2 years ago
- An open-source project that uses cutting-edge NLP models and real-time web search to provide dynamic voice query responses. Features incl… ☆17 Updated last year
- Stable timestamps and confidence score for words of OpenAI's Whisper outputs down to word-level. ☆25 Updated 2 years ago
- Simplified diarization pipeline using some pretrained models - audio file to diarized segments in a few lines of code ☆149 Updated last year
- Live Transcription With Python FastAPI ☆30 Updated 3 years ago
- ☆38 Updated 3 years ago