Honghe / demo_fastapi_websocketLinks
Demo FastAPI WebSocket Audio
☆40Updated 4 years ago
Alternatives and similar repositories for demo_fastapi_websocket
Users that are interested in demo_fastapi_websocket are comparing it to the libraries listed below
Sorting:
- A streaming whisper server for on-prem transcription☆20Updated 9 months ago
- Real-Time Whisper Voice Recognition with vosk model feedback.☆112Updated last year
- Speaker diarization model☆27Updated 2 years ago
- Speech to Speech conversation using the OpenAI RealTime API in Python 🐍☆26Updated 6 months ago
- Real-time Voice Activity Detection (VAD) with some example use case like simple voice bot and live transcription (realtime transcription)☆81Updated last year
- Tunable pipelines☆34Updated 3 months ago
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines☆94Updated last year
- Video chat apps with computer vision filters built on top of Streamlit☆50Updated 2 years ago
- Accelerate Whisper tasks such as transcription, by multiprocesing through parallelization☆25Updated 2 years ago
- A python library to find differences between audio and transcriptions☆20Updated last year
- WebRTC-based real-time audio streaming with Faster Whisper ASR integration for live speech-to-text transcription.☆12Updated 8 months ago
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster.☆62Updated last week
- Simplified diarization pipeline using some pretrained models - audio file to diarized segments in a few lines of code☆149Updated last year
- Zero-shot Audio Classification using Whisper☆79Updated 2 years ago
- Tutorial for using Twilio Media Streams☆24Updated 5 months ago
- Speaker diarization service☆23Updated last month
- faster-whisper as serverless endpoint☆105Updated 2 weeks ago
- ☆55Updated 2 years ago
- Open TTS models, built for streaming on the edge☆43Updated 2 months ago
- Transcription and diarization (speaker identification)☆33Updated 2 years ago
- A curated list of awesome voice activity detection☆55Updated 6 months ago
- A high-throughput and memory-efficient inference and serving engine for Whisper, https://mesolitica.com/blog/vllm-whisper☆27Updated 10 months ago
- 🤗 Huggingface + ⚡ FastAPI = ❤️ Awesomeness☆56Updated 2 years ago
- This package is the Python implementation of Deepgram's WebVTT and SRT formatting. Given a transcription, this package can return a valid…☆20Updated 7 months ago
- Real-time processing and delivery of sentences from a continuous stream of characters or text chunks.☆57Updated last month
- Demo python script app to interact with llama.cpp server using whisper API, microphone and webcam devices.☆45Updated last year
- Official repository for the "Powerset multi-class cross entropy loss for neural speaker diarization" paper published in Interspeech 2023.☆83Updated last year
- Video upload and stream api server created using FastAPI(Python) and SQLModel(SQLAlchemy) ORM☆34Updated 3 years ago
- Copy My Writing is a command-line tool for generating content based on your personal writing style.☆10Updated 11 months ago
- A lightweight end-of-utterance detection model fine-tuned on SmolLM2-135M, optimized for Raspberry Pi and low-power devices.☆21Updated 2 months ago