voxos-ai / streaming-whisper-server
A streaming whisper server for on-prem transcription
☆17 · Updated 3 months ago
Related projects
Alternatives and complementary repositories for streaming-whisper-server
- Joint speech-language model - respond directly to audio! ☆30 · Updated 6 months ago
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster. ☆45 · Updated 2 weeks ago
- Speaker diarization service ☆19 · Updated this week
- ☆152 · Updated last year
- Build reliable, secure, and production-ready AI apps easily. ☆46 · Updated 2 weeks ago
- Efficient approach to speaker diarization using voice characteristics extraction ☆68 · Updated 6 months ago
- Build Agentic workflows with function calling ☆20 · Updated this week
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines ☆84 · Updated 6 months ago
- Speech To Speech: an effort for an open-sourced and modular GPT4-o ☆23 · Updated last month
- Towards Building Text-To-Speech Systems for the Next Billion Users - Microsoft Research Intern Work - Accepted at ICASSP 2023 ☆47 · Updated last year
- This public GitHub repository contains code for a fully self-hosted, on-premise transcription solution. ☆40 · Updated 3 weeks ago
- ☆37 · Updated last year
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing ⚡ ☆61 · Updated 2 weeks ago
- Demo python script app to interact with llama.cpp server using whisper API, microphone and webcam devices. ☆45 · Updated last year
- ☆23 · Updated 3 weeks ago
- This package is the Python implementation of Deepgram's WebVTT and SRT formatting. Given a transcription, this package can return a valid… ☆18 · Updated last month
- Writing Blog Posts with Generative Feedback Loops! ☆43 · Updated 8 months ago
- ☆31 · Updated 8 months ago
- [WIP] AI Try-On plugin for Chrome ☆25 · Updated 8 months ago
- Public reports detailing responses to sets of prompts by Large Language Models. ☆26 · Updated last year
- ☆18 · Updated this week
- Zero-shot Audio Classification using Whisper ☆74 · Updated last year
- Fun project: LLM powered RAG Discord Bot that works seamlessly on CPU ☆30 · Updated last year
- 💬 ASR FastAPI server using faster-whisper and Multi-Scale Auto-Tuning Spectral Clustering for diarization. ☆196 · Updated 3 weeks ago
- Self-host LLMs with vLLM and BentoML ☆74 · Updated last week
- Cog wrapper for collabora/WhisperSpeech ☆25 · Updated 8 months ago
- A function to do all ☆35 · Updated 7 months ago
- ☆54 · Updated this week
- Visualize multi-model embedding spaces. The first goal is to quickly get a lay of the land of any embedding space. Then be able to scroll… ☆26 · Updated 6 months ago
- Demo FastAPI WebSocket Audio ☆35 · Updated 4 years ago