riteshhere / Speaker_diarizationLinks
Speech Diarization for scrum automation
☆111Updated 2 years ago
Alternatives and similar repositories for Speaker_diarization
Users that are interested in Speaker_diarization are comparing it to the libraries listed below
Sorting:
- Open source inference code for Rev's model☆422Updated 4 months ago
- Have a natural voice conversation with an LLM☆253Updated 8 months ago
- A lightweight end-to-end text-to-speech model☆117Updated 6 months ago
- Live-Transcription (STT) with Whisper PoC☆190Updated last year
- A toolkit for speaker diarization.☆261Updated last week
- ☆166Updated 8 months ago
- FastAPI service on top of WhisperX☆121Updated last week
- ☆328Updated 5 months ago
- Whisper realtime streaming for long speech-to-text transcription and translation☆121Updated last year
- Nendo is an open source platform for AI-driven audio management, intelligence, and generation.☆126Updated last year
- ☆33Updated last year
- ☆175Updated last year
- 🎧 Pod-Helper: Real-time audio transcription and repair on consumer hardware☆77Updated last year
- We Speech Transcript based on LLM, in 300 lines of code.☆176Updated 2 months ago
- Voice Transformation for Videos. 🎤👄🎬☆242Updated 2 months ago
- Examples for Cerebrium Serverless GPUs☆509Updated last week
- OpenAI API and Whisper based Video Translation☆74Updated 8 months ago
- An JS web client for connecting to Pipecat bots with voice and vision☆45Updated 8 months ago
- Self hosted high quality voice recognition for de-googled Android using whisper. Like Siri or OK Google.☆64Updated last year
- RealSI: Open Benchmark for Simultaneous Interpretation in Real-world Scenarios☆66Updated last month
- 用文本编辑器剪视频☆37Updated 2 years ago
- Gradio-powered application that converts audio recordings of meetings into transcripts and provides concise summaries using whisper.☆125Updated 10 months ago
- Videos Transcription and Translation with Faster Whisper and ChatGPT☆242Updated last year
- ☆170Updated last year
- Tell a story and get a live feed of images.☆138Updated last year
- An API to transcribe audio with OpenAI's Whisper Large v3!☆296Updated 9 months ago
- A real-time Agent framework for audio and video.☆149Updated 2 months ago
- ☆338Updated last year
- Real-time Voice Activity Detection (VAD) with some example use case like simple voice bot and live transcription (realtime transcription)☆93Updated last year
- g1: Using GPT-4o to create o1-like reasoning chains☆20Updated 11 months ago