riteshhere / Speaker_diarizationLinks
Speech Diarization for scrum automation
☆111Updated 2 years ago
Alternatives and similar repositories for Speaker_diarization
Users that are interested in Speaker_diarization are comparing it to the libraries listed below
Sorting:
- Open source inference code for Rev's model☆433Updated 6 months ago
- A lightweight end-to-end text-to-speech model☆123Updated 8 months ago
- ☆166Updated 11 months ago
- Have a natural voice conversation with an LLM☆259Updated last month
- A toolkit for speaker diarization.☆319Updated last month
- Live-Transcription (STT) with Whisper PoC☆200Updated last year
- We Speech Transcript based on LLM, in 300 lines of code.☆177Updated 4 months ago
- ☆174Updated last year
- 🎧 Pod-Helper: Real-time audio transcription and repair on consumer hardware☆76Updated last year
- Voice Transformation for Videos. 🎤👄🎬☆247Updated 4 months ago
- RealSI: Open Benchmark for Simultaneous Interpretation in Real-world Scenarios☆73Updated 4 months ago
- 用文本编辑器剪视频☆37Updated 2 years ago
- Nendo is an open source platform for AI-driven audio management, intelligence, and generation.☆128Updated last year
- ☆466Updated 5 months ago
- Whisper realtime streaming for long speech-to-text transcription and translation☆121Updated last year
- ☆350Updated last year
- Tell a story and get a live feed of images.☆137Updated last year
- ☆33Updated last year
- openai realtime webrtc python client☆46Updated 10 months ago
- Examples for Cerebrium Serverless GPUs☆512Updated 2 weeks ago
- OpenAI API and Whisper based Video Translation☆74Updated 11 months ago
- ⚡️ 80x faster Fasttext language detection out of the box | Split text by language☆254Updated last month
- SenseVoice-python: A enterprise-grade open source multi-language asr system from funasr opensource with onnxruntime☆106Updated last month
- An API to transcribe audio with OpenAI's Whisper Large v3!☆310Updated last year
- Efficient approach to speaker diarization using voice characteristics extraction☆104Updated 4 months ago
- Videos Transcription and Translation with Faster Whisper and ChatGPT☆240Updated last year
- ASR + diarization model server with speculative decoding☆63Updated last year
- Streaming ASR and TTS based on FastAPI+ sherpa-onnx☆160Updated last week
- ☆170Updated last year
- Realtime Audio SDK for the Web — audio capture, echo cancellation (AEC), voice activity detection (VAD), and real-time encoding (Opus/PCM…☆116Updated last month