riteshhere / Speaker_diarization
Speech Diarization for scrum automation
☆103Updated last year
Alternatives and similar repositories for Speaker_diarization:
Users who are interested in Speaker_diarization are comparing it to the libraries listed below:
- A lightweight end-to-end text-to-speech model☆112Updated 2 months ago
- Open source inference code for Rev's model☆402Updated last week
- A toolkit for speaker diarization.☆185Updated this week
- We Speech Transcript based on LLM, in 300 lines of code.☆160Updated last week
- Live-Transcription (STT) with Whisper PoC☆181Updated 10 months ago
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines☆94Updated 11 months ago
- ☆158Updated 5 months ago
- A JS web client for connecting to Pipecat bots with voice and vision☆44Updated 4 months ago
- 🎧 Pod-Helper: Real-time audio transcription and repair on consumer hardware☆77Updated last year
- Efficient approach to speaker diarization using voice characteristics extraction☆92Updated last year
- Nendo is an open source platform for AI-driven audio management, intelligence, and generation.☆121Updated last year
- Whisper realtime streaming for long speech-to-text transcription and translation☆114Updated last year
- Have a natural voice conversation with an LLM☆248Updated 4 months ago
- Real time faster whisper gradio☆26Updated 6 months ago
- RealSI: Open Benchmark for Simultaneous Interpretation in Real-world Scenarios☆53Updated 5 months ago
- FastAPI service on top of WhisperX☆92Updated this week
- ☆32Updated last year
- LlamaVoice is a llama-based large voice generation model, providing inference and training ability.☆232Updated 8 months ago
- Edit videos with a text editor☆37Updated last year
- TEaR framework for paper "TEaR: Improving LLM-based Machine Translation with Systematic Self-Refinement"☆43Updated 8 months ago
- Self hosted high quality voice recognition for de-googled Android using whisper. Like Siri or OK Google.☆63Updated last year
- ☆174Updated last year
- A real-time AI development framework leveraging WebRTC for audio and video transmission.☆131Updated 2 months ago
- SenseVoice-python: An enterprise-grade, open source, multi-language ASR system from funasr with onnxruntime☆90Updated 7 months ago
- web based editor for subtitles and transcripts☆130Updated 8 months ago
- OpenAI API and Whisper based Video Translation☆73Updated 4 months ago
- ASR (Automatic Speech Recognition) for real-time streamed audio powered by Whisper and transformers☆30Updated 4 months ago
- Dynamic Voice Actor Assignment and Emotional Narration for Realistic Story Play☆40Updated last month
- ez audio transcription tool with flexible processing and post-processing options☆149Updated last year
- ☆160Updated this week