riteshhere / Speaker_diarizationLinks
Speech Diarization for scrum automation
☆111Updated 2 years ago
Alternatives and similar repositories for Speaker_diarization
Users that are interested in Speaker_diarization are comparing it to the libraries listed below
Sorting:
- Open source inference code for Rev's model☆433Updated 7 months ago
- A lightweight end-to-end text-to-speech model☆123Updated 9 months ago
- ☆166Updated last year
- A toolkit for speaker diarization.☆330Updated 2 weeks ago
- Have a natural voice conversation with an LLM☆258Updated last month
- Live-Transcription (STT) with Whisper PoC☆200Updated last year
- We Speech Transcript based on LLM, in 300 lines of code.☆181Updated 5 months ago
- ☆336Updated 8 months ago
- ☆175Updated 2 years ago
- Whisper realtime streaming for long speech-to-text transcription and translation☆121Updated last year
- RealSI: Open Benchmark for Simultaneous Interpretation in Real-world Scenarios☆74Updated 5 months ago
- 🎧 Pod-Helper: Real-time audio transcription and repair on consumer hardware☆77Updated last year
- ☆33Updated last year
- OpenAI API and Whisper based Video Translation☆74Updated 11 months ago
- GPT-4o-level, real-time spoken dialogue system.☆361Updated 10 months ago
- ☆354Updated last year
- Voice Transformation for Videos. 🎤👄🎬☆256Updated 5 months ago
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines☆98Updated last year
- 用文本编辑器剪视频☆37Updated 2 years ago
- openai realtime webrtc python client☆46Updated 11 months ago
- An JS web client for connecting to Pipecat bots with voice and vision☆44Updated 11 months ago
- SenseVoice-python: A enterprise-grade open source multi-language asr system from funasr opensource with onnxruntime☆106Updated last month
- ASR (Automatic Speech Recognition) for real-time streamed audio powered by Whisper and tranformers☆35Updated 11 months ago
- Self hosted high quality voice recognition for de-googled Android using whisper. Like Siri or OK Google.☆67Updated last year
- ☆467Updated 6 months ago
- A simple Google Colab notebook which can translate an original video into multiple languages along with lip sync.☆252Updated 8 months ago
- ONNX-compatible Fast SeamlessM4T—Massively Multilingual & Multimodal Machine Translation☆43Updated 2 years ago
- ☆21Updated last year
- Realtime Audio SDK for the Web — audio capture, echo cancellation (AEC), voice activity detection (VAD), and real-time encoding (Opus/PCM…☆118Updated this week
- A real-time Agent framework for audio and video.☆166Updated this week