riteshhere / Speaker_diarizationLinks
Speech Diarization for scrum automation
☆105Updated last year
Alternatives and similar repositories for Speaker_diarization
Users that are interested in Speaker_diarization are comparing it to the libraries listed below
Sorting:
- A toolkit for speaker diarization.☆195Updated 2 weeks ago
- A lightweight end-to-end text-to-speech model☆115Updated 3 months ago
- ☆160Updated 6 months ago
- We Speech Transcript based on LLM, in 300 lines of code.☆162Updated last month
- Open source inference code for Rev's model☆404Updated last month
- ☆32Updated last year
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines☆94Updated last year
- 🎧 Pod-Helper: Real-time audio transcription and repair on consumer hardware☆76Updated last year
- Whisper realtime streaming for long speech-to-text transcription and translation☆116Updated last year
- Nendo is an open source platform for AI-driven audio management, intelligence, and generation.☆121Updated last year
- SenseVoice-python: A enterprise-grade open source multi-language asr system from funasr opensource with onnxruntime☆93Updated 8 months ago
- Voice Transformation for Videos. 🎤👄🎬☆236Updated 7 months ago
- Dynamic Voice Actor Assignment and Emotional Narration for Realistic Story Play☆40Updated 2 months ago
- A real-time AI development framework leveraging WebRTC for audio and video transmission.☆131Updated 3 months ago
- Efficient approach to speaker diarization using voice characteristics extraction☆94Updated last year
- 用文本编辑器剪视频☆37Updated 2 years ago
- Real time faster whisper gradio☆26Updated 7 months ago
- Have a natural voice conversation with an LLM☆250Updated 5 months ago
- Live-Transcription (STT) with Whisper PoC☆183Updated 11 months ago
- RealSI: Open Benchmark for Simultaneous Interpretation in Real-world Scenarios☆54Updated 6 months ago
- ☆173Updated last year
- A Low-Latency, Lightweight and High-Performance Streaming VAD☆411Updated this week
- ☆404Updated 2 weeks ago
- TEaR framework for paper "TEaR: Improving LLM-based Machine Translation with Systematic Self-Refinement"☆44Updated 9 months ago
- ☆329Updated 11 months ago
- This project provides a RESTful API for converting text to speech using Microsoft's Azure Cognitive Services☆95Updated last year
- An JS web client for connecting to Pipecat bots with voice and vision☆44Updated 5 months ago
- FastAPI service on top of WhisperX☆101Updated this week
- g1: Using GPT-4o to create o1-like reasoning chains☆20Updated 8 months ago
- Bambo is a new proxy framework. Compared with mainstream frameworks, it is more lightweight and flexible and can handle various load task…☆35Updated 3 months ago