riteshhere / Speaker_diarization
Speech Diarization for scrum automation
☆101Updated last year
Alternatives and similar repositories for Speaker_diarization:
Users that are interested in Speaker_diarization are comparing it to the libraries listed below
- A lightweight end-to-end text-to-speech model☆99Updated 3 weeks ago
- Open source inference code for Rev's model☆361Updated this week
- A toolkit for speaker diarization.☆164Updated 2 months ago
- We Speech Transcript based on LLM, in 300 lines of code.☆138Updated this week
- ☆151Updated last month
- Have a natural voice conversation with an LLM☆236Updated last month
- Live-Transcription (STT) with Whisper PoC☆165Updated 6 months ago
- TEaR framework for paper "TEaR: Improving LLM-based Machine Translation with Systematic Self-Refinement"☆44Updated 4 months ago
- Real time faster whisper gradio☆26Updated 3 months ago
- g1: Using GPT-4o to create o1-like reasoning chains☆20Updated 4 months ago
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines☆90Updated 8 months ago
- ez audio transcription tool with flexible processing and post-processing options☆140Updated 11 months ago
- Whisper realtime streaming for long speech-to-text transcription and translation☆110Updated 11 months ago
- Real-time Voice Activity Detection (VAD) with some example use case like simple voice bot and live transcription (realtime transcription)☆61Updated 7 months ago
- Gradio-powered application that converts audio recordings of meetings into transcripts and provides concise summaries using whisper.☆78Updated 3 months ago
- ☆32Updated 11 months ago
- ☆172Updated last year
- Cross Platform Open Sourced Chinese NoteBookLM app based on Electron, Use DeepSeek + Reecho.ai☆49Updated 2 months ago
- FastAPI service on top of WhisperX☆63Updated last week
- 用文本编辑器剪视频☆36Updated last year
- self hosted whisper api system based on container☆61Updated 4 months ago
- ⚡️ 80x faster language detection with Fasttext | Split text by language for TTS☆147Updated last week
- Efficient approach to speaker diarization using voice characteristics extraction☆81Updated 8 months ago
- Nendo is an open source platform for AI-driven audio management, intelligence, and generation.☆119Updated 9 months ago
- Voice Transformation for Videos. 🎤👄🎬☆225Updated 3 months ago
- This project provides a RESTful API for converting text to speech using Microsoft's Azure Cognitive Services☆92Updated 7 months ago
- Realtime Video and Audio Streaming with WebRTC and Gradio☆183Updated this week
- RealSI: Open Benchmark for Simultaneous Interpretation in Real-world Scenarios☆49Updated last month
- Instant voice cloning by MyShell.☆85Updated 6 months ago
- SenseVoice-python: A enterprise-grade open source multi-language asr system from funasr opensource with onnxruntime☆78Updated 3 months ago