riteshhere / Speaker_diarization
Speech Diarization for scrum automation
☆108 Updated last year
Alternatives and similar repositories for Speaker_diarization
Users who are interested in Speaker_diarization compare it to the repositories listed below
- Open source inference code for Rev's model ☆411 Updated 2 months ago
- A toolkit for speaker diarization. ☆224 Updated 2 weeks ago
- A lightweight end-to-end text-to-speech model ☆115 Updated 4 months ago
- Have a natural voice conversation with an LLM ☆250 Updated 7 months ago
- ☆165 Updated 7 months ago
- We Speech Transcript based on LLM, in 300 lines of code. ☆170 Updated 3 weeks ago
- RealSI: Open Benchmark for Simultaneous Interpretation in Real-world Scenarios ☆56 Updated last week
- ☆326 Updated 3 months ago
- OpenAI API and Whisper based Video Translation ☆73 Updated 7 months ago
- Nendo is an open source platform for AI-driven audio management, intelligence, and generation. ☆125 Updated last year
- Live-Transcription (STT) with Whisper PoC ☆188 Updated last year
- ☆32 Updated last year
- 🎧 Pod-Helper: Real-time audio transcription and repair on consumer hardware ☆77 Updated last year
- Voice Transformation for Videos. 🎤👄🎬 ☆240 Updated 3 weeks ago
- A real-time Agent framework for audio and video. ☆137 Updated 3 weeks ago
- ☆175 Updated last year
- ☆439 Updated last month
- This project provides a RESTful API for converting text to speech using Microsoft's Azure Cognitive Services ☆96 Updated last year
- ☆171 Updated 10 months ago
- Videos Transcription and Translation with Faster Whisper and ChatGPT ☆242 Updated last year
- A simple Google Colab notebook which can translate an original video into multiple languages along with lip sync. ☆249 Updated 4 months ago
- Tell a story and get a live feed of images. ☆137 Updated last year
- ☆336 Updated last year
- Real time faster whisper gradio ☆26 Updated 9 months ago
- TEaR framework for paper "TEaR: Improving LLM-based Machine Translation with Systematic Self-Refinement" ☆44 Updated 10 months ago
- Instant voice cloning by MyShell. ☆89 Updated 11 months ago
- Dynamic Voice Actor Assignment and Emotional Narration for Realistic Story Play ☆40 Updated 3 months ago
- Faster Whisper transcription with CTranslate2 ☆85 Updated last year
- openai realtime webrtc python client ☆42 Updated 6 months ago
- GPT-4o-level, real-time spoken dialogue system. ☆343 Updated 5 months ago