riteshhere / Speaker_diarizationLinks
Speech Diarization for scrum automation
☆111Updated 2 years ago
Alternatives and similar repositories for Speaker_diarization
Users that are interested in Speaker_diarization are comparing it to the libraries listed below
Sorting:
- A lightweight end-to-end text-to-speech model☆120Updated 7 months ago
- Open source inference code for Rev's model☆429Updated 5 months ago
- ☆166Updated 10 months ago
- Have a natural voice conversation with an LLM☆255Updated this week
- A toolkit for speaker diarization.☆303Updated last month
- We Speech Transcript based on LLM, in 300 lines of code.☆176Updated 3 months ago
- Live-Transcription (STT) with Whisper PoC☆196Updated last year
- RealSI: Open Benchmark for Simultaneous Interpretation in Real-world Scenarios☆67Updated 3 months ago
- OpenAI API and Whisper based Video Translation☆74Updated 9 months ago
- ☆175Updated last year
- 🎧 Pod-Helper: Real-time audio transcription and repair on consumer hardware☆77Updated last year
- ☆33Updated last year
- A real-time Agent framework for audio and video.☆150Updated 3 months ago
- g1: Using GPT-4o to create o1-like reasoning chains☆20Updated last year
- Dynamic Voice Actor Assignment and Emotional Narration for Realistic Story Play☆40Updated 6 months ago
- Whisper realtime streaming for long speech-to-text transcription and translation☆121Updated last year
- Nendo is an open source platform for AI-driven audio management, intelligence, and generation.☆128Updated last year
- openai realtime webrtc python client☆45Updated 9 months ago
- Voice Transformation for Videos. 🎤👄🎬☆243Updated 3 months ago
- Real time faster whisper gradio☆26Updated last month
- SenseVoice-python: A enterprise-grade open source multi-language asr system from funasr opensource with onnxruntime☆101Updated last year
- GPT-4o-level, real-time spoken dialogue system.☆356Updated 8 months ago
- ☆170Updated last year
- Gradio-powered application that converts audio recordings of meetings into transcripts and provides concise summaries using whisper.☆129Updated last month
- ☆344Updated last year
- 用文本编辑器剪视频☆37Updated 2 years ago
- ☆20Updated 10 months ago
- An open source chat bot architecture for voice/vision (and multimodal) assistants, local(CPU/GPU bound) and remote(I/O bound) to run.☆75Updated last week
- ASR + diarization model server with speculative decoding☆62Updated last year
- This project provides a RESTful API for converting text to speech using Microsoft's Azure Cognitive Services☆96Updated last year