riteshhere / Speaker_diarizationLinks
Speech Diarization for scrum automation
☆111Updated 2 years ago
Alternatives and similar repositories for Speaker_diarization
Users that are interested in Speaker_diarization are comparing it to the libraries listed below
Sorting:
- Open source inference code for Rev's model☆435Updated 9 months ago
- A lightweight end-to-end text-to-speech model☆126Updated 11 months ago
- ☆167Updated last year
- Have a natural voice conversation with an LLM☆262Updated 3 weeks ago
- Live-Transcription (STT) with Whisper PoC☆202Updated last year
- A toolkit for speaker diarization.☆392Updated this week
- We Speech Transcript based on LLM, in 300 lines of code.☆183Updated 7 months ago
- ☆175Updated 2 years ago
- Voice Transformation for Videos. 🎤👄🎬☆258Updated 7 months ago
- ☆341Updated 10 months ago
- ☆34Updated 2 years ago
- OpenAI API and Whisper based Video Translation☆74Updated last year
- RealSI: Open Benchmark for Simultaneous Interpretation in Real-world Scenarios☆79Updated 7 months ago
- 🎧 Pod-Helper: Real-time audio transcription and repair on consumer hardware☆77Updated last year
- openai realtime webrtc python client☆47Updated last year
- 用文本编辑器剪视频☆37Updated 2 years ago
- Whisper realtime streaming for long speech-to-text transcription and translation☆121Updated 2 years ago
- Real time faster whisper gradio☆25Updated 5 months ago
- TEaR framework for paper "TEaR: Improving LLM-based Machine Translation with Systematic Self-Refinement"☆50Updated last year
- ASR (Automatic Speech Recognition) for real-time streamed audio powered by Whisper and tranformers☆35Updated last year
- ☆360Updated last year
- ASR + diarization model server with speculative decoding☆64Updated last year
- A real-time Agent framework for audio and video.☆169Updated last month
- This repository provides a Docker image for CosyVoice☆27Updated last year
- Videos Transcription and Translation with Faster Whisper and ChatGPT☆242Updated last year
- Conversational Retrieval Evaluation Dataset☆101Updated 5 months ago
- Realtime Audio SDK for the Web — audio capture, echo cancellation (AEC), voice activity detection (VAD), and real-time encoding (Opus/PCM…☆123Updated 2 months ago
- SenseVoice-python: A enterprise-grade open source multi-language asr system from funasr opensource with onnxruntime☆109Updated 4 months ago
- Nendo is an open source platform for AI-driven audio management, intelligence, and generation.☆129Updated last year
- Effortlessly record, transcribe, and summarize meetings with this user-friendly desktop utility powered by OpenAI's Whisper and GPT-3.5-t…☆187Updated 2 years ago