KoljaB / WhoSpeaks
Efficient approach to speaker diarization using voice characteristics extraction
☆106 · Jun 17, 2025 · Updated 8 months ago
Alternatives and similar repositories for WhoSpeaks
Users who are interested in WhoSpeaks are comparing it to the libraries listed below.
- AI at your fingertips: powerful CLI tools for speech, text, and language processing ☆22 · Sep 2, 2024 · Updated last year
- Command Your World with Voice ☆801 · Jun 17, 2025 · Updated 8 months ago
- Experimental implementation of regions in WebVTT building on Anne's WebVTT parser. ☆14 · Oct 19, 2014 · Updated 11 years ago
- Transcribe desktop audio/computer audio in real-time and locally (Streaming ASR), using TorchAudio and Emformer-RNNT model for inference,… ☆14 · May 7, 2024 · Updated last year
- Open source framework for voice and multimodal conversational AI ☆32 · Jan 13, 2025 · Updated last year
- A python package to build AI-powered real-time audio applications ☆1,931 · Feb 12, 2025 · Updated last year
- Simple PyTorch Denoisers for Waveform Audio ☆40 · Dec 23, 2025 · Updated last month
- [Colab Demo Code] OneFormer: One Transformer to Rule Universal Image Segmentation. ☆14 · May 24, 2023 · Updated 2 years ago
- replace any object you want on the image with whatever you want ☆14 · Feb 6, 2024 · Updated 2 years ago
- Roomey is a multi-purpose Voice Agent designed to run your personal and business life. ☆60 · Jun 15, 2025 · Updated 8 months ago
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster. ☆70 · Oct 21, 2025 · Updated 3 months ago
- Simulates talk with an AI that can express emotions ☆83 · Jun 17, 2025 · Updated 8 months ago
- Multi-Stage Face-Voice Association Learning with Keynote Speaker Diarization (ACM MM 2024) ☆22 · Jul 25, 2024 · Updated last year
- A powerful Open-Source tool for transcribing and understanding speech. ☆23 · Updated this week
- Python Audio Separator in Real Time using MDX-NET model ☆24 · Jul 30, 2023 · Updated 2 years ago
- A highly-customizable OpenAI gym environment to train & evaluate RL agents trading stocks and crypto. ☆21 · Jun 6, 2023 · Updated 2 years ago
- An End-to-End Pipeline for Enhanced French Text-to-Speech with SSML Prosody Control ☆30 · Jan 13, 2026 · Updated last month
- A python package for deep multilingual punctuation prediction. ☆156 · Aug 21, 2024 · Updated last year
- Official Deepgram resources for deploying Deepgram services in a self-hosted environment ☆33 · Updated this week
- Real-time processing and delivery of sentences from a continuous stream of characters or text chunks. ☆74 · Jul 13, 2025 · Updated 7 months ago
- ☆27 · Mar 27, 2024 · Updated last year
- ☆29 · Jul 4, 2025 · Updated 7 months ago
- Apply Score diffusion to improve speech signals recorded under various adverse conditions and distortions, including noise, reverberation… ☆76 · Jul 29, 2024 · Updated last year
- Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper ☆5,355 · Nov 26, 2025 · Updated 2 months ago
- ☆31 · Feb 3, 2026 · Updated 2 weeks ago
- ☆38 · Apr 3, 2025 · Updated 10 months ago
- Official Repository For VoxBlink2 ☆85 · Aug 13, 2024 · Updated last year
- Local AI talk with a custom voice based on Zephyr 7B model. Uses RealtimeSTT with faster_whisper for transcription and RealtimeTTS with C… ☆714 · Jun 17, 2025 · Updated 8 months ago
- auto fine tune of models with synthetic data ☆78 · Feb 14, 2024 · Updated 2 years ago
- Transcription and annotation interface for recorded audio or video files ☆51 · Feb 10, 2026 · Updated last week
- Transcription and diarization (speaker identification) ☆34 · May 31, 2023 · Updated 2 years ago
- ☆16 · Jan 13, 2022 · Updated 4 years ago
- Voice Transformation for Videos. 🎤👄🎬 ☆259 · Jun 17, 2025 · Updated 8 months ago
- Image classification for Recyclables ☆10 · Sep 14, 2020 · Updated 5 years ago
- ☆11 · Apr 1, 2025 · Updated 10 months ago
- Continual Resilient (CoRe) Optimizer for PyTorch ☆11 · Jun 10, 2024 · Updated last year
- LD-Explorer is the missing tool for exploring, federating and querying linked data resources directly from the browser ☆19 · Feb 9, 2026 · Updated last week
- Cog implementation of transcribing + diarization pipeline with Whisper & Pyannote ☆233 · Feb 19, 2025 · Updated 11 months ago
- List of repositories relevant to VITS. ☆36 · Feb 26, 2023 · Updated 2 years ago