Efficient approach to speaker diarization using voice characteristics extraction
☆107Jun 26, 2026Updated this week
Alternatives and similar repositories for WhoSpeaks
Users that are interested in WhoSpeaks are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Command Your World with Voice☆810Jun 17, 2025Updated last year
- Tr-VAD: An Efficient Transformer based Voice Activity Detection Model☆18Aug 1, 2024Updated last year
- Transcribe desktop audio/computer audio in real-time and locally (Streaming ASR), using TorchAudio and Emformer-RNNT model for inference,…☆14May 7, 2024Updated 2 years ago
- Simulates talk with an AI that can express emotions☆88Apr 4, 2026Updated 2 months ago
- Experimental implementation of regions in WebVTT building on Anne's WebVTT parser.☆14Oct 19, 2014Updated 11 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster.☆71Apr 22, 2026Updated 2 months ago
- A python package to build AI-powered real-time audio applications☆1,991Jun 19, 2026Updated last week
- Speaker Diarization with Transformers☆70Jun 8, 2025Updated last year
- Automatically setup the AISHELL-4 and MSDWild dataset for usage with pyannote-database (and pyannote-audio)☆15Oct 22, 2025Updated 8 months ago
- Roomey is a multi-purpose Voice Agent designed to run your personal and business life.☆66Jun 15, 2025Updated last year
- Python Audio Separator in Real Time using MDX-NET model☆25Jul 30, 2023Updated 2 years ago
- Low latency ai companion voice talk in 60 lines of code using faster_whisper and elevenlabs input streaming☆320Jun 17, 2025Updated last year
- PAFTS : Library That Preprocessing Audio For TTS.☆27Nov 15, 2024Updated last year
- Real-time processing and delivery of sentences from a continuous stream of characters or text chunks.☆81Updated this week
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- C++ version of pyannote audio overlapped speech detection pipeline☆13Feb 14, 2024Updated 2 years ago
- Converts text to speech in realtime☆3,971May 31, 2026Updated 3 weeks ago
- Simple PyTorch Denoisers for Waveform Audio☆41Apr 4, 2026Updated 2 months ago
- Cog implementation of transcribing + diarization pipeline with Whisper & Pyannote☆238Jun 11, 2026Updated 2 weeks ago
- An application-layer router for Skupper networks☆20May 28, 2026Updated last month
- auto fine tune of models with synthetic data☆78Feb 14, 2024Updated 2 years ago
- Open deep learning compiler stack for cpu, gpu and specialized accelerators☆19Updated this week
- Knowledge Graph constructed from Wikipedia☆19Dec 18, 2022Updated 3 years ago
- Multi-Stage Face-Voice Association Learning with Keynote Speaker Diarization (ACM MM 2024)☆22Jul 25, 2024Updated last year
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- ☆88Jul 31, 2025Updated 10 months ago
- A python package for deep multilingual punctuation prediction.☆165Aug 21, 2024Updated last year
- Apply Score diffusion to improve speech signals recorded under various adverse conditions and distortions, including noise, reverberation…☆78Jul 29, 2024Updated last year
- Local AI talk with a custom voice based on Zephyr 7B model. Uses RealtimeSTT with faster_whisper for transcription and RealtimeTTS with C…☆723Jun 17, 2025Updated last year
- Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper☆5,578Feb 23, 2026Updated 4 months ago
- 🎹 pyannote + 🗒 notebook = pyannotebook☆27Jun 12, 2023Updated 3 years ago
- Identity verification from speech☆19Jul 19, 2022Updated 3 years ago
- Python Wrapper around Ollama API Endpoints☆12Jan 26, 2024Updated 2 years ago
- Svelte app to generate audiobooks using XTTS☆12Feb 13, 2024Updated 2 years ago
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- [Colab Demo Code] OneFormer: One Transformer to Rule Universal Image Segmentation.☆14May 24, 2023Updated 3 years ago
- A toolkit for speaker diarization.☆485May 29, 2026Updated last month
- FastAPI WebSocket server for the OpenVoice text-to-speech model.☆12Jun 6, 2024Updated 2 years ago
- https://avocado-captioner.github.io/☆37Oct 16, 2025Updated 8 months ago
- Whitepapers and document repository for makepad☆13May 6, 2022Updated 4 years ago
- replace any object you want on the image with whatever you want☆14Feb 6, 2024Updated 2 years ago
- A simple Python tool to measure the performance of ONNX models.☆27Sep 15, 2024Updated last year