Efficient approach to speaker diarization using voice characteristics extraction
☆106Jun 17, 2025Updated 10 months ago
Alternatives and similar repositories for WhoSpeaks
Users that are interested in WhoSpeaks are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Transcribe desktop audio/computer audio in real-time and locally (Streaming ASR), using TorchAudio and Emformer-RNNT model for inference,…☆14May 7, 2024Updated last year
- Tr-VAD: An Efficient Transformer based Voice Activity Detection Model☆17Aug 1, 2024Updated last year
- Simulates talk with an AI that can express emotions☆84Apr 4, 2026Updated 3 weeks ago
- A python package to build AI-powered real-time audio applications☆1,972Feb 12, 2025Updated last year
- Speaker Diarization with Transformers☆70Jun 8, 2025Updated 10 months ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Automatically setup the AISHELL-4 and MSDWild dataset for usage with pyannote-database (and pyannote-audio)☆15Oct 22, 2025Updated 6 months ago
- Roomey is a multi-purpose Voice Agent designed to run your personal and business life.☆63Jun 15, 2025Updated 10 months ago
- Low latency ai companion voice talk in 60 lines of code using faster_whisper and elevenlabs input streaming☆317Jun 17, 2025Updated 10 months ago
- THIS IS EXPERIMENTAL VERSION Fully local program to make your own AI waifu! Vtuber model, voice, ect. Emphasis on personal use and compa…☆19Jul 9, 2025Updated 9 months ago
- Real-time processing and delivery of sentences from a continuous stream of characters or text chunks.☆77Mar 31, 2026Updated 3 weeks ago
- C++ version of pyannote audio overlapped speech detection pipeline☆13Feb 14, 2024Updated 2 years ago
- Converts text to speech in realtime☆3,879Apr 9, 2026Updated 2 weeks ago
- Simple PyTorch Denoisers for Waveform Audio☆41Apr 4, 2026Updated 3 weeks ago
- Cog implementation of transcribing + diarization pipeline with Whisper & Pyannote☆234Feb 19, 2025Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Blind Source Separation and Dereverberation☆20Mar 26, 2021Updated 5 years ago
- auto fine tune of models with synthetic data☆78Feb 14, 2024Updated 2 years ago
- Knowledge Graph constructed from Wikipedia☆18Dec 18, 2022Updated 3 years ago
- Multi-Stage Face-Voice Association Learning with Keynote Speaker Diarization (ACM MM 2024)☆22Jul 25, 2024Updated last year
- Exploring Binary Classification Loss for Speaker Verification☆18Jul 18, 2023Updated 2 years ago
- Offline Speaker Diarization with SenseVoice by Sherpa ONNX.☆15Dec 23, 2024Updated last year
- A python package for deep multilingual punctuation prediction.☆162Aug 21, 2024Updated last year
- Apply Score diffusion to improve speech signals recorded under various adverse conditions and distortions, including noise, reverberation…☆76Jul 29, 2024Updated last year
- Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper☆5,498Feb 23, 2026Updated 2 months ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Local AI talk with a custom voice based on Zephyr 7B model. Uses RealtimeSTT with faster_whisper for transcription and RealtimeTTS with C…☆721Jun 17, 2025Updated 10 months ago
- 🎹 pyannote + 🗒 notebook = pyannotebook☆26Jun 12, 2023Updated 2 years ago
- Identity verification from speech☆19Jul 19, 2022Updated 3 years ago
- [Colab Demo Code] OneFormer: One Transformer to Rule Universal Image Segmentation.☆14May 24, 2023Updated 2 years ago
- A toolkit for speaker diarization.☆447Apr 9, 2026Updated 2 weeks ago
- C++ version of pyannote audio speaker diarizaiton pipeline☆22Feb 14, 2024Updated 2 years ago
- Minimal VS Code extension for PI Coding Agent.☆74Updated this week
- A macOS command line wrapper around the Apple Vision framework☆33Nov 23, 2025Updated 5 months ago
- Whitepapers and document repository for makepad☆13May 6, 2022Updated 3 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Rust crates to read and use Octopus MDict Dictionary.☆11Aug 27, 2020Updated 5 years ago
- Custom ComfyUI node that combines VSR + VFI and allows streaming processing for arbitrary video length.☆61Mar 28, 2026Updated last month
- replace any object you want on the image with whatever you want☆14Feb 6, 2024Updated 2 years ago
- A scalable solution that simplifies the integration of ComfyUI for developers☆11Jul 15, 2024Updated last year
- ☆13May 23, 2024Updated last year
- An unofficial implementation of the Personal VAD speaker-conditioned voice activity detection method. Bachelor's thesis project.☆83Sep 22, 2022Updated 3 years ago
- A Yiddish orthographic normalizer: Standard Yiddish goes in, Hasidic Yiddish comes out☆16Jun 26, 2024Updated last year