Efficient approach to speaker diarization using voice characteristics extraction
☆107Jun 17, 2025Updated 11 months ago
Alternatives and similar repositories for WhoSpeaks
Users that are interested in WhoSpeaks are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Command Your World with Voice☆810Jun 17, 2025Updated 11 months ago
- Transcribe desktop audio/computer audio in real-time and locally (Streaming ASR), using TorchAudio and Emformer-RNNT model for inference,…☆14May 7, 2024Updated 2 years ago
- Tr-VAD: An Efficient Transformer based Voice Activity Detection Model☆18Aug 1, 2024Updated last year
- Simulates talk with an AI that can express emotions☆87Apr 4, 2026Updated 2 months ago
- A python package to build AI-powered real-time audio applications☆1,982Feb 12, 2025Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Speaker Diarization with Transformers☆70Jun 8, 2025Updated last year
- Automatically setup the AISHELL-4 and MSDWild dataset for usage with pyannote-database (and pyannote-audio)☆15Oct 22, 2025Updated 7 months ago
- Roomey is a multi-purpose Voice Agent designed to run your personal and business life.☆66Jun 15, 2025Updated 11 months ago
- Low latency ai companion voice talk in 60 lines of code using faster_whisper and elevenlabs input streaming☆319Jun 17, 2025Updated 11 months ago
- Real-time processing and delivery of sentences from a continuous stream of characters or text chunks.☆79Mar 31, 2026Updated 2 months ago
- C++ version of pyannote audio overlapped speech detection pipeline☆13Feb 14, 2024Updated 2 years ago
- Converts text to speech in realtime☆3,943May 31, 2026Updated last week
- Simple PyTorch Denoisers for Waveform Audio☆41Apr 4, 2026Updated 2 months ago
- Cog implementation of transcribing + diarization pipeline with Whisper & Pyannote☆236Feb 19, 2025Updated last year
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Blind Source Separation and Dereverberation☆21Mar 26, 2021Updated 5 years ago
- An application-layer router for Skupper networks☆20May 28, 2026Updated last week
- auto fine tune of models with synthetic data☆78Feb 14, 2024Updated 2 years ago
- Open deep learning compiler stack for cpu, gpu and specialized accelerators☆19Updated this week
- Multi-Stage Face-Voice Association Learning with Keynote Speaker Diarization (ACM MM 2024)☆22Jul 25, 2024Updated last year
- Exploring Binary Classification Loss for Speaker Verification☆18Jul 18, 2023Updated 2 years ago
- Offline Speaker Diarization with SenseVoice by Sherpa ONNX.☆15Dec 23, 2024Updated last year
- Apply Score diffusion to improve speech signals recorded under various adverse conditions and distortions, including noise, reverberation…☆78Jul 29, 2024Updated last year
- Multi Browser Kango Extension for BGPView - A DNS and BGP network visualizer☆10May 16, 2017Updated 9 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper☆5,549Feb 23, 2026Updated 3 months ago
- 🎹 pyannote + 🗒 notebook = pyannotebook☆26Jun 12, 2023Updated 2 years ago
- Identity verification from speech☆19Jul 19, 2022Updated 3 years ago
- Svelte app to generate audiobooks using XTTS☆12Feb 13, 2024Updated 2 years ago
- A toolkit for speaker diarization.☆473May 29, 2026Updated last week
- FastAPI WebSocket server for the OpenVoice text-to-speech model.☆12Jun 6, 2024Updated 2 years ago
- Custom ComfyUI node that combines VSR + VFI and allows streaming processing for arbitrary video length.☆66Mar 28, 2026Updated 2 months ago
- audio, NLP, ML with huggingface, nvidia/nemo, speechbrain☆11Sep 4, 2023Updated 2 years ago
- A simple Python tool to measure the performance of ONNX models.☆27Sep 15, 2024Updated last year
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Latent Large Language Models☆19Aug 24, 2024Updated last year
- A highly-customizable OpenAI gym environment to train & evaluate RL agents trading stocks and crypto.☆21Jun 6, 2023Updated 3 years ago
- Personal assistant, project and schedule manager, coach, motivator, angry girlfriend and salvation - character AI waifu llm based on olla…☆12Jan 26, 2026Updated 4 months ago
- An unofficial implementation of the Personal VAD speaker-conditioned voice activity detection method. Bachelor's thesis project.☆86Sep 22, 2022Updated 3 years ago
- 💬 ASR FastAPI server using faster-whisper and Multi-Scale Auto-Tuning Spectral Clustering for diarization.☆219Oct 30, 2024Updated last year
- ☆13May 23, 2024Updated 2 years ago
- Some comprehensive papers about speaker diarization☆359Mar 24, 2026Updated 2 months ago