Speaker diarization is the first step in many audio processing pipelines and aims to solve the problem of "who spoke when". It therefore relies on efficient use of temporal information from extracted audio features.
☆12Dec 7, 2018Updated 7 years ago
Alternatives and similar repositories for Speaker-Change-Detection
Users that are interested in Speaker-Change-Detection are comparing it to the libraries listed below.
- List of papers about TTS☆10Dec 16, 2017Updated 8 years ago
- Based on https://github.com/fatchord/WaveRNN☆24May 3, 2020Updated 6 years ago
- WaveNet implementation using tf.estimator☆21Jul 6, 2023Updated 2 years ago
- ☆24Oct 9, 2018Updated 7 years ago
- Asterisk PBX voicebot☆10May 9, 2023Updated 3 years ago
- A Text2Speech Engine built in Pytorch.☆12Dec 9, 2018Updated 7 years ago
- Speaker Diarization using GRU in PyTorch☆11Aug 29, 2020Updated 5 years ago
- This script is an automated survey bot that conducts political discussions over phone calls. It uses Flask, Twilio's Voice API, OpenAI's …☆12Sep 21, 2023Updated 2 years ago
- GlottDNN vocoder and tools for training DNN excitation models☆33Feb 27, 2021Updated 5 years ago
- style transfer for voice☆10Jul 16, 2018Updated 7 years ago
- End-to-End Probabilistic Inference for Nonstationary Audio Analysis☆12Aug 7, 2019Updated 6 years ago
- This project registers a Python SIP client as an extension in Asterisk/FreePBX and connects calls to OpenAI Voice Agent in real-time usin…☆22Sep 20, 2025Updated 7 months ago
- TTS front-end☆11Dec 19, 2018Updated 7 years ago
- Machine Learning model to detect answering machines in a voice call☆17Mar 12, 2026Updated last month
- ☆34Jul 16, 2019Updated 6 years ago
- CLI tool for automating Windows installer creation using Inno Setup.☆16Mar 27, 2026Updated last month
- API to the lacmus project☆11May 17, 2023Updated 2 years ago
- Voice Conversion using Tacotron.☆11Dec 29, 2022Updated 3 years ago
- PyTorch implementation of AVF☆45Sep 2, 2020Updated 5 years ago
- A curated list of internet telephony resources and software☆23Aug 15, 2022Updated 3 years ago
- using world vocoder to extract features and make data for training neural networks☆11Oct 9, 2017Updated 8 years ago
- Prosody-semantics Interface in Seoul Korean☆12Oct 9, 2020Updated 5 years ago
- Tensorflow Implementation of WaveGlow☆37May 4, 2020Updated 6 years ago
- The WhisperX API is a containerized solution for transcribing audio files using the powerful `whisperx` model. This API provides an easy-…☆18Aug 24, 2023Updated 2 years ago
- Speaker diarization via transfer learning☆27Mar 27, 2019Updated 7 years ago
- Wavelet phase harmonic scattering transform☆14Jul 5, 2022Updated 3 years ago
- 2018/2019 TTS framework integrating state of the art open source methods☆48Jul 8, 2019Updated 6 years ago
- This is the repository for the Whisper Python virtual assistant series of videos☆13Apr 7, 2024Updated 2 years ago
- RawNet: Fast End-to-End Neural Vocoder☆43May 29, 2019Updated 6 years ago
- Create speaker voiceprints from a few seconds of audio. And, identify individuals in real-time streaming or recorded conversations.☆15Feb 4, 2019Updated 7 years ago
- Trained speaker embedding deep learning models and evaluation pipelines in PyTorch and TensorFlow for speaker recognition.☆36Oct 4, 2019Updated 6 years ago
- HACK TOOLS☆14Sep 17, 2017Updated 8 years ago
- A Pytorch implementation of "Denoising Auto-encoder with Recurrent Skip Connections and Residual Regression for Music Source Separation"☆13Jul 3, 2019Updated 6 years ago
- A spectrograph display in your terminal☆17Oct 30, 2018Updated 7 years ago
- Mixture of Mixture of Agents: Can a swarm of squabbling specialists outsmart a bigger brain?☆78Apr 14, 2026Updated 3 weeks ago
- A TensorFlow implementation of light convolutional neural network (LCNN)☆12Dec 27, 2018Updated 7 years ago
- A Chainer implementation of ClariNet.☆45Nov 19, 2018Updated 7 years ago
- Speech synthesis with your own voice☆17Mar 4, 2019Updated 7 years ago
- A simple package to integrate CCAvenue☆10Jan 30, 2026Updated 3 months ago