Speaker Diarization is the first step in many early audio processing and aims to solve the problem ”who spoke when”. It therefore relies on efficient use of temporal information from extracted audio features.
☆12Dec 7, 2018Updated 7 years ago
Alternatives and similar repositories for Speaker-Change-Detection
Users that are interested in Speaker-Change-Detection are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- List of papers about TTS / Список статей о TTS☆10Dec 16, 2017Updated 8 years ago
- Classifying utterances in Hindi speech in one of the 8 emotional states (anger, fear, disgust, neutral, sad, happy, surprise, sarcastic) …☆11Apr 28, 2016Updated 10 years ago
- Based on https://github.com/fatchord/WaveRNN☆24May 3, 2020Updated 6 years ago
- WaveNet implementation using tf.estimator☆21Jul 6, 2023Updated 2 years ago
- ☆24Oct 9, 2018Updated 7 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- A Text2Speech Engine built in Pytorch.☆12Dec 9, 2018Updated 7 years ago
- GlottDNN vocoder and tools for training DNN excitation models☆33Feb 27, 2021Updated 5 years ago
- style transfer for voice☆10Jul 16, 2018Updated 7 years ago
- End-to-End Probabilistic Inference for Nonstationary Audio Analysis☆12Aug 7, 2019Updated 6 years ago
- This project registers a Python SIP client as an extension in Asterisk/FreePBX and connects calls to OpenAI Voice Agent in real-time usin…☆23Sep 20, 2025Updated 8 months ago
- tts fronted-end☆11Dec 19, 2018Updated 7 years ago
- Machine Learning model to detect answering machines in a voice call☆17Mar 12, 2026Updated 2 months ago
- ☆34Jul 16, 2019Updated 6 years ago
- CLI tool for automating Windows installer creation using Inno Setup.☆16Mar 27, 2026Updated 2 months ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- API to the lacmus project☆11May 17, 2023Updated 3 years ago
- Voice Conversion using Tacotron.☆11Dec 29, 2022Updated 3 years ago
- PyTorch implementation of AVF☆45Sep 2, 2020Updated 5 years ago
- Intrusion. Custom Asterisk dial plan for listen, whisper and barge in calls. For Asterisk FreePBX, Issabel, Asterisk based Elastix call c…☆16Jul 9, 2021Updated 4 years ago
- using world vocoder to extract features and make data for training neural networks☆11Oct 9, 2017Updated 8 years ago
- Predictive Guardians: An AI-driven crime prevention solution utilizing advanced analytics, machine learning, and optimization. Uncover cr…☆13Mar 8, 2025Updated last year
- Prosody-semantics Interface in Seoul Korean☆12Oct 9, 2020Updated 5 years ago
- Call Center Intelligence powered by Azure AI☆16Feb 16, 2022Updated 4 years ago
- Tensorflow Implementation of WaveGlow☆37May 4, 2020Updated 6 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- The WhisperX API is a containerized solution for transcribing audio files using the powerful `whisperx` model. This API provides an easy-…☆18Aug 24, 2023Updated 2 years ago
- Speaker diarization via transfer learning☆27Mar 27, 2019Updated 7 years ago
- Wavelet phase harmonic scattering transform☆14Jul 5, 2022Updated 3 years ago
- 2018/2019 TTS framework integrating state of the art open source methods☆48Jul 8, 2019Updated 6 years ago
- This is the repository for the Whisper Python virtual assistant series of videos☆13Apr 7, 2024Updated 2 years ago
- RawNet: Fast End-to-End Neural Vocoder☆43May 29, 2019Updated 7 years ago
- Create speaker voiceprints from a few seconds of audio. And, identify individuals in real-time streaming or recorded conversations.☆15Feb 4, 2019Updated 7 years ago
- Trained speaker embedding deep learning models and evaluation pipelines in pytorch and tesorflow for speaker recognition.☆36Oct 4, 2019Updated 6 years ago
- HACK TOOLS☆14Sep 17, 2017Updated 8 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- This is the guide to show the method to build your own AI-Powered voice agent with LiveKit and Twillio☆27Feb 5, 2025Updated last year
- A Pytorch implementation of "Denoising Auto-encoder with Recurrent Skip Connections and Residual Regression for Music Source Separation"☆13Jul 3, 2019Updated 6 years ago
- A spectrograph display in your terminal☆17Oct 30, 2018Updated 7 years ago
- Keras based Reading Comprehension Models☆19Dec 20, 2018Updated 7 years ago
- A TensorFlow implementation of light convolutional neural network (LCNN)☆12Dec 27, 2018Updated 7 years ago
- A Chainer implementation of ClariNet.☆45Nov 19, 2018Updated 7 years ago
- 自分の声で音声合成☆17Mar 4, 2019Updated 7 years ago