Speaker Diarization is the first step in many early audio processing and aims to solve the problem ”who spoke when”. It therefore relies on efficient use of temporal information from extracted audio features.
☆12Dec 7, 2018Updated 7 years ago
Alternatives and similar repositories for Speaker-Change-Detection
Users that are interested in Speaker-Change-Detection are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- List of papers about TTS / Список статей о TTS☆10Dec 16, 2017Updated 8 years ago
- Classifying utterances in Hindi speech in one of the 8 emotional states (anger, fear, disgust, neutral, sad, happy, surprise, sarcastic) …☆11Apr 28, 2016Updated 10 years ago
- Based on https://github.com/fatchord/WaveRNN☆24May 3, 2020Updated 6 years ago
- WaveNet implementation using tf.estimator☆21Jul 6, 2023Updated 2 years ago
- ☆24Oct 9, 2018Updated 7 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Asterisk PBX voicebot☆10May 9, 2023Updated 3 years ago
- Bank Marketing data classification☆12Oct 2, 2020Updated 5 years ago
- GlottDNN vocoder and tools for training DNN excitation models☆34Feb 27, 2021Updated 5 years ago
- style transfer for voice☆10Jul 16, 2018Updated 7 years ago
- End-to-End Probabilistic Inference for Nonstationary Audio Analysis☆12Aug 7, 2019Updated 6 years ago
- tts fronted-end☆11Dec 19, 2018Updated 7 years ago
- ☆34Jul 16, 2019Updated 6 years ago
- Analytics projects using Big Data eco-systems (Hadoop, Spark, Storm)☆17Dec 27, 2021Updated 4 years ago
- API to the lacmus project☆11May 17, 2023Updated 3 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- ☆16Aug 1, 2018Updated 7 years ago
- Voice Conversion using Tacotron.☆11Dec 29, 2022Updated 3 years ago
- PyTorch implementation of AVF☆45Sep 2, 2020Updated 5 years ago
- A curated list of internet telephony resources and software☆23Aug 15, 2022Updated 3 years ago
- Intrusion. Custom Asterisk dial plan for listen, whisper and barge in calls. For Asterisk FreePBX, Issabel, Asterisk based Elastix call c…☆16Jul 9, 2021Updated 4 years ago
- using world vocoder to extract features and make data for training neural networks☆11Oct 9, 2017Updated 8 years ago
- Predictive Guardians: An AI-driven crime prevention solution utilizing advanced analytics, machine learning, and optimization. Uncover cr…☆13Mar 8, 2025Updated last year
- Prosody-semantics Interface in Seoul Korean☆12Oct 9, 2020Updated 5 years ago
- Call Center Intelligence powered by Azure AI☆16Feb 16, 2022Updated 4 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- ☆16Jun 9, 2023Updated 3 years ago
- Tensorflow Implementation of WaveGlow☆37May 4, 2020Updated 6 years ago
- Speaker diarization via transfer learning☆27Mar 27, 2019Updated 7 years ago
- Wavelet phase harmonic scattering transform☆14Jul 5, 2022Updated 3 years ago
- TTS framework integrating state of the art open source methods (2018/2019)☆48Jun 9, 2026Updated last week
- RawNet: Fast End-to-End Neural Vocoder☆43May 29, 2019Updated 7 years ago
- This is the repository for the Whisper Python virtual assistant series of videos☆13Apr 7, 2024Updated 2 years ago
- Create speaker voiceprints from a few seconds of audio. And, identify individuals in real-time streaming or recorded conversations.☆15Feb 4, 2019Updated 7 years ago
- Trained speaker embedding deep learning models and evaluation pipelines in pytorch and tesorflow for speaker recognition.☆36Oct 4, 2019Updated 6 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- This is the guide to show the method to build your own AI-Powered voice agent with LiveKit and Twillio☆27Feb 5, 2025Updated last year
- A Pytorch implementation of "Denoising Auto-encoder with Recurrent Skip Connections and Residual Regression for Music Source Separation"☆13Jul 3, 2019Updated 6 years ago
- A spectrograph display in your terminal☆17Oct 30, 2018Updated 7 years ago
- A TensorFlow implementation of light convolutional neural network (LCNN)☆12Dec 27, 2018Updated 7 years ago
- A Chainer implementation of ClariNet.☆45Nov 19, 2018Updated 7 years ago
- 自分の声で音声合成☆17Mar 4, 2019Updated 7 years ago
- Tools to compute and visualize economic models☆23Jan 1, 2020Updated 6 years ago