Speaker Diarization is the first step in many early audio processing and aims to solve the problem ”who spoke when”. It therefore relies on efficient use of temporal information from extracted audio features.
☆12Dec 7, 2018Updated 7 years ago
Alternatives and similar repositories for Speaker-Change-Detection
Users that are interested in Speaker-Change-Detection are comparing it to the libraries listed below
Sorting:
- List of papers about TTS / Список статей о TTS☆10Dec 16, 2017Updated 8 years ago
- WaveNet implementation using tf.estimator☆21Jul 6, 2023Updated 2 years ago
- ☆24Oct 9, 2018Updated 7 years ago
- Based on https://github.com/fatchord/WaveRNN☆24May 3, 2020Updated 5 years ago
- Example Python and R code for Cloudera Machine Learning (CML) training☆14Dec 1, 2020Updated 5 years ago
- A simple package to integrate CCAvenue☆10Jan 30, 2026Updated last month
- GlottDNN vocoder and tools for training DNN excitation models☆32Feb 27, 2021Updated 5 years ago
- ☆34Jul 16, 2019Updated 6 years ago
- PyTorch implementation of AVF☆45Sep 2, 2020Updated 5 years ago
- Presentation, Code and Notebooks used in the conference☆11Aug 1, 2023Updated 2 years ago
- Tensorflow Implementation of WaveGlow☆37May 4, 2020Updated 5 years ago
- Trained speaker embedding deep learning models and evaluation pipelines in pytorch and tesorflow for speaker recognition.☆36Oct 4, 2019Updated 6 years ago
- This project registers a Python SIP client as an extension in Asterisk/FreePBX and connects calls to OpenAI Voice Agent in real-time usin…☆23Sep 20, 2025Updated 5 months ago
- Archive of my older research papers on optimization☆10Jan 20, 2021Updated 5 years ago
- 2018/2019 TTS framework integrating state of the art open source methods☆48Jul 8, 2019Updated 6 years ago
- Image and video processing toolbox☆10Jun 12, 2020Updated 5 years ago
- Implementation for NATv2.☆23Feb 20, 2021Updated 5 years ago
- Udacity Nanodegree - Data Analyst - Wrangling, Exploring, Analyzing, and Visualizing Data☆10Jul 23, 2017Updated 8 years ago
- ☆12Jun 5, 2018Updated 7 years ago
- Python bindings for NVIDIA CUDA APIs.☆13Mar 2, 2024Updated 2 years ago
- Automatic Speech Recognition using Tensorflow☆46Aug 9, 2017Updated 8 years ago
- A Text2Speech Engine built in Pytorch.☆12Dec 9, 2018Updated 7 years ago
- Asterisk PBX voicebot☆10May 9, 2023Updated 2 years ago
- This script is an automated survey bot that conducts political discussions over phone calls. It uses Flask, Twilio's Voice API, OpenAI's …☆12Sep 21, 2023Updated 2 years ago
- API to the lacmus project☆11May 17, 2023Updated 2 years ago
- PyTorch implementation of Retriever: Learning Content-Style Representation☆12Jan 27, 2023Updated 3 years ago
- tts fronted-end☆11Dec 19, 2018Updated 7 years ago
- style transfer for voice☆10Jul 16, 2018Updated 7 years ago
- ☆10Apr 8, 2024Updated last year
- Determines the ethnicity based on your last name☆10Aug 17, 2014Updated 11 years ago
- End-to-End Probabilistic Inference for Nonstationary Audio Analysis☆12Aug 7, 2019Updated 6 years ago
- Predictive Guardians: An AI-driven crime prevention solution utilizing advanced analytics, machine learning, and optimization. Uncover cr…☆12Mar 8, 2025Updated last year
- DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism (SVS & TTS); AAAI 2022; Official code☆10Mar 8, 2022Updated 4 years ago
- Solution for N+1 fish, N+2 fish DrivenData competition (2nd place)☆13Sep 12, 2019Updated 6 years ago
- Data and source for Azure Computer Vision classify birds with Python SDK☆11Jan 20, 2021Updated 5 years ago
- RawNet: Fast End-to-End Neural Vocoder☆42May 29, 2019Updated 6 years ago
- Tensorflow implementation of Nvidia Waveglow☆41Dec 5, 2018Updated 7 years ago
- A Chainer implementation of ClariNet.☆45Nov 19, 2018Updated 7 years ago
- Constrained Permutation Invariant Training, Speech Separation☆52Jan 24, 2021Updated 5 years ago