shvmshukla / Speaker-Change-DetectionView external linksLinks
Speaker Diarization is the first step in many early audio processing and aims to solve the problem ”who spoke when”. It therefore relies on efficient use of temporal information from extracted audio features.
☆12Dec 7, 2018Updated 7 years ago
Alternatives and similar repositories for Speaker-Change-Detection
Users that are interested in Speaker-Change-Detection are comparing it to the libraries listed below
Sorting:
- List of papers about TTS / Список статей о TTS☆10Dec 16, 2017Updated 8 years ago
- WaveNet implementation using tf.estimator☆21Jul 6, 2023Updated 2 years ago
- ☆24Oct 9, 2018Updated 7 years ago
- Based on https://github.com/fatchord/WaveRNN☆24May 3, 2020Updated 5 years ago
- A simple package to integrate CCAvenue☆10Jan 30, 2026Updated 2 weeks ago
- ☆34Jul 16, 2019Updated 6 years ago
- Presentation, Code and Notebooks used in the conference☆11Aug 1, 2023Updated 2 years ago
- PyTorch implementation of AVF☆45Sep 2, 2020Updated 5 years ago
- Tensorflow Implementation of WaveGlow☆37May 4, 2020Updated 5 years ago
- Trained speaker embedding deep learning models and evaluation pipelines in pytorch and tesorflow for speaker recognition.☆36Oct 4, 2019Updated 6 years ago
- Archive of my older research papers on optimization☆10Jan 20, 2021Updated 5 years ago
- 2018/2019 TTS framework integrating state of the art open source methods☆48Jul 8, 2019Updated 6 years ago
- API to the lacmus project☆11May 17, 2023Updated 2 years ago
- ☆10Apr 8, 2024Updated last year
- Python bindings for NVIDIA CUDA APIs.☆13Mar 2, 2024Updated last year
- style transfer for voice☆10Jul 16, 2018Updated 7 years ago
- Automatic Speech Recognition using Tensorflow☆46Aug 9, 2017Updated 8 years ago
- End-to-End Probabilistic Inference for Nonstationary Audio Analysis☆12Aug 7, 2019Updated 6 years ago
- Udacity Nanodegree - Data Analyst - Wrangling, Exploring, Analyzing, and Visualizing Data☆10Jul 23, 2017Updated 8 years ago
- Solution for N+1 fish, N+2 fish DrivenData competition (2nd place)☆13Sep 12, 2019Updated 6 years ago
- Implementation for NATv2.☆23Feb 20, 2021Updated 4 years ago
- A Text2Speech Engine built in Pytorch.☆12Dec 9, 2018Updated 7 years ago
- PyTorch implementation of Retriever: Learning Content-Style Representation☆12Jan 27, 2023Updated 3 years ago
- Image and video processing toolbox☆10Jun 12, 2020Updated 5 years ago
- Determines the ethnicity based on your last name☆10Aug 17, 2014Updated 11 years ago
- tts fronted-end☆11Dec 19, 2018Updated 7 years ago
- DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism (SVS & TTS); AAAI 2022; Official code☆10Mar 8, 2022Updated 3 years ago
- ☆12Jun 5, 2018Updated 7 years ago
- Tensorflow implementation of Nvidia Waveglow☆41Dec 5, 2018Updated 7 years ago
- RawNet: Fast End-to-End Neural Vocoder☆42May 29, 2019Updated 6 years ago
- A Chainer implementation of ClariNet.☆45Nov 19, 2018Updated 7 years ago
- Constrained Permutation Invariant Training, Speech Separation☆52Jan 24, 2021Updated 5 years ago
- ☆11Apr 7, 2019Updated 6 years ago
- Deep Multi-Speech model☆11Jul 25, 2018Updated 7 years ago
- Create speaker voiceprints from a few seconds of audio. And, identify individuals in real-time streaming or recorded conversations.☆14Feb 4, 2019Updated 7 years ago
- MU-GAN: Facial Attribute Editing based on Multi-attention Mechanism☆12Jun 7, 2020Updated 5 years ago
- ☆22Jul 30, 2025Updated 6 months ago
- Wavelet phase harmonic scattering transform☆12Jul 5, 2022Updated 3 years ago
- This repo contains the baseline model recipes and pre-trained model for GramVanni hindi ASR challenge☆15Mar 26, 2022Updated 3 years ago