vishalshar/SpeakerDiarization_RNN_CNN_LSTM

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/vishalshar/SpeakerDiarization_RNN_CNN_LSTM)

vishalshar / SpeakerDiarization_RNN_CNN_LSTM

Speaker Diarization is the problem of separating speakers in an audio. There could be any number of speakers and final result should state when speaker starts and ends. In this project, we analyze given audio file with 2 channels and 2 speakers (on separate channels).

☆64

Alternatives and similar repositories for SpeakerDiarization_RNN_CNN_LSTM

Users that are interested in SpeakerDiarization_RNN_CNN_LSTM are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

wq2012 / SpectralCluster
View on GitHub
Python re-implementation of the (constrained) spectral clustering algorithms used in Google's speaker diarization papers.
☆553Sep 25, 2024Updated last year
taylorlu / Speaker-Diarization
View on GitHub
speaker diarization by uis-rnn and speaker embedding by vgg-speaker-recognition
☆501Jul 1, 2021Updated 5 years ago
ljuvela / multiscale-GAN
View on GitHub
Code for ICASSP 2019 paper
☆18Oct 29, 2018Updated 7 years ago
i3thuan5 / hts_engine_python
View on GitHub
python wrap for hts engine
☆14Jan 30, 2018Updated 8 years ago
auspicious3000 / WaveNet-Enhancement
View on GitHub
Speech Enhancement using Bayesian WaveNet
☆96Apr 1, 2018Updated 8 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
HaoFengyuan / EEND-IAAE
View on GitHub
The implementation of "End-to-End Neural Speaker Diarization with an Iterative Adaptive Attractor Estimation", which is accepted by Neura…
☆11Aug 27, 2023Updated 2 years ago
valiakon / MultimodalAnalysis_SpeakerDiarization
View on GitHub
The project tries to solve a speaker diarization problem using audio features, face recognition and video feature extraction from face im…
☆16Feb 10, 2019Updated 7 years ago
mahimg / Speaker-recognition
View on GitHub
Segment speech sequences based on speaker transitions, using ML and DSP.
☆17Jul 30, 2018Updated 7 years ago
hyli666 / DNN-SpeechEnhancement
View on GitHub
☆55Jul 21, 2019Updated 7 years ago
idnavid / speech_activity_detection
View on GitHub
Unsupervised speech activity detection system.
☆11Jul 2, 2018Updated 8 years ago
snsun / pit-speech-separation
View on GitHub
☆131Aug 9, 2018Updated 7 years ago
shincling / TDAAv2
View on GitHub
The updated version of TDAA model.
☆14Jul 2, 2020Updated 6 years ago
funcwj / uPIT-for-speech-separation
View on GitHub
Speech separation with utterance-level PIT experiments
☆106Jul 12, 2018Updated 8 years ago
liyongze / lstm_speaker_verification
View on GitHub
☆35Apr 8, 2019Updated 7 years ago
Simple, predictable pricing with DigitalOcean hosting • Ad
Always know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
funcwj / ge2e-speaker-verification
View on GitHub
Pytorch implementation of "Generalized End-to-End Loss for Speaker Verification"
☆103Mar 18, 2019Updated 7 years ago
asteroid-team / pytorch-pit
View on GitHub
Permutation invariant training in PyTorch
☆13Oct 2, 2020Updated 5 years ago
aalto-speech / speaker-diarization
View on GitHub
Speaker diarization scripts, based on AaltoASR
☆191Jan 3, 2019Updated 7 years ago
khaotik / DaNet-Tensorflow
View on GitHub
Tensorflow implementation of "Speaker-independent Speech Separation with Deep Attractor Network"
☆90Feb 2, 2021Updated 5 years ago
multimedia-berkeley / deep_hashing_coverSongDetection
View on GitHub
Cover Song Detection System
☆10Mar 29, 2019Updated 7 years ago
andabi / voice-vector
View on GitHub
Deep neural networks for getting text-independent speaker embedding written in TensorFlow
☆310Nov 19, 2018Updated 7 years ago
HaiFengZeng / clari_wavenet_vocoder
View on GitHub
☆56Aug 21, 2018Updated 7 years ago
iiscleap / NeuralPlda
View on GitHub
Implementation of Neural PLDA (NPLDA) model (A discriminative backend for Speaker Verification)
☆99Apr 20, 2020Updated 6 years ago
nttcslab-sp / mamba-diarization
View on GitHub
Official repository for Mamba-based Segmentation Model for Speaker Diarization
☆47May 13, 2025Updated last year
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
hitachi-speech / EEND
View on GitHub
End-to-End Neural Diarization
☆435Aug 30, 2021Updated 4 years ago
foamliu / Speaker-Embeddings
View on GitHub
PyTorch implementation of a self-attentive speaker embedding
☆17Sep 24, 2019Updated 6 years ago
desh2608 / diarizer
View on GitHub
Clustering-based methods for overlapping diarization
☆84Jan 12, 2024Updated 2 years ago
Jamiroquai88 / VBDiarization
View on GitHub
Speaker diarization based on Kaldi x-vectors, tuned for 16k microphone data
☆95Jul 6, 2023Updated 3 years ago
tuan3w / cnn_vocoder
View on GitHub
A fast cnn-based vocoder
☆78Jun 11, 2020Updated 6 years ago
WiraDKP / pytorch_gru_speaker_diarization
View on GitHub
Speaker Diarization using GRU in PyTorch
☆11Aug 29, 2020Updated 5 years ago
nttcslab-sp / EEND-vector-clustering
View on GitHub
This repository contains a set of codes to run (i.e., train, perform inference with, evaluate) a diarization method called EEND-vector-cl…
☆81Oct 18, 2022Updated 3 years ago
fgnt / pb_chime5
View on GitHub
Speech enhancement system for the CHiME-5 dinner party scenario
☆111Feb 6, 2025Updated last year
qqueing / DeepSpeaker-pytorch
View on GitHub
Speaker embedding(verification and recognition) using Pytorch
☆369Jul 24, 2020Updated 6 years ago
End-to-end encrypted email - Proton Mail • Ad
Special offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
calclavia / tal-asrd
View on GitHub
Code for the Paper Speech Recognition and Multi-Speaker Diarization of Long Conversations
☆39Jun 12, 2023Updated 3 years ago
mravanelli / pySpeechRev
View on GitHub
This python code performs an efficient speech reverberation starting from a dataset of close-talking speech signals and a collection of a…
☆97May 30, 2020Updated 6 years ago
sjlee7 / speech-dereverberation
View on GitHub
speech-dereverberation-using-GANs
☆13Jan 28, 2019Updated 7 years ago
ifnspaml / Perceptual-Weighting-Filter-Loss
View on GitHub
A perceptual weighting filter loss for DNN training in speech enhancement
☆24Apr 30, 2022Updated 4 years ago
tobiasfshr / gmm-ubm-speaker-identification-verification
View on GitHub
Implementation of a speaker identification and a speaker verification system based on Gaussian Mixture Models (GMM) in combination with a…
☆21Mar 1, 2018Updated 8 years ago
CSLT-THU / IS2019-VAE
View on GitHub
Tensorflow and kaldi implementation of our paper "VAE-based regularization for deep speaker embedding"
☆11Mar 24, 2023Updated 3 years ago
qqueing / SR_with_kaldi
View on GitHub
Speaker embedding(verification and recognition) using Tensorflow with Kaldi
☆41Sep 18, 2017Updated 8 years ago