shashikg/X-Vector-Based-Speaker-Diarization

Course project for EE698R (2020-21 Sem 2). An x-vector-based speaker diarization system with an autoencoder-based clustering method. Also supports spectral and KMeans clustering methods.

☆16

Alternatives and similar repositories for X-Vector-Based-Speaker-Diarization

Users who are interested in X-Vector-Based-Speaker-Diarization are comparing it to the repositories listed below.

cvqluu / nn-similarity-diarization
Neural network based similarity scoring for diarization (PyTorch implementation of "LSTM based Similarity Measurement with Spectral Clust…
☆43 · Oct 21, 2020 · Updated 5 years ago
iiscleap / self_supervised_AHC
Contains code for Deep Self Supervised Hierarchical Clustering for Speaker Diarization
☆17 · Dec 16, 2021 · Updated 4 years ago
hqsiswiliam / punctuation-restoration-scl
Token-Level Supervised Contrastive Learning for Punctuation Restoration
☆29 · Sep 8, 2021 · Updated 4 years ago
liaochengcsu / Cascade_Residual_Attention_Enhanced_for_Refinement_Road_Extraction
The PyTorch implementation of the paper 'Cascaded Residual Attention Enhanced Road Extraction from Remote Sensing Images'
☆14 · Dec 29, 2021 · Updated 4 years ago
iwaterxt / voiceprint
Text-independent speaker identification
☆12 · Apr 9, 2018 · Updated 8 years ago
iiscleap / NISP-Dataset
☆31 · Aug 9, 2022 · Updated 3 years ago
Yuanjiayii / VGGT-360
☆15 · Jun 24, 2026 · Updated 3 weeks ago
dtc111111 / Reloc-VGGT
☆20 · Dec 25, 2025 · Updated 6 months ago
tjdevWorks / TEASEL
☆26 · May 8, 2022 · Updated 4 years ago
ymoslem / MT-Tools
Collection of Common Machine Translation Tools
☆11 · Jul 26, 2022 · Updated 3 years ago
Raghvender1205 / AI_From_Scratch
Into the depths of some concepts of Artificial Intelligence and Machine Learning
☆10 · Apr 4, 2026 · Updated 3 months ago
akhilmathurs / libriadapt
Instructions on downloading and using the LibriAdapt dataset
☆47 · Aug 13, 2021 · Updated 4 years ago
Yuan-ManX / audio-ai-agent
Here we will track the latest Audio AI Agent, including speech, music, sound effects, etc.
☆16 · Dec 8, 2023 · Updated 2 years ago
pyannote / pyannote-pipeline
Tunable pipelines
☆41 · Sep 9, 2025 · Updated 10 months ago
zskuang58 / WTRN-TIP
☆23 · Jul 4, 2022 · Updated 4 years ago
wq2012 / SpectralCluster
Python re-implementation of the (constrained) spectral clustering algorithms used in Google's speaker diarization papers.
☆552 · Sep 25, 2024 · Updated last year
mehedihasanbijoy / DPCSpell
[Computer Speech & Language] A transformer-based spelling error correction framework for Bangla and resource scarce Indic languages
☆14 · Aug 9, 2024 · Updated last year
quamernasim / Conversational-AI-System-using-Phi-2-PGVector-and-Llama-Index
Build a Conversational AI System that can answer questions by retrieving the answers from a document.
☆11 · Feb 23, 2024 · Updated 2 years ago
shapespark / shapespark-viewer-api
JavaScript API for interacting with the Shapespark 3D scene.
☆22 · Sep 14, 2022 · Updated 3 years ago
jmu201521121021 / RobustVideoMatting
Robust Video Matting in PyTorch, TensorFlow, TensorFlow.js, ONNX, CoreML!
☆16 · Aug 26, 2021 · Updated 4 years ago
BengaliAI / BADLAD
BADLAD: Bengali Document Layout Analysis Dataset
☆15 · May 12, 2024 · Updated 2 years ago
asiff00 / Training-TTS
Train and fine-tune text-to-speech models for Bengali and many other languages!
☆18 · Apr 2, 2025 · Updated last year
ssmlkl / MnTTS2
This is the experimental description of MnTTS2.
☆12 · Apr 11, 2024 · Updated 2 years ago
mayeenulislam / nagri-bangla
An open-source initiative to transcribe Silôṭi Nagri-Bānglā, and vice-versa. It's still in Alpha mode. See the demo:
☆12 · Apr 9, 2021 · Updated 5 years ago
interactive-cookbook / ara
Corpus and code for Aligned Recipe Actions (ARA) corpus, EMNLP 2021
☆10 · May 22, 2024 · Updated 2 years ago
amannm / super-resolution
client-side deep learning super resolution using TensorFlow.js
☆15 · Nov 7, 2021 · Updated 4 years ago
saiful9379 / Bangla_TTS
Bangla TTS Inference pipeline using Vit TTS
☆13 · Mar 24, 2024 · Updated 2 years ago
FlorianKrey / DNC
Discriminative Neural Clustering for Speaker Diarisation
☆79 · Apr 8, 2022 · Updated 4 years ago
SpeechColab / PySpeechColab
A library of speech gadgets.
☆15 · Oct 15, 2022 · Updated 3 years ago
farhad324 / Auto-Chloro-A-Crop-Disease-Classifier-and-Remedies-Provider-In-Bangla
Auto Chloro is a plant disease classifier & remedies provider that uses deep learning. It can predict diseases and provide remedies. The …
☆13 · Mar 30, 2021 · Updated 5 years ago
MuktadirHassan / 33-js-concepts
📜 33 JavaScript concepts every developer should know.
☆10 · Jun 21, 2022 · Updated 4 years ago
MathGenie / MathGenie
☆14 · Mar 11, 2024 · Updated 2 years ago
Honminden / GlobalMapNet
The official implementation of GlobalMapNet
☆20 · Jan 10, 2026 · Updated 6 months ago
HuangZiliAndy / RPNSD
PyTorch implementation of RPNSD
☆60 · Jun 17, 2024 · Updated 2 years ago
ishine / vc-lm
Convert anyone's voice timbre into thousands of different timbres
☆32 · Jun 29, 2023 · Updated 3 years ago
JieZzoo / Data_Trans
This is a script for data format conversion on object detection and other fields.
☆18 · Jul 16, 2024 · Updated 2 years ago
Ataullha / CSE476-Machine-Learning-Lab
CSE476-Machine-Learning-Lab
☆17 · Jul 1, 2023 · Updated 3 years ago
zhenghuatan / rVADfast
This is the Python library for an unsupervised, fast method for robust voice activity detection (rVAD), as in the paper rVAD: An Unsuperv…
☆154 · Jun 5, 2025 · Updated last year
Derpimort / VGGVox-PyTorch
Implementing VGGVox for Speaker Identification on VoxCeleb1 dataset in PyTorch.
☆25 · Oct 15, 2020 · Updated 5 years ago