Course project for EE698R (2020-21 Sem 2). An X-Vector Based Speaker Diarization System with AutoEncoder based clustering method. Also supports spectral and KMeans clustering method.
☆16Jun 2, 2021Updated 5 years ago
Alternatives and similar repositories for X-Vector-Based-Speaker-Diarization
Users that are interested in X-Vector-Based-Speaker-Diarization are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Contains code for Deep Self Supervised Heirarchical Clustering for Speaker Diarization☆17Dec 16, 2021Updated 4 years ago
- Token-Level Supervised Contrastive Learning for Punctuation Restoration☆29Sep 8, 2021Updated 4 years ago
- ☆31Aug 9, 2022Updated 3 years ago
- ☆26May 8, 2022Updated 4 years ago
- Implementation of the semi-structured inference model in our ACL 2020 paper, INFOTABS: Inference on Tables as Semi-structured Data.☆17Dec 7, 2021Updated 4 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Tunable pipelines☆41Sep 9, 2025Updated 9 months ago
- Here we will track the latest Audio AI Agent, including speech, music, sound effects, etc.☆16Dec 8, 2023Updated 2 years ago
- Robust Video Matting in PyTorch, TensorFlow, TensorFlow.js, ONNX, CoreML!☆16Aug 26, 2021Updated 4 years ago
- Into the depths of some concepts of Artificial Intelligence and Machine Learning☆10Apr 4, 2026Updated 2 months ago
- BADLAD: Bengali Document Layout Analysis Dataset☆15May 12, 2024Updated 2 years ago
- This is the experimental description of MnTTS2.☆12Apr 11, 2024Updated 2 years ago
- Corpus and code for Aligned Recipe Actions (ARA) corpus, EMNLP 2021☆10May 22, 2024Updated 2 years ago
- client-side deep learning super resolution using TensorFlow.js☆15Nov 7, 2021Updated 4 years ago
- Train and finutune text-to-speech models for Bengali and many other languages!☆18Apr 2, 2025Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Bangla TTS Inference pipeline using Vit TTS☆13Mar 24, 2024Updated 2 years ago
- 📜 33 JavaScript concepts every developer should know.☆10Jun 21, 2022Updated 4 years ago
- Build a Conversational AI System that can answer questions by retrieving the answers from a document.☆11Feb 23, 2024Updated 2 years ago
- [Computer Speech & Language] A transformer-based spelling error correction framework for Bangla and resource scarce Indic languages☆14Aug 9, 2024Updated last year
- ☆11Nov 5, 2020Updated 5 years ago
- Discriminative Neural Clustering for Speaker Diarisation☆79Apr 8, 2022Updated 4 years ago
- ☆14Mar 11, 2024Updated 2 years ago
- 将任意人的音色转换为成千上万种不同音色☆32Jun 29, 2023Updated 3 years ago
- An open-source initiative to transcribe Silôṭi Nagri-Bānglā, and vice-versa. It's still in Alpha mode. See the demo:☆12Apr 9, 2021Updated 5 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- PyTorch implementation of RPNSD☆60Jun 17, 2024Updated 2 years ago
- Auto Chloro is a plant disease classifier & remedies provider that uses deep learning. It can predict diseases and provide remedies. The …☆13Mar 30, 2021Updated 5 years ago
- A massively multilingual corpus and pretrained model for IGT☆13Jun 4, 2026Updated 3 weeks ago
- Use BERT for Question Answering and finetune train with SQuAD 2.0☆15Oct 12, 2019Updated 6 years ago
- Speedy Camera Fingerprinting Library☆23Feb 17, 2022Updated 4 years ago
- This is the Python library for an unsupervised, fast method for robust voice activity detection (rVAD), as in the paper rVAD: An Unsuperv…☆154Jun 5, 2025Updated last year
- ☆11May 7, 2022Updated 4 years ago
- Bert-Based persian spell-checker☆19Mar 9, 2024Updated 2 years ago
- A Benchmark Corpus for Low-Resource Cantonese Punctuation Restoration from Speech Transcripts☆16Dec 3, 2024Updated last year
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Adding custom virtual backgrounds to video stream☆16Jul 24, 2022Updated 3 years ago
- Heterogeneous Multi-agent Version of Highway-env☆18Jun 28, 2023Updated 3 years ago
- A lightweight library to compute Diarization Error Rate (DER).☆62Jan 14, 2026Updated 5 months ago
- Resources and Tool for Bangla language computation☆14Feb 20, 2026Updated 4 months ago
- CSE476-Machine-Learning-Lab☆17Jul 1, 2023Updated 2 years ago
- Open Source Speech Inferencing Libary for Indic Languages☆13Apr 11, 2022Updated 4 years ago
- Automatic Context Sensitive Spelling Correction for Bangla Text Using Bert and Levenstein Distance☆21Nov 18, 2024Updated last year