shashikg / X-Vector-Based-Speaker-DiarizationView external linksLinks
Course project for EE698R (2020-21 Sem 2). An X-Vector Based Speaker Diarization System with AutoEncoder based clustering method. Also supports spectral and KMeans clustering method.
☆16Jun 2, 2021Updated 4 years ago
Alternatives and similar repositories for X-Vector-Based-Speaker-Diarization
Users that are interested in X-Vector-Based-Speaker-Diarization are comparing it to the libraries listed below
Sorting:
- Tunable pipelines☆41Sep 9, 2025Updated 5 months ago
- 将任意人的音色转换为成千上万种不同音色☆32Jun 29, 2023Updated 2 years ago
- A hand-gesture recognition system using Doppler effect of ultrasonic.☆11Mar 2, 2019Updated 6 years ago
- real time face swap and one-click video deepfake with only a single image☆11Sep 13, 2024Updated last year
- Into the depths of some concepts of Artificial Intelligence and Machine Learning☆10Jun 10, 2025Updated 8 months ago
- ☆11Aug 11, 2023Updated 2 years ago
- [ACM MobiSys 2024 Demo] Image-based Indoor Localization using Object Detection and LSTM☆11Jun 18, 2025Updated 7 months ago
- Create short vertical videos for TikTok, YouTube Shorts, and Instagram Reels using AI. Fully automated pipeline with traceability. 🚀🎥☆17Updated this week
- Neural network based similarity scoring for diarization (pytorch implementation of "LSTM based Similarity Measurement with Spectral Clust…☆44Oct 21, 2020Updated 5 years ago
- Collection of Common Machine Translation Tools☆11Jul 26, 2022Updated 3 years ago
- Auto Chloro is a plant disease classifier & remedies provider that uses deep learning. It can predict diseases and provide remedies. The …☆13Mar 30, 2021Updated 4 years ago
- Open Source Speech Inferencing Libary for Indic Languages☆13Apr 11, 2022Updated 3 years ago
- ☆10Dec 10, 2023Updated 2 years ago
- ☆17Oct 8, 2023Updated 2 years ago
- ☆12Nov 5, 2020Updated 5 years ago
- A massively multilingual corpus and pretrained model for IGT☆12Updated this week
- Evaluation metrics and submission file creation scripts the Action Recognition challenge☆14Updated this week
- Code for the paper "Free-View Expressive Talking Head Video Editing" (ICASSP 2023)☆12May 26, 2024Updated last year
- This library removes the jitter and smooth the landmarks coming from Mediapipe☆13Jan 16, 2023Updated 3 years ago
- Activity Grammars for Temporal Action Segmentation (NeurIPS 2023)☆14Jun 14, 2024Updated last year
- Pythonic Nvidia Codec Library☆17Aug 3, 2022Updated 3 years ago
- A library of speech gadgets.☆14Oct 15, 2022Updated 3 years ago
- An application to solve handwritten mathematical equations using deep learning algorithms.☆13Apr 8, 2018Updated 7 years ago
- Interaction Compass: Multi-Label Zero-Shot Learning of Human-Object Interactions via Spatial Relations @ ICCV21☆13Jul 15, 2022Updated 3 years ago
- Sync Lip in Unity by Wav2Lip☆11Jan 14, 2021Updated 5 years ago
- C++17 URL Parser (RFC 3986 compliant)☆11Jan 21, 2022Updated 4 years ago
- Convolutional Fine-Grained Classification with Self-Supervised Target Relation Regularization (TIP 2022)☆12Sep 8, 2022Updated 3 years ago
- Source code for ASRU 2019 paper "Adapting Pretrained Transformer to Lattices for Spoken Language Understanding"☆10Jul 8, 2020Updated 5 years ago
- Gradio_demo.py with Blinking on Still Mode Video Creation☆12Jun 21, 2023Updated 2 years ago
- Bangla TTS Inference pipeline using Vit TTS☆13Mar 24, 2024Updated last year
- CSE476-Machine-Learning-Lab☆17Jul 1, 2023Updated 2 years ago
- A trained model of YOLOv8 which will detect Fight or Violence and NonViolence in videos☆12Sep 20, 2024Updated last year
- Spectral Clustering in C++☆17Jan 8, 2013Updated 13 years ago
- 📜 33 JavaScript concepts every developer should know.☆10Jun 21, 2022Updated 3 years ago
- Build your own frontend AI agent with Chrome☆13Jan 16, 2026Updated last month
- Official implementation of SBNet as described in "Single-branch Network for Multimodal Training".☆12Aug 28, 2023Updated 2 years ago
- Learning Pytorch☆13Jun 12, 2018Updated 7 years ago
- ☆11May 7, 2022Updated 3 years ago
- Build a Conversational AI System that can answer questions by retrieving the answers from a document.☆11Feb 23, 2024Updated last year