Course project for EE698R (2020-21 Sem 2). An X-Vector Based Speaker Diarization System with AutoEncoder based clustering method. Also supports spectral and KMeans clustering method.
☆16Jun 2, 2021Updated 4 years ago
Alternatives and similar repositories for X-Vector-Based-Speaker-Diarization
Users that are interested in X-Vector-Based-Speaker-Diarization are comparing it to the libraries listed below
Sorting:
- Tunable pipelines☆41Sep 9, 2025Updated 5 months ago
- 将任意人的音色转换为成千上万种不同音色☆32Jun 29, 2023Updated 2 years ago
- A hand-gesture recognition system using Doppler effect of ultrasonic.☆11Mar 2, 2019Updated 7 years ago
- real time face swap and one-click video deepfake with only a single image☆12Sep 13, 2024Updated last year
- Create short vertical videos for TikTok, YouTube Shorts, and Instagram Reels using AI. Fully automated pipeline with traceability. 🚀🎥☆22Updated this week
- ☆11Aug 11, 2023Updated 2 years ago
- [ACM MobiSys 2024 Demo] Image-based Indoor Localization using Object Detection and LSTM☆12Feb 12, 2026Updated 3 weeks ago
- ☆10Nov 26, 2024Updated last year
- Activity Grammars for Temporal Action Segmentation (NeurIPS 2023)☆14Jun 14, 2024Updated last year
- A library of speech gadgets.☆14Oct 15, 2022Updated 3 years ago
- Code for the paper "Free-View Expressive Talking Head Video Editing" (ICASSP 2023)☆12May 26, 2024Updated last year
- Open Source Speech Inferencing Libary for Indic Languages☆13Apr 11, 2022Updated 3 years ago
- An application to solve handwritten mathematical equations using deep learning algorithms.☆13Apr 8, 2018Updated 7 years ago
- jitsi meet video call with gstreamer☆11Nov 25, 2021Updated 4 years ago
- Collection of Common Machine Translation Tools☆11Jul 26, 2022Updated 3 years ago
- 藏语威利转写☆11Jul 19, 2016Updated 9 years ago
- ☆17Oct 8, 2023Updated 2 years ago
- This library removes the jitter and smooth the landmarks coming from Mediapipe☆13Jan 16, 2023Updated 3 years ago
- Auto Chloro is a plant disease classifier & remedies provider that uses deep learning. It can predict diseases and provide remedies. The …☆13Mar 30, 2021Updated 4 years ago
- Interaction Compass: Multi-Label Zero-Shot Learning of Human-Object Interactions via Spatial Relations @ ICCV21☆13Jul 15, 2022Updated 3 years ago
- Traefik configuration for Jitsi Meet on Docker☆15Oct 21, 2021Updated 4 years ago
- ☆11May 7, 2022Updated 3 years ago
- Bangla TTS Inference pipeline using Vit TTS☆13Mar 24, 2024Updated last year
- CSE476-Machine-Learning-Lab☆17Jul 1, 2023Updated 2 years ago
- A massively multilingual corpus and pretrained model for IGT☆14Feb 21, 2026Updated 2 weeks ago
- An open-source initiative to transcribe Silôṭi Nagri-Bānglā, and vice-versa. It's still in Alpha mode. See the demo:☆12Apr 9, 2021Updated 4 years ago
- Build a Conversational AI System that can answer questions by retrieving the answers from a document.☆11Feb 23, 2024Updated 2 years ago
- A Benchmark Corpus for Low-Resource Cantonese Punctuation Restoration from Speech Transcripts☆16Dec 3, 2024Updated last year
- Python library for automated phone call testing using PJSIP☆10Aug 24, 2017Updated 8 years ago
- A trained model of YOLOv8 which will detect Fight or Violence and NonViolence in videos☆13Sep 20, 2024Updated last year
- Sync Lip in Unity by Wav2Lip☆11Jan 14, 2021Updated 5 years ago
- NAR-BERT-ASR☆10Sep 27, 2021Updated 4 years ago
- C++17 URL Parser (RFC 3986 compliant)☆11Jan 21, 2022Updated 4 years ago
- 📜 33 JavaScript concepts every developer should know.☆10Jun 21, 2022Updated 3 years ago
- Official implementation of SBNet as described in "Single-branch Network for Multimodal Training".☆12Aug 28, 2023Updated 2 years ago
- Gradio_demo.py with Blinking on Still Mode Video Creation☆12Jun 21, 2023Updated 2 years ago
- Spatial Voice Conversion: Voice Conversion Preserving Spatial Information and Non-target Signals☆18Aug 8, 2024Updated last year
- ☆11Nov 28, 2025Updated 3 months ago
- Showcase of P2P HLS streaming using WebTorrent☆12May 5, 2021Updated 4 years ago