A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.
☆17Dec 15, 2019Updated 6 years ago
Alternatives and similar repositories for awesome-diarization
Users that are interested in awesome-diarization are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Probabilistic Spherical Discriminant Analysis☆12Oct 29, 2022Updated 3 years ago
- Speaker overlap-aware Neural Diarization☆12Feb 13, 2023Updated 3 years ago
- Code for the Paper Speech Recognition and Multi-Speaker Diarization of Long Conversations☆38Jun 12, 2023Updated 3 years ago
- Original implementation of the pooling method introduced in "Speaker embeddings by modeling channel-wise correlations"☆11Sep 20, 2021Updated 4 years ago
- FEERCI: A Package for Fast non-parametric confidence intervals for Equal Error Rates☆12Mar 13, 2024Updated 2 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- ☆59Mar 28, 2025Updated last year
- A simple pyaudio microphone interface☆11Jul 27, 2018Updated 7 years ago
- Software for Decoding of High Order Ambisonics to Irregular Layouts☆13Mar 20, 2014Updated 12 years ago
- Read and write HTK and HTS files from python.☆20Mar 17, 2015Updated 11 years ago
- 🔊 A comprehensive list of open-source datasets for voice and sound computing (95+ datasets).☆2,201Jun 6, 2024Updated 2 years ago
- A repo dedicated to different approaches in building a Persian Generative Chatbot.☆12Sep 7, 2022Updated 3 years ago
- ☆15Sep 19, 2024Updated last year
- A collection of minimal examples for the sparta plug-ins.☆14Jul 12, 2025Updated 11 months ago
- This is a pytorch implementation of StarGAN-VC2.☆13Dec 17, 2019Updated 6 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- ☆10Apr 7, 2022Updated 4 years ago
- ☆13Aug 13, 2023Updated 2 years ago
- ☆160Jan 9, 2023Updated 3 years ago
- list of related work on AI DJ research☆15Apr 4, 2020Updated 6 years ago
- A2B Neural Rendering of Ambisonic Recordings to Binaural☆18Aug 5, 2025Updated 10 months ago
- Simulation environment for sweep-based room impulse response measurements (student project)☆11Jun 10, 2017Updated 9 years ago
- Dual fisheye video stitching in Python3, forked from : https://github.com/cynricfu/dual-fisheye-video-stitching☆13Dec 20, 2018Updated 7 years ago
- Experimental 4th-order ambisonic microphone array for the Insta360 Pro camera☆12May 16, 2024Updated 2 years ago
- Recognizing common speech commands using Keras and Tensorflow.☆10Dec 17, 2018Updated 7 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Python Wrapper of visqol☆11Dec 23, 2024Updated last year
- ☆13Dec 19, 2018Updated 7 years ago
- Python library for encoding and decoding APRS packets supporting RX/TX via APRS-IS or KISS☆13Jun 13, 2022Updated 4 years ago
- Official Implementation of Integrating Physics-Informed Vectors for Improved Wind Speed Forecasting with Neural Networks☆13Mar 24, 2025Updated last year
- This is a project of speech emotion recognition using KERAS based Semi-Generative Adversarial Networks.☆11May 17, 2018Updated 8 years ago
- The repository is created to support a Capstone project on the topic of "Study and Implementation of Sound Source Localization Techniques…☆13Apr 27, 2021Updated 5 years ago
- Reproduction of a paper"Small-footprint keyword spotting using deep neural networks"☆12Mar 11, 2019Updated 7 years ago
- Identifying individual speakers in an audio stream based on the unique characteristics found in individual voices using Python☆18Jun 18, 2023Updated 3 years ago
- Chatbot: https://github.com/ChrisRahme/fyp-chatbot☆10Jun 22, 2021Updated 4 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- ☆15Aug 27, 2020Updated 5 years ago
- A ros package for visualizing a robot face with different facial expressions☆15Sep 5, 2019Updated 6 years ago
- Recursive Partitioning for Structural Equation Models☆21Mar 26, 2026Updated 2 months ago
- Code for scaling Transformers☆26Dec 2, 2020Updated 5 years ago
- A Deep-Learning-Based Chinese Speech Recognition System 基于深度学习的中文语音识别系统☆14Mar 18, 2019Updated 7 years ago
- Paderwasn is a collection of methods for acoustic signal processing in wireless acoustic sensor networks (WASNs).☆19May 8, 2025Updated last year
- ☆20Nov 22, 2020Updated 5 years ago