A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.
☆17 · Dec 15, 2019 · Updated 6 years ago
Alternatives and similar repositories for awesome-diarization
Users that are interested in awesome-diarization are comparing it to the libraries listed below.
- Original implementation of the pooling method introduced in "Speaker embeddings by modeling channel-wise correlations" ☆11 · Sep 20, 2021 · Updated 4 years ago
- FEERCI: A package for fast non-parametric confidence intervals for Equal Error Rates ☆12 · Mar 13, 2024 · Updated 2 years ago
- ☆59 · Mar 28, 2025 · Updated last year
- Data Dialogue enables natural language querying of databases by integrating LLMs with SQL databases. ☆14 · May 3, 2025 · Updated last year
- A simple PyAudio microphone interface ☆11 · Jul 27, 2018 · Updated 7 years ago
- Analyze music to detect beats, and play shuffled songs with beat-matched crossfade. Uses SDL for UI, WaveOut or SDL_audio for playback, … ☆13 · Apr 6, 2025 · Updated last year
- A Deep Convolutional Neural Network (DCNN) designed for the task of localizing human speech to 168 location classes using binaural microp… ☆10 · Dec 16, 2017 · Updated 8 years ago
- Read and write HTK and HTS files from Python. ☆20 · Mar 17, 2015 · Updated 11 years ago
- 🔊 A comprehensive list of open-source datasets for voice and sound computing (95+ datasets). ☆2,191 · Jun 6, 2024 · Updated last year
- ☆15 · Sep 19, 2024 · Updated last year
- RAdam + Lookahead implemented in TensorFlow ☆11 · Oct 14, 2019 · Updated 6 years ago
- Keywords and phrases that can be used for identifying mental-health-related conversation on Twitter ☆12 · Jun 18, 2020 · Updated 5 years ago
- A collection of minimal examples for the sparta plug-ins. ☆14 · Jul 12, 2025 · Updated 10 months ago
- ☆10 · Apr 7, 2022 · Updated 4 years ago
- ☆13 · Aug 13, 2023 · Updated 2 years ago
- ☆160 · Jan 9, 2023 · Updated 3 years ago
- List of related work on AI DJ research ☆15 · Apr 4, 2020 · Updated 6 years ago
- A2B Neural Rendering of Ambisonic Recordings to Binaural ☆18 · Aug 5, 2025 · Updated 9 months ago
- Simulation environment for sweep-based room impulse response measurements (student project) ☆11 · Jun 10, 2017 · Updated 8 years ago
- Dual fisheye video stitching in Python 3, forked from https://github.com/cynricfu/dual-fisheye-video-stitching ☆13 · Dec 20, 2018 · Updated 7 years ago
- Recurrent neural network for audio noise reduction ☆12 · Aug 18, 2022 · Updated 3 years ago
- Experimental 4th-order ambisonic microphone array for the Insta360 Pro camera ☆12 · May 16, 2024 · Updated 2 years ago
- Recognizing common speech commands using Keras and TensorFlow. ☆10 · Dec 17, 2018 · Updated 7 years ago
- ☆15 · Dec 25, 2016 · Updated 9 years ago
- Persian Grapheme To Phoneme with Transformer in PyTorch ☆11 · Sep 21, 2023 · Updated 2 years ago
- This repository contains Google Colaboratory notebooks for Deep Learning and Computer Vision projects ☆13 · Sep 29, 2021 · Updated 4 years ago
- Python wrapper of visqol ☆11 · Dec 23, 2024 · Updated last year
- ☆13 · Dec 19, 2018 · Updated 7 years ago
- Python library for encoding and decoding APRS packets supporting RX/TX via APRS-IS or KISS ☆13 · Jun 13, 2022 · Updated 3 years ago
- Domoticz Plugin for controlling the ESP Milight Hub ☆10 · Sep 8, 2021 · Updated 4 years ago
- Official Implementation of Integrating Physics-Informed Vectors for Improved Wind Speed Forecasting with Neural Networks ☆12 · Mar 24, 2025 · Updated last year
- A very basic demonstration connecting speech recognition and text-to-speech ☆20 · May 3, 2020 · Updated 6 years ago
- A conversion of the Dadegan corpus (the first Persian dependency corpus) to its Universal Dependencies version ☆14 · May 6, 2026 · Updated 3 weeks ago
- HRTF data preparation for machine learning by finding common measurement angles ☆12 · May 14, 2019 · Updated 7 years ago
- The repository is created to support a Capstone project on the topic of "Study and Implementation of Sound Source Localization Techniques… ☆13 · Apr 27, 2021 · Updated 5 years ago
- Reproduction of the paper "Small-footprint keyword spotting using deep neural networks" ☆12 · Mar 11, 2019 · Updated 7 years ago
- The world's fastest Python package for calculating integrated loudness (LUFS) from audio data as NumPy arrays ☆28 · Dec 26, 2025 · Updated 5 months ago
- Identifying individual speakers in an audio stream based on the unique characteristics found in individual voices using Python ☆18 · Jun 18, 2023 · Updated 2 years ago
- Auto DJ script for the Mixxx DJ software ☆15 · Feb 8, 2018 · Updated 8 years ago