A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.
☆17Dec 15, 2019Updated 6 years ago
Alternatives and similar repositories for awesome-diarization
Users that are interested in awesome-diarization are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Probabilistic Spherical Discriminant Analysis☆12Oct 29, 2022Updated 3 years ago
- Speaker overlap-aware Neural Diarization☆12Feb 13, 2023Updated 3 years ago
- Code for the Paper Speech Recognition and Multi-Speaker Diarization of Long Conversations☆38Jun 12, 2023Updated 2 years ago
- Original implementation of the pooling method introduced in "Speaker embeddings by modeling channel-wise correlations"☆11Sep 20, 2021Updated 4 years ago
- FEERCI: A Package for Fast non-parametric confidence intervals for Equal Error Rates☆12Mar 13, 2024Updated 2 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- ☆59Mar 28, 2025Updated last year
- Bi-encoder entity linking architecture☆52Sep 10, 2024Updated last year
- Data Dialogue enables natural language querying of databases by integrating LLMs with SQL databases.☆14May 3, 2025Updated last year
- A simple pyaudio microphone interface☆11Jul 27, 2018Updated 7 years ago
- Analyze music to detect beats, and play shuffled songs with beat-matched crossfade. Uses SDL for UI, WaveOut or SDL_audio for playback, …☆13Apr 6, 2025Updated last year
- Software for Decoding of High Order Ambisonics to Irregular Layouts☆12Mar 20, 2014Updated 12 years ago
- A Deep Convolutional Neural Network (DCNN) designed for the task of localizing human speech to 168 location classes using binaural microp…☆10Dec 16, 2017Updated 8 years ago
- Read and write HTK and HTS files from python.☆20Mar 17, 2015Updated 11 years ago
- A repo dedicated to different approaches in building a Persian Generative Chatbot.☆12Sep 7, 2022Updated 3 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- 🔊 A comprehensive list of open-source datasets for voice and sound computing (95+ datasets).☆2,166Jun 6, 2024Updated last year
- Radam+lookahead implemented by tensorflow☆11Oct 14, 2019Updated 6 years ago
- A collection of minimal examples for the sparta plug-ins.☆13Jul 12, 2025Updated 9 months ago
- This is a pytorch implementation of StarGAN-VC2.☆13Dec 17, 2019Updated 6 years ago
- ☆10Apr 7, 2022Updated 4 years ago
- ☆13Aug 13, 2023Updated 2 years ago
- list of related work on AI DJ research☆15Apr 4, 2020Updated 6 years ago
- A2B Neural Rendering of Ambisonic Recordings to Binaural☆19Aug 5, 2025Updated 9 months ago
- Simulation environment for sweep-based room impulse response measurements (student project)☆11Jun 10, 2017Updated 8 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Dual fisheye video stitching in Python3, forked from : https://github.com/cynricfu/dual-fisheye-video-stitching☆12Dec 20, 2018Updated 7 years ago
- A Rasa NLU component for composite entities.☆27May 5, 2022Updated 4 years ago
- Recurrent neural network for audio noise reduction☆12Aug 18, 2022Updated 3 years ago
- Experimental 4th-order ambisonic microphone array for the Insta360 Pro camera☆12May 16, 2024Updated last year
- Recognizing common speech commands using Keras and Tensorflow.☆10Dec 17, 2018Updated 7 years ago
- Repo for the FB AI Speech team.☆25Aug 24, 2021Updated 4 years ago
- ☆15Dec 25, 2016Updated 9 years ago
- An API wrapper for snpedia.com in Python/Flask.☆26Feb 5, 2014Updated 12 years ago
- This repository contains Google Collaboratory Notebooks for Deep Learning and Computer Vision Projects☆13Sep 29, 2021Updated 4 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Python library for encoding and decoding APRS packets supporting RX/TX via APRS-IS or KISS☆13Jun 13, 2022Updated 3 years ago
- The world's fastest Python package for calculating integrated loudness (LUFS) from audio data as NumPy arrays☆26Dec 26, 2025Updated 4 months ago
- Domoticz Plugin for controlling the ESP Milight Hib☆10Sep 8, 2021Updated 4 years ago
- NLPND Lab -- creating an Alexa Skill for history facts☆19Jul 6, 2022Updated 3 years ago
- Official Implementation of Integrating Physics-Informed Vectors for Improved Wind Speed Forecasting with Neural Networks☆12Mar 24, 2025Updated last year
- This is a project of speech emotion recognition using KERAS based Semi-Generative Adversarial Networks.☆11May 17, 2018Updated 7 years ago
- HRTF data preparation for machine learning by finding common measurement angles☆12May 14, 2019Updated 6 years ago