A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.
☆17Dec 15, 2019Updated 6 years ago
Alternatives and similar repositories for awesome-diarization
Users that are interested in awesome-diarization are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Speaker overlap-aware Neural Diarization☆12Feb 13, 2023Updated 3 years ago
- Code for the Paper Speech Recognition and Multi-Speaker Diarization of Long Conversations☆38Jun 12, 2023Updated 2 years ago
- Original implementation of the pooling method introduced in "Speaker embeddings by modeling channel-wise correlations"☆11Sep 20, 2021Updated 4 years ago
- ☆59Mar 28, 2025Updated last year
- Bi-encoder entity linking architecture☆52Sep 10, 2024Updated last year
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- Data Dialogue enables natural language querying of databases by integrating LLMs with SQL databases.☆14May 3, 2025Updated 10 months ago
- A simple pyaudio microphone interface☆11Jul 27, 2018Updated 7 years ago
- Analyze music to detect beats, and play shuffled songs with beat-matched crossfade. Uses SDL for UI, WaveOut or SDL_audio for playback, …☆12Apr 6, 2025Updated 11 months ago
- Software for Decoding of High Order Ambisonics to Irregular Layouts☆12Mar 20, 2014Updated 12 years ago
- Read and write HTK and HTS files from python.☆20Mar 17, 2015Updated 11 years ago
- 🔊 A comprehensive list of open-source datasets for voice and sound computing (95+ datasets).☆2,141Jun 6, 2024Updated last year
- A repo dedicated to different approaches in building a Persian Generative Chatbot.☆12Sep 7, 2022Updated 3 years ago
- ☆10Apr 7, 2022Updated 3 years ago
- list of related work on AI DJ research☆15Apr 4, 2020Updated 5 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- A2B Neural Rendering of Ambisonic Recordings to Binaural☆18Aug 5, 2025Updated 7 months ago
- Simulation environment for sweep-based room impulse response measurements (student project)☆11Jun 10, 2017Updated 8 years ago
- A Rasa NLU component for composite entities.☆27May 5, 2022Updated 3 years ago
- Recurrent neural network for audio noise reduction☆12Aug 18, 2022Updated 3 years ago
- Experimental 4th-order ambisonic microphone array for the Insta360 Pro camera☆12May 16, 2024Updated last year
- ☆15Dec 25, 2016Updated 9 years ago
- Repo for the FB AI Speech team.☆25Aug 24, 2021Updated 4 years ago
- Persian Grapheme To Phoneme with Transformer in Pytorch☆11Sep 21, 2023Updated 2 years ago
- An API wrapper for snpedia.com in Python/Flask.☆26Feb 5, 2014Updated 12 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- This repository contains Google Collaboratory Notebooks for Deep Learning and Computer Vision Projects☆13Sep 29, 2021Updated 4 years ago
- ☆13Dec 19, 2018Updated 7 years ago
- Annotated Enron Subject Line Corpus (AESLC)☆25Feb 2, 2023Updated 3 years ago
- The world's fastest Python package for calculating integrated loudness (LUFS) from audio data as NumPy arrays☆25Dec 26, 2025Updated 3 months ago
- A very basic demonstration connecting speech recognition and text-to-speech☆20May 3, 2020Updated 5 years ago
- Official Implementation of Integrating Physics-Informed Vectors for Improved Wind Speed Forecasting with Neural Networks☆12Mar 24, 2025Updated last year
- This is a project of speech emotion recognition using KERAS based Semi-Generative Adversarial Networks.☆11May 17, 2018Updated 7 years ago
- HRTF data preparation for machine learning by finding common measurement angles☆12May 14, 2019Updated 6 years ago
- a conversion of Dadegan corpus (first Persian dependency corpus) to the universal dependency version☆15Nov 26, 2025Updated 4 months ago
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- The repository is created to support a Capstone project on the topic of "Study and Implementation of Sound Source Localization Techniques…☆13Apr 27, 2021Updated 4 years ago
- Reproduction of a paper"Small-footprint keyword spotting using deep neural networks"☆12Mar 11, 2019Updated 7 years ago
- Identifying individual speakers in an audio stream based on the unique characteristics found in individual voices using Python☆18Jun 18, 2023Updated 2 years ago
- Auto DJ script for the Mixxx DJ software☆14Feb 8, 2018Updated 8 years ago
- Chatbot: https://github.com/ChrisRahme/fyp-chatbot☆10Jun 22, 2021Updated 4 years ago
- ☆15Aug 27, 2020Updated 5 years ago
- A ros package for visualizing a robot face with different facial expressions☆15Sep 5, 2019Updated 6 years ago