jim-schwoebel/awesome-diarization

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/jim-schwoebel/awesome-diarization)

jim-schwoebel / awesome-diarization

A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.

☆17

Alternatives and similar repositories for awesome-diarization

Users that are interested in awesome-diarization are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

bsxfan / PSDA
View on GitHub
Probabilistic Spherical Discriminant Analysis
☆12Oct 29, 2022Updated 3 years ago
calclavia / tal-asrd
View on GitHub
Code for the Paper Speech Recognition and Multi-Speaker Diarization of Long Conversations
☆39Jun 12, 2023Updated 3 years ago
tstafylakis / Speaker-Embeddings-Correlation-Pooling
View on GitHub
Original implementation of the pooling method introduced in "Speaker embeddings by modeling channel-wise correlations"
☆11Sep 20, 2021Updated 4 years ago
feerci / feerci
View on GitHub
FEERCI: A Package for Fast non-parametric confidence intervals for Equal Error Rates
☆12Mar 13, 2024Updated 2 years ago
BUTSpeechFIT / EEND_dataprep
View on GitHub
☆59Mar 28, 2025Updated last year
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
ggeop / DataDialogueLLM
View on GitHub
Data Dialogue enables natural language querying of databases by integrating LLMs with SQL databases.
☆15May 3, 2025Updated last year
facebookresearch / BELA
View on GitHub
Bi-encoder entity linking architecture
☆52Sep 10, 2024Updated last year
MycroftAI / pylisten
View on GitHub
A simple pyaudio microphone interface
☆11Jul 27, 2018Updated 8 years ago
ghunkins / Binaural-Source-Localization-CNN
View on GitHub
A Deep Convolutional Neural Network (DCNN) designed for the task of localizing human speech to 168 location classes using binaural microp…
☆10Dec 16, 2017Updated 8 years ago
schoobani / persian-generative-chatbot
View on GitHub
A repo dedicated to different approaches in building a Persian Generative Chatbot.
☆12Sep 7, 2022Updated 3 years ago
MattShannon / htk_io
View on GitHub
Read and write HTK and HTS files from python.
☆20Mar 17, 2015Updated 11 years ago
Raptor007 / AutoDJ
View on GitHub
Analyze music to detect beats, and play shuffled songs with beat-matched crossfade. Uses SDL for UI, WaveOut or SDL_audio for playback, …
☆14Apr 6, 2025Updated last year
jim-schwoebel / voice_datasets
View on GitHub
🔊 A comprehensive list of open-source datasets for voice and sound computing (95+ datasets).
☆2,212Jun 6, 2024Updated 2 years ago
jin-woo-lee / nfs-binaural
View on GitHub
☆13Aug 13, 2023Updated 2 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
zyzisyz / mfa_conformer
View on GitHub
☆160Jan 9, 2023Updated 3 years ago
HazouPH / android_vendor_intel_houdini
View on GitHub
☆15Dec 25, 2016Updated 9 years ago
ariacat3366 / pytorch-StarGAN-VC2-implementation
View on GitHub
This is a pytorch implementation of StarGAN-VC2.
☆13Dec 17, 2019Updated 6 years ago
aidanmomo / Speech-Enhancement-Metrics-SNR-SDRi-SISDRi
View on GitHub
☆10Apr 7, 2022Updated 4 years ago
spatialaudio / sweep
View on GitHub
Simulation environment for sweep-based room impulse response measurements (student project)
☆11Jun 10, 2017Updated 9 years ago
kharrigian / mental-health-keywords
View on GitHub
Keywords and phrases that can be used for identifying mental-health-related conversation on Twitter
☆12Jun 18, 2020Updated 6 years ago
suzhengpeng / dual-fisheye-video-stitching-in-Python3
View on GitHub
Dual fisheye video stitching in Python3, forked from : https://github.com/cynricfu/dual-fisheye-video-stitching
☆13Dec 20, 2018Updated 7 years ago
AppliedAcousticsChalmers / ambisonics-for-insta360-pro
View on GitHub
Experimental 4th-order ambisonic microphone array for the Insta360 Pro camera
☆12May 16, 2024Updated 2 years ago
mir-aidj / awesome-aidj
View on GitHub
list of related work on AI DJ research
☆15Apr 4, 2020Updated 6 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
Benjamin-Tsui / HRTF_preprocessing
View on GitHub
HRTF data preparation for machine learning by finding common measurement angles
☆12May 14, 2019Updated 7 years ago
sajadalipour7 / Persian-Grapheme-To-Phoneme-With-Transformer
View on GitHub
Persian Grapheme To Phoneme with Transformer in Pytorch
☆11Sep 21, 2023Updated 2 years ago
lucko515 / Speech-commands-recognition
View on GitHub
Recognizing common speech commands using Keras and Tensorflow.
☆10Dec 17, 2018Updated 7 years ago
vasugupta9 / DeepLearningProjects
View on GitHub
This repository contains Google Collaboratory Notebooks for Deep Learning and Computer Vision Projects
☆13Sep 29, 2021Updated 4 years ago
PranavPutsa1006 / Speaker-Diarization
View on GitHub
Identifying individual speakers in an audio stream based on the unique characteristics found in individual voices using Python
☆18Jun 18, 2023Updated 3 years ago
lisaalaz / satbot
View on GitHub
An empathetic counselling chatbot. Retrieval-based, uses finetuned LMs for emotion identification and to boost empathy, novelty and fluen…
☆18Jun 8, 2023Updated 3 years ago
david8862 / rnnoise
View on GitHub
Recurrent neural network for audio noise reduction
☆12Aug 18, 2022Updated 3 years ago
ChrisRahme / FYP-Webapp
View on GitHub
Chatbot: https://github.com/ChrisRahme/fyp-chatbot
☆10Jun 22, 2021Updated 5 years ago
facebookresearch / fbai-speech
View on GitHub
Repo for the FB AI Speech team.
☆27Aug 24, 2021Updated 4 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
alirezasalemi7 / ARMAN
View on GitHub
ARMAN: Pre-training with Semantically Selecting and Reordering of Sentences for Persian Abstractive Summarization
☆11Oct 3, 2021Updated 4 years ago
keithecarlson / Zero-Shot-Style-Transfer
View on GitHub
☆13Dec 19, 2018Updated 7 years ago
rosinality / melgan-pytorch
View on GitHub
MelGAN and Tacotron 2 in PyTorch
☆11Oct 22, 2019Updated 6 years ago
UniversalDependencies / UD_Persian-PerDT
View on GitHub
a conversion of Dadegan corpus (first Persian dependency corpus) to the universal dependency version
☆14May 6, 2026Updated 2 months ago
python-aprs / aprs3
View on GitHub
Python library for encoding and decoding APRS packets supporting RX/TX via APRS-IS or KISS
☆13Jun 13, 2022Updated 4 years ago
haiciyang / Remixing
View on GitHub
Official repo of ICASSP 2022 paper - Don't Separate, Learn to Remix: End-to-End Neural Remixing with Joint Optimization
☆20Jan 7, 2025Updated last year
yingtaoHuo / wakeUp
View on GitHub
Reproduction of a paper"Small-footprint keyword spotting using deep neural networks"
☆12Mar 11, 2019Updated 7 years ago