adamcsvarga/speaker-clustering

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/adamcsvarga/speaker-clustering)

adamcsvarga / speaker-clustering

Unsupervised Speaker Clustering & Speaker Recognition

☆13

Alternatives and similar repositories for speaker-clustering

Users that are interested in speaker-clustering are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

AKBoles / Deep-Learning-Speech-Recognition
View on GitHub
Project to learn about speech recognition - both Speaker Diarization and other Speech Recognition applications.
☆50Feb 1, 2017Updated 9 years ago
ksingla025 / pyAudioAnalysis3
View on GitHub
python3 version of pyaudioanalysis
☆19Jan 19, 2019Updated 7 years ago
JRMeyer / speakerID-challenge
View on GitHub
A recipe for creating a Speaker Identification system built on Kaldi.
☆15Jan 2, 2020Updated 6 years ago
deezer / interpretable_nn_attribution
View on GitHub
Source code from our RecSys 2020 paper: "Making neural network interpretable with attribution: application to implicit signals prediction…
☆14Oct 2, 2020Updated 5 years ago
simonhughes22 / PythonNlpResearch
View on GitHub
☆14Dec 7, 2022Updated 3 years ago
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
HyeonwooNoh / VQA-Transfer-ExternalData
View on GitHub
Transfer Learning via Unsupervised Task Discovery for Visual Question Answering
☆19Apr 8, 2019Updated 7 years ago
charlesliucn / LanMIT
View on GitHub
📖 LanMIT: A Toolkit for Improving Language Models in Low-resourced Speech Recognition based on Kaldi.
☆22Jul 12, 2019Updated 7 years ago
michaelbironneau / rnn-punctuation
View on GitHub
RNN model to punctuate degraded text with no punctuation, and an application that combines it with Watson TTS for automated transcription…
☆10Apr 9, 2017Updated 9 years ago
workmanjack / lyric-mood-classification
View on GitHub
UC Berkeley Masters of Information & Data Science | W266 Natural Language Processing with Deep Learning Group Project | Team: Cyprian Gas…
☆16Dec 8, 2022Updated 3 years ago
fkhannouf / BGUI
View on GitHub
BGUI stands for BOOPSI Graphical User Interface. BGUI is free GUI toolkit for the Amiga OS.
☆11Apr 29, 2025Updated last year
artie-inc / artie-bias-corpus
View on GitHub
Artie Bias Corpus: an audio corpus + code for detecting demographic bias
☆20Jul 21, 2020Updated 6 years ago
StevenLOL / LIUM
View on GitHub
Scripts for LIUM SpkDiarization tools
☆31Aug 17, 2017Updated 8 years ago
luan78zaoha / kaldi-timit-sre-ivector
View on GitHub
Develop speaker recognition model based on i-vector using TIMIT database
☆16Jul 4, 2019Updated 7 years ago
AccentDB / code
View on GitHub
Code for AccentDB.
☆24May 28, 2021Updated 5 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
INGEOTEC / b4msa
View on GitHub
A Baseline for Multilingual Sentiment Analysis
☆36Oct 17, 2024Updated last year
jeroenvansaane / Deep-Learning-Based-Intrusion-Detection-NSL-KDD
View on GitHub
Deep Learning based Intrusion Detection on NSL-KDD Dataset
☆14Aug 24, 2019Updated 6 years ago
zenreach / docker-kafka-connect
View on GitHub
Kafka Connect Docker Image with Prometheus Metrics
☆12May 1, 2020Updated 6 years ago
parthe / Speaker-Diarization-toolkit-MATLAB
View on GitHub
An end-to-end MATLAB toolkit for completely unsupervised Speaker Diarization using state-of-the-art algorithms.
☆15Dec 22, 2015Updated 10 years ago
emoon / AmigaHunkParser
View on GitHub
Parser written in C for Amiga Hunk (executable) files
☆16Mar 2, 2026Updated 4 months ago
ekapolc / ASR_classproject
View on GitHub
Some tutorials used for ASR class
☆31Jul 20, 2021Updated 5 years ago
rmcpantoja / piper
View on GitHub
A fast, local neural text to speech system
☆18Feb 24, 2025Updated last year
elsheikh21 / malware-analysis
View on GitHub
using Drebin dataset to distinguish between malwares and not malwares
☆13Jan 5, 2019Updated 7 years ago
knub / sentence-boundary-detection-nn
View on GitHub
Sentence Boundary Detection using Deep Neural Networks.
☆20Oct 24, 2016Updated 9 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
voberoi / voice-search-with-whisper-duckdb-and-metaphone
View on GitHub
This repository is a voice search demo using OpenAI Whisper, DuckDB, and the Metaphone algorithm. The associate blog post is here: https:…
☆13May 15, 2024Updated 2 years ago
ikks / libreoffice-stable-diffusion
View on GitHub
LibreOffice extension to generate AI images powered by AIHorde
☆15May 4, 2026Updated 2 months ago
rhasspy / ipa2kaldi
View on GitHub
Tool for creating Kaldi nnet3 recipes using the International Phonetic Alphabet (IPA)
☆10Jun 2, 2021Updated 5 years ago
ZhaoZeyu1995 / BenNevis
View on GitHub
A Diffrentiable WFST-based End-to-End Automatic Speech Recognition toollkit with flexible topology support
☆12Feb 15, 2026Updated 5 months ago
sarang0909 / faq_chatbot
View on GitHub
COVID-19 FAQ chatbot in python along with user interfce
☆10Feb 2, 2024Updated 2 years ago
ikks / gimp-stable-diffusion
View on GitHub
Gimp3 plugin to create images with AI powered by AIHorde
☆16May 8, 2026Updated 2 months ago
Speech-Lab-IITM / English_ASR_Challenge
View on GitHub
English ASR Challenge organized by Speech Lab, IIT Madras
☆10Feb 3, 2021Updated 5 years ago
BiometricVox / DAE_SpeakerID
View on GitHub
Denoising autoencoders for speaker identification on MCE 2018 challenge
☆12Nov 8, 2018Updated 7 years ago
desh2608 / kaldi-noise-vectors
View on GitHub
Implementation of different noise embeddings for noise aware training of Kaldi acoustic models.
☆13Feb 13, 2021Updated 5 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
pquaid / pcq
View on GitHub
PCQ Pascal compiler for the Amiga
☆19Jun 14, 2015Updated 11 years ago
SpeechColab / PySpeechColab
View on GitHub
A library of speech gadgets.
☆15Oct 15, 2022Updated 3 years ago
jerrygood0703 / noise_adaptive_DAT_SE
View on GitHub
Noise Adaptive Speech Enhancement using Domain Adversarial Training
☆23Jul 25, 2019Updated 6 years ago
aalto-speech / speaker-diarization
View on GitHub
Speaker diarization scripts, based on AaltoASR
☆191Jan 3, 2019Updated 7 years ago
TehreemFarooqi / Preparing-a-speech-recognition-dataset-using-YouTube-videos
View on GitHub
Using YouTube to prepare a speech recognition dataset for any language
☆10Mar 30, 2021Updated 5 years ago
kabachuha / nanoGPKANT
View on GitHub
Testing KAN-based text generation GPT models
☆19May 6, 2024Updated 2 years ago
fedderrico / ubm_map_diarization
View on GitHub
Speaker diarization with GMM-UBM and MAP Adaptation
☆31Sep 13, 2018Updated 7 years ago