cadia-lvl/kaldi-speaker-diarization

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/cadia-lvl/kaldi-speaker-diarization)

cadia-lvl / kaldi-speaker-diarization

This repository creates speaker diarization recipes to be used within the egs folder of kaldi.

☆17

Alternatives and similar repositories for kaldi-speaker-diarization

Users that are interested in kaldi-speaker-diarization are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

scarletcho / prep4kaldi
View on GitHub
Data preparation code for building Kaldi ASR system
☆14Mar 18, 2017Updated 9 years ago
jtkim-kaist / end-point-detection
View on GitHub
☆10Sep 19, 2018Updated 7 years ago
atlijas / icelandic-stop-words
View on GitHub
☆14Mar 15, 2024Updated 2 years ago
cadia-lvl / samromur-asr
View on GitHub
Automatic Speech Recognition (ASR) system for the Samrómur speech corpus using Kaldi
☆12Sep 30, 2022Updated 3 years ago
cadia-lvl / icelandic-NLP-resources
View on GitHub
Overview of Icelandic NLP resources at a glance
☆18Jun 20, 2024Updated 2 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
cadia-lvl / ice-asr
View on GitHub
An automatic speech recognition environment for Icelandic based on Kaldi
☆14Oct 12, 2017Updated 8 years ago
idnavid / speech_activity_detection
View on GitHub
Unsupervised speech activity detection system.
☆11Jul 2, 2018Updated 8 years ago
srinivr / kaldi-long-audio-alignment
View on GitHub
Long audio alignment using Kaldi
☆23Apr 22, 2021Updated 5 years ago
falabrasil / ufpalign
View on GitHub
👄🇧🇷 Alinhamento fonético forçado em Português Brasileiro
☆13Jul 18, 2025Updated last year
py-lidbox / lidbox
View on GitHub
End-to-end spoken language identification out of the box.
☆48Dec 13, 2020Updated 5 years ago
PedroEstevesPT / kaldi_toy_example
View on GitHub
Toy example to illustrate how to use kaldi recipes.
☆13Mar 11, 2021Updated 5 years ago
sveinbjornt / iceaddr
View on GitHub
Python package to look up information about Icelandic street addresses, postcodes and placenames. Icelandic geocoding and reverse geocodi…
☆29Jul 13, 2026Updated last week
LCF2764 / autoKWS2021_1st_solution
View on GitHub
Auto-KWS 2021 Challenge 1st place solution.
☆11Jul 20, 2021Updated 5 years ago
isca-sig-rosp / ISCA-SIG-RoSP
View on GitHub
Web page for ISCA Special Interest Group: Robust Speech Processing (RoSP)
☆11Dec 4, 2023Updated 2 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
qcri / ArabicASRChallenge2016
View on GitHub
This repository
☆32Nov 13, 2022Updated 3 years ago
liuhao-lh / SMD
View on GitHub
Pytorch implementation of 'Improving Self-supervised Lightweight Model Learning via Hard-aware Metric Distillation. In ECCV 2022'
☆11Mar 22, 2023Updated 3 years ago
rhasspy / phonetisaurus-pypi
View on GitHub
Python wrapper for phonetisaurus grapheme to phoneme tool
☆12Mar 11, 2021Updated 5 years ago
liutaocode / DiarizationVisualization
View on GitHub
Visualization tools for audio-only and multi-modal speaker diarization dataset
☆13Oct 27, 2023Updated 2 years ago
cadia-lvl / punctuation-prediction
View on GitHub
Support tools for punctuation and boundary detection for ASR output.
☆55Dec 8, 2022Updated 3 years ago
desh2608 / spyder
View on GitHub
Simple Python package for fast DER computation
☆35Jun 29, 2023Updated 3 years ago
pearapple123 / rime-hoisanva
View on GitHub
A RIME IME for Taishanese
☆11Aug 3, 2023Updated 2 years ago
guttih / DisplayMenu
View on GitHub
A library to create a menu on a LCD color display
☆11Aug 22, 2021Updated 4 years ago
SinuXVR / xDuoo-X3II-custom-firmware
View on GitHub
☆16Mar 23, 2021Updated 5 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
rlucioni / recipes
View on GitHub
Collection of recipes adapted from books, shows, the Internet, and more
☆12Updated this week
juanmc2005 / rttm-viewer
View on GitHub
Application for viewing Rich Transcription Time Marked (RTTM) files in an interactive way
☆48Apr 19, 2023Updated 3 years ago
finos / greenkey-asrtoolkit
View on GitHub
A collection of useful tools for handling speech recognition data
☆30Nov 28, 2022Updated 3 years ago
charlesliucn / LanMIT
View on GitHub
📖 LanMIT: A Toolkit for Improving Language Models in Low-resourced Speech Recognition based on Kaldi.
☆22Jul 12, 2019Updated 7 years ago
cvqluu / nn-similarity-diarization
View on GitHub
Neural network based similarity scoring for diarization (pytorch implementation of "LSTM based Similarity Measurement with Spectral Clust…
☆43Oct 21, 2020Updated 5 years ago
Open-Speech-EkStep / ULCA-asr-dataset-corpus
View on GitHub
☆50Nov 23, 2022Updated 3 years ago
hwanyyy / preprocessing-of-speech
View on GitHub
VAD + resampling | High resolution spectrogram
☆14Nov 29, 2022Updated 3 years ago
triplet02 / KoNPron
View on GitHub
Convert Numerical Representations to Korean Pronunciation
☆14Apr 20, 2020Updated 6 years ago
bekirbakar / replay-attack-detection
View on GitHub
Deep learning-based audio spoofing attack detection experiments for speaker verification.
☆14Apr 20, 2023Updated 3 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
hlt-bme-hu / hunspeech
View on GitHub
☆14Jan 24, 2017Updated 9 years ago
hirofumi0810 / asr_preprocessing
View on GitHub
Python implementation of pre-processing for End-to-End speech recognition
☆70Feb 19, 2018Updated 8 years ago
rungjoo / Emotion_not_One
View on GitHub
The Emotion is Not One-hot Encoding: Learning with Grayscale Label for Emotion Recognition in Conversation (INTERSPEECH 2022)
☆15Oct 19, 2022Updated 3 years ago
PiSchool / spoken-language-id
View on GitHub
Spoken Language Identification from Short Utterances
☆13Jul 6, 2022Updated 4 years ago
HaukurPall / ruv_dl
View on GitHub
RÚV-DL (ruv-dl) is terminal line client for downloading content from RÚV
☆10Dec 16, 2025Updated 7 months ago
takumakanari / japanese-numbers-python
View on GitHub
A parser for Japanese number (Kanji, arabic) in the natural language.
☆21Apr 4, 2020Updated 6 years ago
pushitchaudhary / PushitChaudhary
View on GitHub
☆12Sep 4, 2024Updated last year