calclavia/tal-asrd

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/calclavia/tal-asrd)

calclavia / tal-asrd

Code for the Paper Speech Recognition and Multi-Speaker Diarization of Long Conversations

☆39

Alternatives and similar repositories for tal-asrd

Users that are interested in tal-asrd are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

asteroid-team / pytorch-pit
View on GitHub
Permutation invariant training in PyTorch
☆13Oct 2, 2020Updated 5 years ago
BUTSpeechFIT / EEND_dataprep
View on GitHub
☆59Mar 28, 2025Updated last year
speechpro / mixup
View on GitHub
☆24Mar 13, 2020Updated 6 years ago
desh2608 / pytorch-tdnn
View on GitHub
Pypi installable TDNN and TDNN-F layers for PyTorch based acoustic model training
☆41Dec 18, 2020Updated 5 years ago
popcornell / MicRank
View on GitHub
MicRank is a Learning to Rank neural channel selection framework where a DNN is trained to rank microphone channels.
☆22Apr 8, 2021Updated 5 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
desh2608 / dover-lap
View on GitHub
Python package for combining diarization system outputs.
☆94Oct 12, 2023Updated 2 years ago
BUTSpeechFIT / AMI-diarization-setup
View on GitHub
☆54Oct 17, 2023Updated 2 years ago
desh2608 / diarizer
View on GitHub
Clustering-based methods for overlapping diarization
☆84Jan 12, 2024Updated 2 years ago
qiujiali / lattice-rescore
View on GitHub
☆16Jun 13, 2022Updated 4 years ago
cyfer0618 / kaldi-pytorch-rnnlm
View on GitHub
Enable RNNLM lattice rescoring with Pytorch [kaldi]
☆12Jun 5, 2020Updated 6 years ago
feerci / feerci
View on GitHub
FEERCI: A Package for Fast non-parametric confidence intervals for Equal Error Rates
☆12Mar 13, 2024Updated 2 years ago
asteroid-team / Libri_VAD
View on GitHub
Script to generate VAD dataset used in Asteroid recipe
☆21Sep 30, 2021Updated 4 years ago
hitachi-speech / EEND
View on GitHub
End-to-End Neural Diarization
☆435Aug 30, 2021Updated 4 years ago
jim-schwoebel / awesome-diarization
View on GitHub
A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.
☆17Dec 15, 2019Updated 6 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
desh2608 / kaldi-noise-vectors
View on GitHub
Implementation of different noise embeddings for noise aware training of Kaldi acoustic models.
☆13Feb 13, 2021Updated 5 years ago
HuangZiliAndy / RPNSD
View on GitHub
PyTorch implementation of RPNSD
☆60Jun 17, 2024Updated 2 years ago
alumae / online_speaker_change_detector
View on GitHub
Online streaming speaker change detection model in Pytorch
☆44Apr 14, 2023Updated 3 years ago
kate-egorova / ASR-hybrid-decoding
View on GitHub
This is an extension of kaldi speech recognition software which allows to perform decoding of speech with hybrid word and phoneme graphs.…
☆11Feb 4, 2020Updated 6 years ago
fgnt / mms_msg
View on GitHub
Multipurpose Multi Speaker Mixture Signal Generator
☆46Feb 6, 2025Updated last year
isca-sig-rosp / ISCA-SIG-RoSP
View on GitHub
Web page for ISCA Special Interest Group: Robust Speech Processing (RoSP)
☆11Dec 4, 2023Updated 2 years ago
ICASSP2021-tutorial9 / Distant_conversational_ASR_and_analysis
View on GitHub
☆12Jun 10, 2021Updated 5 years ago
csteinmetz1 / pyloudnorm-eval
View on GitHub
Evaluation of a number of loudness meter implementations
☆13Aug 28, 2021Updated 4 years ago
csukuangfj / kaldilm
View on GitHub
Python wrapper for kaldi's arpa2fst
☆38Aug 27, 2025Updated 11 months ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
yinruiqing / diarization_with_neural_approach
View on GitHub
☆14Aug 9, 2018Updated 7 years ago
assafmu / wav2letter_pytorch
View on GitHub
An implementation of the Wav2Letter Speech-to-Text model using PyTorch.
☆14Mar 8, 2023Updated 3 years ago
schufo / tisms
View on GitHub
This is the code of the ICASSP 2020 paper "Joint phoneme alignment and text-informed speech separation on highly corrupted speech"
☆16Apr 8, 2024Updated 2 years ago
JuanFMontesinos / torch_mir_eval
View on GitHub
Backpropagable pytorch implementation of https://craffel.github.io/mir_eval/.
☆35Jul 8, 2024Updated 2 years ago
chimechallenge / chime-utils
View on GitHub
Scripts for data generation, scoring and data manifest preparation for CHiME-8 DASR task.
☆26Feb 25, 2025Updated last year
m-wiesner / nnet_pytorch
View on GitHub
Kaldi style neural network training in pytorch for use in place of nnet3 in Kaldi.
☆26Jul 25, 2024Updated 2 years ago
etzinis / biased_separation
View on GitHub
Code for the paper: Unified Gradient Reweighting for Model Biasing with Applications to Source Separation
☆14Nov 16, 2020Updated 5 years ago
dogancan / expected-edit-distance
View on GitHub
Expected edit distance implementation using OpenFst tools
☆11May 13, 2015Updated 11 years ago
wq2012 / SimpleDER
View on GitHub
A lightweight library to compute Diarization Error Rate (DER).
☆62Jan 14, 2026Updated 6 months ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
jtrmal / kaldi2020
View on GitHub
☆27Jan 19, 2021Updated 5 years ago
asappresearch / multistream-cnn
View on GitHub
Multistream CNN for Robust Acoustic Modeling
☆40Jun 17, 2021Updated 5 years ago
nttcslab-sp / EEND-vector-clustering
View on GitHub
This repository contains a set of codes to run (i.e., train, perform inference with, evaluate) a diarization method called EEND-vector-cl…
☆81Oct 18, 2022Updated 3 years ago
asteroid-team / asteroid-filterbanks
View on GitHub
Asteroid's filterbanks
☆90Jan 12, 2025Updated last year
vishalshar / SpeakerDiarization_RNN_CNN_LSTM
View on GitHub
Speaker Diarization is the problem of separating speakers in an audio. There could be any number of speakers and final result should stat…
☆64Jan 8, 2021Updated 5 years ago
egorsmkv / asr-corpus-creator
View on GitHub
This app is intended to automatically create a corpus for ASR systems using pseudo-labeling.
☆27Feb 15, 2024Updated 2 years ago
revdotcom / words2num
View on GitHub
Convert words to numbers
☆21Apr 13, 2022Updated 4 years ago