AudioVisual Diarization - Supervised and Unsupervised
☆15 · Nov 22, 2022 · Updated 3 years ago
Alternatives and similar repositories for av_diarization
Users who are interested in av_diarization are comparing it to the libraries listed below.
- A library of speech gadgets. ☆14 · Oct 15, 2022 · Updated 3 years ago
- ☆11 · Nov 28, 2025 · Updated 3 months ago
- Source code for ASRU 2019 paper "Adapting Pretrained Transformer to Lattices for Spoken Language Understanding" ☆10 · Jul 8, 2020 · Updated 5 years ago
- Transformer based ASR Engine. ☆13 · Aug 23, 2021 · Updated 4 years ago
- Code to accompany the paper "Learning Grimaces By Watching TV" and FaceValue dataset ☆12 · Aug 4, 2018 · Updated 7 years ago
- A Pytorch implementation of the paper : SpecAugment++: A Hidden Space Data Augmentation Method for Acoustic Scene Classification ☆34 · Jun 25, 2021 · Updated 4 years ago
- ☆16 · Dec 23, 2021 · Updated 4 years ago
- Repository for sharing the data in the Tamasheq language, one of the target languages for the low-resource speech translation track at IW… ☆18 · Nov 30, 2022 · Updated 3 years ago
- Automatic Speech Recognition at the University of Edinburgh. ☆16 · Mar 14, 2021 · Updated 4 years ago
- Mispronunciation detection code for jingju singing voice ☆20 · Sep 5, 2018 · Updated 7 years ago
- In this repository, I try to combine k2 with speechbrain to decode well and fastly. ☆16 · Jun 17, 2022 · Updated 3 years ago
- Development Toolkit for the VoxCeleb Speaker Recognition Challenge 2021 ☆18 · Jul 21, 2021 · Updated 4 years ago
- Kaldi style neural network training in pytorch for use in place of nnet3 in Kaldi. ☆26 · Jul 25, 2024 · Updated last year
- C++ Implementation of the Information Bottleneck System ☆22 · Jan 9, 2019 · Updated 7 years ago
- NU-Wave 2: A General Neural Audio Upsampling Model for Various Sampling Rates [WIP] ☆25 · Jul 5, 2022 · Updated 3 years ago
- ☆24 · Mar 13, 2020 · Updated 5 years ago
- ☆22 · Mar 22, 2017 · Updated 8 years ago
- This repository contains all the code necessary for running the multilingual distilwhisper from Ferraz et al. 2024 IEEE ICASSP paper. ☆33 · Oct 23, 2025 · Updated 4 months ago
- A bunch of scripts exploiting several tools to perform inverse text normalization (ITN) ☆21 · Sep 27, 2017 · Updated 8 years ago
- Code for reproducing experiments in "Domain-Adversarial Voice Activity Detection" ☆23 · Mar 3, 2020 · Updated 5 years ago
- Decoders from Kaldi using OpenFst ☆34 · Jan 29, 2026 · Updated last month
- Convert words to numbers ☆21 · Apr 13, 2022 · Updated 3 years ago
- Repo for the FB AI Speech team. ☆25 · Aug 24, 2021 · Updated 4 years ago
- TF code for our CVPR2020 paper "Discriminative Multi-modality Speech Recognition" ☆26 · Apr 27, 2022 · Updated 3 years ago
- Baseline kaldi script for UA-SPEECH corpus ☆32 · Oct 16, 2024 · Updated last year
- ☆25 · Mar 12, 2022 · Updated 3 years ago
- Conversion of recurrent neural network language models to weighted finite state transducers ☆58 · Jun 1, 2018 · Updated 7 years ago
- Pytorch implementation of RawNeXt: Speaker verification system for variable-duration utterance with deep layer aggregation and dynamic sc… ☆25 · Jun 22, 2022 · Updated 3 years ago
- ☆64 · May 23, 2022 · Updated 3 years ago
- Framework for Detection Evaluation (F4DE) : set of evaluation tools for detection evaluations and for specific NIST-coordinated evaluatio… ☆25 · Jul 6, 2017 · Updated 8 years ago
- Grapheme-to-Phoneme conversion with Joint-Sequence RnnLMs ☆31 · Dec 15, 2014 · Updated 11 years ago
- CMU multilingual speech repository ☆30 · Apr 15, 2022 · Updated 3 years ago
- ☆32 · Jun 26, 2023 · Updated 2 years ago
- A Tiny Project For ASR model training and Deployment ☆26 · Oct 14, 2022 · Updated 3 years ago
- ☆74 · Apr 4, 2024 · Updated last year
- ☆60 · Sep 26, 2020 · Updated 5 years ago
- Diarization scoring tools. ☆263 · Mar 28, 2023 · Updated 2 years ago
- Python wrappers for Kaldi Levenshtein's distance and alignment code. ☆68 · Jan 5, 2026 · Updated last month
- ☆28 · Dec 22, 2021 · Updated 4 years ago