Lhx94As / E2E-language-diarizationView external linksLinks
Source code of paper <End-to-End Language Diarization for Bilingual Code-switching Speech>
☆19Jan 23, 2022Updated 4 years ago
Alternatives and similar repositories for E2E-language-diarization
Users that are interested in E2E-language-diarization are comparing it to the libraries listed below
Sorting:
- ☆14Jun 12, 2015Updated 10 years ago
- SChunk-Encoder (Transformer or Conformer) for streaming E2E ASR☆11Oct 21, 2022Updated 3 years ago
- Unsupervised speech activity detection system.☆11Jul 2, 2018Updated 7 years ago
- This is an extension of kaldi speech recognition software which allows to perform decoding of speech with hybrid word and phoneme graphs.…☆11Feb 4, 2020Updated 6 years ago
- Enable RNNLM lattice rescoring with Pytorch [kaldi]☆12Jun 5, 2020Updated 5 years ago
- steps to perform text-based speaker diarization with kaldi toolkit☆12Nov 2, 2018Updated 7 years ago
- Implementation of different noise embeddings for noise aware training of Kaldi acoustic models.☆13Feb 13, 2021Updated 5 years ago
- Syllable Segmentation and Cross-Lingual Generalization in a Visually Grounded, Self-Supervised Speech Model☆34Aug 27, 2023Updated 2 years ago
- Unsupervised word segmentation and clustering of speech☆13Feb 17, 2017Updated 8 years ago
- Source code and speech samples for the DSU-AVO paper accepted to INTERSPEECH 2023☆12May 13, 2024Updated last year
- Forced alignment decoder for Whisper.☆14Mar 13, 2024Updated last year
- ☆13Mar 25, 2021Updated 4 years ago
- Official implementation of the paper "Distilling a Pretrained Language Model to a Multilingual ASR Model" (Interspeech 2022)☆12Mar 12, 2024Updated last year
- ☆30Jan 22, 2026Updated 3 weeks ago
- PolEval 2021 Task 1☆15Jun 28, 2022Updated 3 years ago
- This repository☆30Nov 13, 2022Updated 3 years ago
- ☆14Feb 9, 2023Updated 3 years ago
- ☆12Jun 10, 2021Updated 4 years ago
- Word Error Rate Estimation☆16Aug 25, 2020Updated 5 years ago
- ☆14Aug 1, 2025Updated 6 months ago
- Crawling and creating a German language model resource☆18Aug 23, 2022Updated 3 years ago
- ☆16Oct 16, 2018Updated 7 years ago
- Adnabod lleferydd Cymraeg i'r Gymraeg gyda HuggingFace // Speech Recognition for Welsh with HuggingFace☆13Nov 29, 2022Updated 3 years ago
- ☆18Mar 13, 2024Updated last year
- Example workflow for our data-centric speech benchmark☆17Jul 6, 2023Updated 2 years ago
- ☆17Jun 30, 2020Updated 5 years ago
- ☆14Jul 11, 2022Updated 3 years ago
- A fork of Idiap Research Institute's DiarTk diarization toolkit☆16Feb 20, 2016Updated 9 years ago
- ☆17Mar 1, 2024Updated last year
- Sisyphus recipies for ASR☆18Updated this week
- Consistent dictionary learning algorithm for signal declipping (Python code)☆20Oct 24, 2018Updated 7 years ago
- Artie Bias Corpus: an audio corpus + code for detecting demographic bias☆20Jul 21, 2020Updated 5 years ago
- Estimating the Age, Height, and Gender of a speaker with their speech signal.☆14Sep 19, 2022Updated 3 years ago
- ☆16Mar 7, 2019Updated 6 years ago
- Clustering-based methods for overlapping diarization☆82Jan 12, 2024Updated 2 years ago
- wake word spotting with kaldi☆19Dec 3, 2020Updated 5 years ago
- Parallelized automatic corpus collection for ASR. Forked from https://github.com/EgorLakomkin/KTSpeechCrawler☆23Mar 21, 2021Updated 4 years ago
- This repository is the code and data for DialMed: A Dataset for Dialogue-based Medication Recommendation, COLING 2022.☆23Oct 26, 2022Updated 3 years ago
- Implementation of the DIVA model of speech acquisition and production using PyTorch☆21Jan 18, 2023Updated 3 years ago