DSing ASR task: Resources and Baseline for an unaccompanied singing ASR.
☆19Nov 23, 2021Updated 4 years ago
Alternatives and similar repositories for Kaldi-Dsing-task
Users that are interested in Kaldi-Dsing-task are comparing it to the libraries listed below
Sorting:
- This is a subset of the DALI set consisting of 240 polyphonic recordings that is used to benchmark lyrics transcription evaluation.☆12Nov 30, 2021Updated 4 years ago
- This is an extension of kaldi speech recognition software which allows to perform decoding of speech with hybrid word and phoneme graphs.…☆11Feb 4, 2020Updated 6 years ago
- A complete training recipe for kaldi-based Automatic Lyrics Transcription.☆31Nov 30, 2021Updated 4 years ago
- Unsupervised speech activity detection system.☆11Jul 2, 2018Updated 7 years ago
- wake word spotting with kaldi☆19Dec 3, 2020Updated 5 years ago
- Implementation of different noise embeddings for noise aware training of Kaldi acoustic models.☆13Feb 13, 2021Updated 5 years ago
- This repo contains the baseline model recipes and pre-trained model for GramVanni hindi ASR challenge☆15Mar 26, 2022Updated 3 years ago
- An open-source tool for automatic speech recognition ASR quality estimation.☆23Dec 12, 2019Updated 6 years ago
- ☆24Mar 13, 2020Updated 5 years ago
- A bunch of scripts exploiting several tools to perform inverse text normalization (ITN)☆21Sep 27, 2017Updated 8 years ago
- Text normalization scripts from IRISA lab☆14Jun 1, 2018Updated 7 years ago
- ☆10May 15, 2021Updated 4 years ago
- ☆14Jun 12, 2015Updated 10 years ago
- This is a mirror of https://gitlab.com/tiro-is/tiro-speech-core☆15Jun 19, 2023Updated 2 years ago
- Scripts for computing common lyrics-to-audio alignment evaluation metrics. Usable evaluation for any token-based alignment (e.g. if tok…☆18Oct 27, 2020Updated 5 years ago
- DEPRECATED: Jamendo music dataset with time-aligned lyrics for lyrics alignment evaluation☆89Apr 30, 2025Updated 10 months ago
- A fork of Idiap Research Institute's DiarTk diarization toolkit☆16Feb 20, 2016Updated 10 years ago
- ☆17Jun 30, 2020Updated 5 years ago
- ☆17Nov 25, 2019Updated 6 years ago
- In this repository, I try to combine k2 with speechbrain to decode well and fastly.☆16Jun 17, 2022Updated 3 years ago
- SIGMORPHON 2020 Shared Task: Grapheme-to-Phoneme, Unsupervised Induction of Morphology, and Typologically Diverse Morphological Inflectio…☆36Apr 25, 2025Updated 10 months ago
- ☆20Nov 22, 2020Updated 5 years ago
- [ICLR 2022] "Audio Lottery: Speech Recognition Made Ultra-Lightweight, Noise-Robust, and Transferable", by Shaojin Ding, Tianlong Chen, Z…☆32Apr 8, 2022Updated 3 years ago
- Properly handle position-dependent phones in a subword lexicon FST☆31Oct 26, 2020Updated 5 years ago
- BurrMill core☆22Nov 2, 2021Updated 4 years ago
- speech-aligner,是一个从“人声语音”及其“语言文本”,产生音素级别时间对齐标注的工具。speech-aligner, is a tool that generate phoneme-level alignment between human speech an…☆15Dec 19, 2018Updated 7 years ago
- A baseline Automatic Speech Recognition system for Polish based on Kaldi.☆18Dec 21, 2021Updated 4 years ago
- A handy dataset of noises for ASR☆22May 29, 2019Updated 6 years ago
- Text-to-Speech tutorial at SLTU 2016☆35May 10, 2016Updated 9 years ago
- Trained speaker embedding deep learning models and evaluation pipelines in pytorch and tesorflow for speaker recognition.☆36Oct 4, 2019Updated 6 years ago
- Mapping features using Deep Neural Networks (DNNs) with application to Voice Conversion (VC). The implementations are on top of Theano Py…☆32May 30, 2018Updated 7 years ago
- Research_speech_speaker_verification_nist_sre2010☆12Mar 1, 2016Updated 10 years ago
- SChunk-Encoder (Transformer or Conformer) for streaming E2E ASR☆11Oct 21, 2022Updated 3 years ago
- Python wrapper for kaldi's arpa2fst☆38Aug 27, 2025Updated 6 months ago
- Code for the Paper Speech Recognition and Multi-Speaker Diarization of Long Conversations☆38Jun 12, 2023Updated 2 years ago
- ☆21Sep 24, 2018Updated 7 years ago
- (semi) Grapheme-to-Phoneme (G2P) - seq2seq model using PyTorch for Korean☆23Dec 17, 2017Updated 8 years ago
- Compute the most likely permutation of a lattice given an LM☆10Jan 3, 2013Updated 13 years ago
- A C++ library for parsing and manipulating JSGF grammar files.☆14Feb 13, 2024Updated 2 years ago