kate-egorova / ASR-hybrid-decoding
This is an extension of the Kaldi speech recognition software that decodes speech with hybrid word and phoneme graphs. The output is a mix of in-vocabulary words and phoneme sequences. This decoding is suitable for systems with only a small dictionary available and supports later recovery of out-of-vocabulary (OOV) words.
☆11Updated 5 years ago
Alternatives and similar repositories for ASR-hybrid-decoding
Users who are interested in ASR-hybrid-decoding are comparing it to the libraries listed below
- In this repository, I try to combine k2 with speechbrain to decode well and quickly.☆16Updated 3 years ago
- ☆17Updated 5 years ago
- A handy dataset of noises for ASR☆22Updated 6 years ago
- ☆17Updated 6 years ago
- A SPMI Lab toolkit for language models.☆11Updated 8 years ago
- ☆16Updated 3 years ago
- Wake word spotting with Kaldi☆19Updated 5 years ago
- ☆21Updated 6 years ago
- magicspeech competition recipe☆18Updated 5 years ago
- This is a mirror of https://gitlab.com/tiro-is/tiro-speech-core ☆15Updated 2 years ago
- Unsupervised speech activity detection system.☆11Updated 7 years ago
- Perform the forced decoding with target transcription☆11Updated 7 years ago
- A KALDI/C++ implementation of GoogleBrain's SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition☆14Updated 6 years ago
- Losses and decoders for end-to-end ASR and OCR☆34Updated 5 years ago
- A library of speech gadgets.☆14Updated 3 years ago
- Open Source Speech/Text Data on AI☆19Updated 3 years ago
- An ASR decoder and graph-making project☆33Updated 3 years ago
- Filtering and Noise Adding Tool☆29Updated 3 years ago
- Implementation of different noise embeddings for noise aware training of Kaldi acoustic models.☆13Updated 4 years ago
- A bunch of scripts exploiting several tools to perform inverse text normalization (ITN)☆21Updated 8 years ago
- This repo contains the baseline model recipes and pre-trained model for the GramVanni Hindi ASR challenge☆15Updated 3 years ago
- Python implementation of CTC beam search decoder + agnostic LM scorer☆20Updated 5 years ago
- BurrMill core☆22Updated 4 years ago
- Meta-embeddings are a probabilistic generalization of embeddings in machine learning.☆23Updated 7 years ago
- Detect emotion from audio☆13Updated 7 years ago
- Kaldi style neural network training in pytorch for use in place of nnet3 in Kaldi.☆26Updated last year
- Speaker Diarization library in Python. Performs VAD, Segmentation, Linear Clustering, Hierarchical Clustering☆15Updated 8 years ago
- ☆24Updated 5 years ago
- Python wrapper for hts engine☆14Updated 7 years ago
- Various algorithms for voice activity detection☆22Updated 8 years ago