shanguanma / Aligners
HMM, CTC, RNN-Transducer, forward-backward algorithm
☆20Sep 5, 2023Updated 2 years ago
Alternatives and similar repositories for Aligners
Users who are interested in Aligners are comparing it to the libraries listed below
- An efficient implementation of RNN-T Prefix Beam Search in C++/CUDA.☆67Jan 7, 2026Updated last month
- Memory efficient transducer loss computation☆69Jun 10, 2022Updated 3 years ago
- A Python interface to OpenFst (fix FstDrawer interface issue for 1.6 version)☆17Apr 2, 2018Updated 7 years ago
- PyTorch CTC Decoder bindings☆14Nov 2, 2017Updated 8 years ago
- PyTorch Implementations for End-to-End Automatic Speech Recognition☆127Jun 10, 2019Updated 6 years ago
- Recurrent Neural Aligner☆51Apr 14, 2020Updated 5 years ago
- E2E-SincNet: Toward fully end-to-end speech recognition☆30Feb 1, 2020Updated 6 years ago
- ☆76Mar 18, 2022Updated 3 years ago
- An improved version of the Fastsinging singing voice synthesis system.☆20Nov 3, 2020Updated 5 years ago
- CUDA-Warp RNN-Transducer☆216Feb 22, 2023Updated 2 years ago
- Pypi installable TDNN and TDNN-F layers for PyTorch based acoustic model training☆41Dec 18, 2020Updated 5 years ago
- Losses and decoders for end-to-end ASR and OCR☆34Oct 30, 2020Updated 5 years ago
- A Fast Sequence Transducer Implementation with PyTorch Bindings☆199Sep 20, 2022Updated 3 years ago
- Python wrappers for Kaldi Levenshtein's distance and alignment code.☆68Jan 5, 2026Updated last month
- ESPnet extensions for semi-supervised end-to-end speech recognition. See also https://github.com/ShigekiKarita/espnet-semi-supervised/tre…☆38Feb 13, 2020Updated 6 years ago
- A library of speech gadgets.☆14Oct 15, 2022Updated 3 years ago
- PyTorch CTC implementation for ASR. Uses Eesen's FST decoder framework☆10Feb 27, 2020Updated 5 years ago
- Decoders from Kaldi using OpenFst☆34Jan 29, 2026Updated 2 weeks ago
- This repository describes our reproducible framework for assessing self-supervised representation learning from speech☆51Oct 8, 2021Updated 4 years ago
- PyTorch speech2text inference script for the NVidia openseq2seq wav2letter model variant☆10Aug 12, 2019Updated 6 years ago
- This repo contains the baseline model recipes and pre-trained model for the GramVanni Hindi ASR challenge☆15Mar 26, 2022Updated 3 years ago
- Implementation of different noise embeddings for noise aware training of Kaldi acoustic models.☆13Feb 13, 2021Updated 5 years ago
- A Pytorch Implementation of Transducer Model for End-to-End Speech Recognition☆239May 12, 2020Updated 5 years ago
- Efficient Neural Architecture Search via Straight-Through Gradients☆13Nov 12, 2020Updated 5 years ago
- Materials of public talks given by SJTU X-LANCE members☆14Dec 3, 2022Updated 3 years ago
- PyTorch end-to-end speech recognition☆49Dec 30, 2020Updated 5 years ago
- A pytorch_lightning reimplementation of the Transducer module from ESPnet.☆78Mar 11, 2021Updated 4 years ago
- End-to-End Automatic Speech Recognition on PyTorch☆304Jun 2, 2022Updated 3 years ago
- End-to-end speech recognition implementation; includes LAS, CTC, and RNNT decoding methods, with models such as SA (MHA), LSTM, CNN, and DFSMN☆15Jun 4, 2021Updated 4 years ago
- ☆16Apr 4, 2022Updated 3 years ago
- ☆16Jan 24, 2018Updated 8 years ago
- ☆12Jun 10, 2021Updated 4 years ago
- ☆37Mar 30, 2021Updated 4 years ago
- Coqui Inference Engine☆40Aug 3, 2021Updated 4 years ago
- Deep-learning-based audio-visual lip biometrics☆15May 9, 2023Updated 2 years ago
- Python API for reading and querying ARPA formatted language models.☆33Sep 9, 2014Updated 11 years ago
- PyTorch Implementation of Google Brain's WaveGrad 2: Iterative Refinement for Text-to-Speech Synthesis☆69Aug 3, 2021Updated 4 years ago
- Transcribing Speech with Multinomial Diffusion, training code and models.☆80Sep 27, 2023Updated 2 years ago
- Automatically constructing corpus for automatic speech recognition from YouTube videos☆156Feb 15, 2020Updated 6 years ago