HMM, CTC, RNN-Transducer, forward-backward algorithm
☆20Sep 5, 2023Updated 2 years ago
Alternatives and similar repositories for Aligners
Users that are interested in Aligners are comparing it to the libraries listed below.
- An efficient implementation of RNN-T Prefix Beam Search in C++/CUDA.☆67Jan 7, 2026Updated 4 months ago
- Recurrent Neural Aligner☆51Apr 14, 2020Updated 6 years ago
- PyTorch CTC Decoder bindings☆14Nov 2, 2017Updated 8 years ago
- CUDA-Warp RNN-Transducer☆216Feb 22, 2023Updated 3 years ago
- Implementation of different noise embeddings for noise aware training of Kaldi acoustic models.☆13Feb 13, 2021Updated 5 years ago
- PyTorch Implementations for End-to-End Automatic Speech Recognition☆127Jun 10, 2019Updated 6 years ago
- A pytorch_lightning reimplementation of the Transducer module from ESPnet.☆78Mar 11, 2021Updated 5 years ago
- Memory efficient transducer loss computation☆70Jun 10, 2022Updated 3 years ago
- Decoders from Kaldi using OpenFst☆34Apr 10, 2026Updated 3 weeks ago
- ☆76Mar 18, 2022Updated 4 years ago
- ☆12Jun 10, 2021Updated 4 years ago
- An improved version of the Fastsinging singing voice synthesis system.☆21Nov 3, 2020Updated 5 years ago
- E2E-SincNet: Toward fully end-to-end speech recognition☆30Feb 1, 2020Updated 6 years ago
- ESPnet extensions for semi-supervised end-to-end speech recognition. See also https://github.com/ShigekiKarita/espnet-semi-supervised/tre…☆38Feb 13, 2020Updated 6 years ago
- Pypi installable TDNN and TDNN-F layers for PyTorch based acoustic model training☆41Dec 18, 2020Updated 5 years ago
- This repository describes our reproducible framework for assessing self-supervised representation learning from speech☆52Oct 8, 2021Updated 4 years ago
- Python wrappers for Kaldi Levenshtein's distance and alignment code.☆68Apr 9, 2026Updated 3 weeks ago
- A Fast Sequence Transducer Implementation with PyTorch Bindings☆200Sep 20, 2022Updated 3 years ago
- A Python interface to OpenFst (fix FstDrawer interface issue for 1.6 version)☆17Apr 2, 2018Updated 8 years ago
- Attention-based end-to-end ASR on TIMIT in PyTorch☆18Nov 9, 2021Updated 4 years ago
- Transcribing Speech with Multinomial Diffusion, training code and models.☆80Sep 27, 2023Updated 2 years ago
- Losses and decoders for end-to-end ASR and OCR☆34Oct 30, 2020Updated 5 years ago
- PyTorch end-to-end speech recognition☆50Dec 30, 2020Updated 5 years ago
- A PyTorch Implementation of Transducer Model for End-to-End Speech Recognition☆239May 12, 2020Updated 5 years ago
- Coqui Inference Engine☆41Aug 3, 2021Updated 4 years ago
- A library of speech gadgets.☆14Oct 15, 2022Updated 3 years ago
- Neural network-based forced alignment with bidirectional attention mechanism☆78Jan 17, 2025Updated last year
- End-to-End Automatic Speech Recognition on PyTorch☆304Jun 2, 2022Updated 3 years ago
- ☆16Jan 24, 2018Updated 8 years ago
- PyTorch Implementation of Google Brain's WaveGrad 2: Iterative Refinement for Text-to-Speech Synthesis☆69Aug 3, 2021Updated 4 years ago
- ☆37Mar 30, 2021Updated 5 years ago
- The repo contains our code for "Semantic Mask for Transformer based End-to-End Speech Recognition"☆39Jun 9, 2020Updated 5 years ago
- This repo contains the baseline model recipes and pre-trained model for the GramVanni Hindi ASR challenge☆15Mar 26, 2022Updated 4 years ago
- Efficient Neural Architecture Search via Straight-Through Gradients☆13Nov 12, 2020Updated 5 years ago
- ☆16Jun 13, 2022Updated 3 years ago
- DaCiDian is an open-source Mandarin Chinese lexicon for automatic speech recognition (ASR)☆301Jun 15, 2020Updated 5 years ago
- ☆38May 13, 2020Updated 5 years ago
- PyTorch CTC implementation for ASR. Uses EESEN's FST decoder framework☆10Feb 27, 2020Updated 6 years ago
- Automatic differentiation with weighted finite-state transducers.☆127Apr 12, 2022Updated 4 years ago