HMM, CTC, RNN-Transducer, forward-backward algorithm
☆20Sep 5, 2023Updated 2 years ago
Alternatives and similar repositories for Aligners
Users that are interested in Aligners are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- An efficient implementation of RNN-T Prefix Beam Search in C++/CUDA.☆67Jan 7, 2026Updated 5 months ago
- Recurrent Neural Aligner☆51Apr 14, 2020Updated 6 years ago
- PyTorch CTC Decoder bindings☆14Nov 2, 2017Updated 8 years ago
- CUDA-Warp RNN-Transducer☆216Feb 22, 2023Updated 3 years ago
- Implementation of different noise embeddings for noise aware training of Kaldi acoustic models.☆13Feb 13, 2021Updated 5 years ago
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- PyTorch Implementations for End-to-End Automatic Speech Recognition☆127Jun 10, 2019Updated 7 years ago
- A pytorch_lightning reimplementation of the Transducer module from ESPnet.☆78Mar 11, 2021Updated 5 years ago
- Memory efficient transducer loss computation☆70Jun 10, 2022Updated 4 years ago
- Decoders from Kaldi using OpenFst☆36Apr 10, 2026Updated 2 months ago
- ☆76Mar 18, 2022Updated 4 years ago
- ☆12Jun 10, 2021Updated 5 years ago
- An imporved version of Fastsinging singing voice synthesising system.☆21Nov 3, 2020Updated 5 years ago
- E2E-SincNet: Toward fully end-to-end speech recognition☆30Feb 1, 2020Updated 6 years ago
- ESPnet extensions for semi-supervised end-to-end speech recognition. See also https://github.com/ShigekiKarita/espnet-semi-supervised/tre…☆38Feb 13, 2020Updated 6 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Pypi installable TDNN and TDNN-F layers for PyTorch based acoustic model training☆41Dec 18, 2020Updated 5 years ago
- This repository describes our reproducible framework for assessing self-supervised representation learning from speech☆52Oct 8, 2021Updated 4 years ago
- Python wrappers for Kaldi Levenshtein's distance and alignment code.☆69Updated this week
- A Fast Sequence Transducer Implementation with PyTorch Bindings☆200Sep 20, 2022Updated 3 years ago
- A Python interface to OpenFst (fix FstDrawer interface issue for 1.6 version)☆17Apr 2, 2018Updated 8 years ago
- Attention-based end-to-end ASR on TIMIT in PyTorch☆18Nov 9, 2021Updated 4 years ago
- Transcribing Speech with Multinomial Diffusion, training code and models.☆80Sep 27, 2023Updated 2 years ago
- Losses and decoders for end-to-end ASR and OCR☆34Oct 30, 2020Updated 5 years ago
- PyTorch end-to-end speech recognition☆50Dec 30, 2020Updated 5 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- A Pytorch Implementation of Transducer Model for End-to-End Speech Recognition☆239May 12, 2020Updated 6 years ago
- Coqui Inference Engine☆41Aug 3, 2021Updated 4 years ago
- A library of speech gadgets.☆15Oct 15, 2022Updated 3 years ago
- Neural network-based forced alignment with bidirectional attention mechanism☆78Jan 17, 2025Updated last year
- End-to-End Automatic Speech Recognition on PyTorch☆304Jun 2, 2022Updated 4 years ago
- ☆16Jan 24, 2018Updated 8 years ago
- PyTorch Implementation of Google Brain's WaveGrad 2: Iterative Refinement for Text-to-Speech Synthesis☆68Aug 3, 2021Updated 4 years ago
- ☆38Mar 30, 2021Updated 5 years ago
- The repo contains our code of ``Semantic Mask for Transformer based End-to-End Speech Recognition"☆39Jun 9, 2020Updated 6 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- This repo contains the baseline model recipes and pre-trained model for GramVanni hindi ASR challenge☆16Mar 26, 2022Updated 4 years ago
- Efficient Neural Architecture Search via Straight-Through Gradients☆13Nov 12, 2020Updated 5 years ago
- ☆16Jun 13, 2022Updated 4 years ago
- ☆38May 13, 2020Updated 6 years ago
- DaCiDian is an open-sourced chinese mandarin lexicon for automatic speech recognition(ASR)☆301Jun 15, 2020Updated 6 years ago
- pytorch CTC implementation for ASR. Use eesen's fst decoder framework☆10Feb 27, 2020Updated 6 years ago
- Python API for reading and querying ARPA formatted language models.☆33Sep 9, 2014Updated 11 years ago