m-wiesner / nnet_pytorch
Kaldi-style neural network training in PyTorch, for use in place of nnet3 in Kaldi.
☆26Jul 25, 2024Updated last year
Alternatives and similar repositories for nnet_pytorch
Users that are interested in nnet_pytorch are comparing it to the libraries listed below
- ☆17Nov 25, 2019Updated 6 years ago
- ☆17Apr 14, 2023Updated 2 years ago
- Expected edit distance implementation using OpenFst tools☆11May 13, 2015Updated 10 years ago
- APAM toolkit is built on PyTorch and provides recipes to adapt pretrained acoustic models with a variety of sequence discriminative train…☆14Feb 15, 2021Updated 4 years ago
- ☆27Jan 19, 2021Updated 5 years ago
- ☆21Jan 13, 2020Updated 6 years ago
- BurrMill core☆22Nov 2, 2021Updated 4 years ago
- ☆10Mar 20, 2021Updated 4 years ago
- code for paper "learning to fool the speaker recognition"☆10Jun 12, 2020Updated 5 years ago
- PyTorch speech2text inference script for the NVidia openseq2seq wav2letter model variant☆10Aug 12, 2019Updated 6 years ago
- steps to perform text-based speaker diarization with kaldi toolkit☆12Nov 2, 2018Updated 7 years ago
- This is an extension of the Kaldi speech recognition software that allows decoding of speech with hybrid word and phoneme graphs.…☆11Feb 4, 2020Updated 6 years ago
- asr2k☆52Jun 2, 2024Updated last year
- A pytorch wrapper for LF-MMI training and parallel training in Kaldi☆73Jun 8, 2022Updated 3 years ago
- 24-hour Automatic Speech Recognition☆27Jun 4, 2021Updated 4 years ago
- Segment a given audio into utterances using a trained end-to-end ASR model.☆74Oct 9, 2020Updated 5 years ago
- ☆28Jan 29, 2021Updated 5 years ago
- Chinese-ASR built on kaldi☆14Jan 21, 2019Updated 7 years ago
- DUSTED: Spoken-Term Discovery using Discrete Speech Units☆18Oct 2, 2024Updated last year
- Code to accompany the paper "Learning Grimaces By Watching TV" and FaceValue dataset☆12Aug 4, 2018Updated 7 years ago
- Moved to https://github.com/k2-fsa/icefall☆146Oct 13, 2022Updated 3 years ago
- A GPU language model, based on btree backed tries.☆29Mar 6, 2018Updated 7 years ago
- CS224S Course Project☆14Jun 9, 2014Updated 11 years ago
- Kaldi extended by Kaituo XU with new features in nnet1.☆12Dec 16, 2018Updated 7 years ago
- ☆14Jun 12, 2015Updated 10 years ago
- A simple tutorial on setting up Sparrowhawk - a text-to-speech normalization engine☆14Oct 16, 2017Updated 8 years ago
- phoneme tokenizer and grapheme-to-phoneme model for 8k languages☆174Jun 9, 2023Updated 2 years ago
- PyTorch implementation of LF-MMI for End-to-end ASR☆220Jan 14, 2021Updated 5 years ago
- Support tools for punctuation and boundary detection for ASR output.☆55Dec 8, 2022Updated 3 years ago
- AudioVisual Diarization - Supervised and Unsupervised☆15Nov 22, 2022Updated 3 years ago
- This is a mirror of https://gitlab.com/tiro-is/tiro-speech-core☆15Jun 19, 2023Updated 2 years ago
- Properly handle position-dependent phones in a subword lexicon FST☆31Oct 26, 2020Updated 5 years ago
- An efficient implementation of RNN-T Prefix Beam Search in C++/CUDA.☆67Jan 7, 2026Updated last month
- This will hold the crowdsourcing platform to be used to store voice data from various speakers which will act as input dataset for speech…☆17Mar 6, 2023Updated 2 years ago
- One-shot TTS with Improved Unseen Speaker and Style Transfer☆37Mar 2, 2022Updated 3 years ago
- INACTIVE - http://mzl.la/ghe-archive - Tools to create ARPA models from cmu pocketsphinx dictionaries for proper g2p generation☆21Mar 29, 2019Updated 6 years ago
- Use your data to create a speech recognition system in Kaldi. Fast.☆65Jan 2, 2020Updated 6 years ago
- ☆67Mar 25, 2022Updated 3 years ago
- A "Crowd-Built" continuously growing speech dataset with transcripts. The dataset contains multiple languages and is intended for anyone …☆43Aug 3, 2022Updated 3 years ago