Kaldi style neural network training in pytorch for use in place of nnet3 in Kaldi.
☆26Jul 25, 2024Updated last year
Alternatives and similar repositories for nnet_pytorch
Users that are interested in nnet_pytorch are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆17Nov 25, 2019Updated 6 years ago
- ☆17Apr 14, 2023Updated 2 years ago
- Expected edit distance implementation using OpenFst tools☆11May 13, 2015Updated 10 years ago
- APAM toolkit is built on PyTorch and provides recipes to adapt pretrained acoustic models with a variety of sequence discriminative train…☆14Feb 15, 2021Updated 5 years ago
- ☆21Jan 13, 2020Updated 6 years ago
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- ☆27Jan 19, 2021Updated 5 years ago
- BurrMill core☆22Nov 2, 2021Updated 4 years ago
- PyTorch speech2text inference script for the NVidia openseq2seq wav2letter model variant☆10Aug 12, 2019Updated 6 years ago
- steps to perform text-based speaker diarization with kaldi toolkit☆12Nov 2, 2018Updated 7 years ago
- A pytorch wrapper for LF-MMI training and parallel training in Kaldi☆73Jun 8, 2022Updated 3 years ago
- A GPU language model, based on btree backed tries.☆29Mar 6, 2018Updated 8 years ago
- Speech to text library for Rhasspy using Kaldi☆15Dec 9, 2023Updated 2 years ago
- DUSTED: Spoken-Term Discovery using Discrete Speech Units☆18Oct 2, 2024Updated last year
- Segment a given audio into utterances using a trained end-to-end ASR model.☆74Oct 9, 2020Updated 5 years ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- ☆48Jan 8, 2021Updated 5 years ago
- This is an extension of kaldi speech recognition software which allows to perform decoding of speech with hybrid word and phoneme graphs.…☆11Feb 4, 2020Updated 6 years ago
- PyTorch implementation of LF-MMI for End-to-end ASR☆221Jan 14, 2021Updated 5 years ago
- [EMNLP 2025 Findings] Official code for EZ-VC: Easy Zero-shot Any-to-Any Voice Conversion☆36Sep 9, 2025Updated 6 months ago
- asr2k☆52Jun 2, 2024Updated last year
- Support tools for punctuation and boundary detection for ASR output.☆55Dec 8, 2022Updated 3 years ago
- 24-hour Automatic Speech Recognition☆27Jun 4, 2021Updated 4 years ago
- Scripts for data generation, scoring and data manifest preparation for CHiME-8 DASR task.☆24Feb 25, 2025Updated last year
- phoneme tokenizer and grapheme-to-phoneme model for 8k languages☆175Jun 9, 2023Updated 2 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Kaldi extended by Kaituo XU with new features in nnet1.☆12Dec 16, 2018Updated 7 years ago
- ☆16Jun 13, 2022Updated 3 years ago
- create CMakeLists.txt for kaldi☆20Apr 30, 2020Updated 5 years ago
- Android ORM framework.☆20Jul 1, 2015Updated 10 years ago
- Moved to https://github.com/k2-fsa/icefall☆146Oct 13, 2022Updated 3 years ago
- ☆10Mar 20, 2021Updated 5 years ago
- ☆22Jul 8, 2021Updated 4 years ago
- A simple tutorial on setting up Sparrowhawk - a text-to-speech normalization engine☆14Oct 16, 2017Updated 8 years ago
- Chinese-ASR built on kaldi☆14Jan 21, 2019Updated 7 years ago
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- Code to accompany the paper "Learning Grimaces By Watching TV" and FaceValue dataset☆12Aug 4, 2018Updated 7 years ago
- One-shot TTS with Improved Unseen Speaker and Style Transfer☆37Mar 2, 2022Updated 4 years ago
- DEPRECATED - A webapp for collecting speech samples for voice recognition testing and training☆20May 23, 2019Updated 6 years ago
- Using speaker embedding for diarization in PyTorch☆17Aug 29, 2020Updated 5 years ago
- An efficient implementation of RNN-T Prefix Beam Search in C++/CUDA.☆67Jan 7, 2026Updated 2 months ago
- This is a mirror of https://gitlab.com/tiro-is/tiro-speech-core☆15Jun 19, 2023Updated 2 years ago
- Julia package for Probabilistic Canonical Correlation Analysis☆12Mar 30, 2022Updated 3 years ago