Efficient Neural Architecture Search via Straight-Through Gradients
☆13Nov 12, 2020Updated 5 years ago
Alternatives and similar repositories for ST-NAS
Users who are interested in ST-NAS are comparing it to the libraries listed below
- A SPMI Lab toolkit for language models.☆11Apr 12, 2017Updated 8 years ago
- Augmenting Room Impulse Response☆43Sep 15, 2023Updated 2 years ago
- Decoders from Kaldi using OpenFst☆34Jan 29, 2026Updated last month
- An extension of the Kaldi speech recognition software that allows decoding of speech with hybrid word and phoneme graphs.…☆11Feb 4, 2020Updated 6 years ago
- Pronunciation-assisted Subword Modeling☆31May 30, 2019Updated 6 years ago
- Custom decoders for Kaldi☆13Jun 5, 2019Updated 6 years ago
- An extension of thu-spmi/CAT which contains a full-fledged implementation of CTC-CRF for Tensorflow.☆12Jul 5, 2021Updated 4 years ago
- APAM toolkit is built on PyTorch and provides recipes to adapt pretrained acoustic models with a variety of sequence discriminative train…☆14Feb 15, 2021Updated 5 years ago
- Curriculum Vitae of Quan Wang☆15Dec 13, 2025Updated 2 months ago
- ☆13Mar 25, 2021Updated 4 years ago
- Computes the MWER (minimum WER) Loss with CTC beam search. Knowledge distillation for CTC loss.☆59Sep 6, 2023Updated 2 years ago
- Code for ACL-IJCNLP 2021 paper "N-Best-ASR-Transformer: Enhancing SLU Performance using Multiple ASR Hypotheses."☆17Nov 30, 2021Updated 4 years ago
- ☆17Nov 25, 2019Updated 6 years ago
- ☆17Apr 14, 2023Updated 2 years ago
- Mispronunciation detection code for jingju singing voice☆20Sep 5, 2018Updated 7 years ago
- A Python interface to OpenFst (fix FstDrawer interface issue for 1.6 version)☆17Apr 2, 2018Updated 7 years ago
- A pytorch wrapper for LF-MMI training and parallel training in Kaldi☆73Jun 8, 2022Updated 3 years ago
- ESPnet extensions for semi-supervised end-to-end speech recognition. See also https://github.com/ShigekiKarita/espnet-semi-supervised/tre…☆38Feb 13, 2020Updated 6 years ago
- ☆20Nov 22, 2020Updated 5 years ago
- Speech Resynthesis and Language Modeling☆27Jun 11, 2025Updated 8 months ago
- Unofficial pytorch implementation of VISinger: Variational Inference with Adversarial Learning for End-to-end Singing Voice Synthesis (IC…☆20May 12, 2023Updated 2 years ago
- An improved version of the Fastsinging singing voice synthesis system.☆20Nov 3, 2020Updated 5 years ago
- Keyword spotting for audio with attention (KWS model for audio)☆18Jul 15, 2021Updated 4 years ago
- HMM, CTC, RNN-Transducer, forward-backward algorithm☆20Sep 5, 2023Updated 2 years ago
- An effort to track benchmarking results over widely-used datasets for ASR.☆52Dec 19, 2025Updated 2 months ago
- A CRF-based ASR Toolkit☆364Feb 5, 2026Updated last month
- using microphone☆17Sep 2, 2021Updated 4 years ago
- Yichi Zhang et al. A Probabilistic End-To-End Task-Oriented Dialog Model with Latent Belief States towards Semi-Supervised Learning. EMNL…☆20Nov 5, 2020Updated 5 years ago
- ☆54Feb 24, 2026Updated last week
- Implementation of meta-transfer-learning for ASR and LM (ACL 2020)☆52Jul 30, 2020Updated 5 years ago
- Various algorithms for voice activity detection☆22Jan 31, 2017Updated 9 years ago
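- AISHELL open-source data annotation platform, including speech and image annotation, data quality inspection, acceptance, statistics, and other features.☆25Dec 23, 2019Updated 6 years ago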
- Auto Segmentation Criterion (ASG) implemented in pytorch☆51Oct 1, 2021Updated 4 years ago
- Neural Network Audio FingerPrint☆63Mar 5, 2023Updated 3 years ago
- ☆24Sep 25, 2018Updated 7 years ago
- A neural speech codec based on discrete WavLM representations☆24Aug 28, 2024Updated last year
- Code for reproducing experiments in "Domain-Adversarial Voice Activity Detection"☆23Mar 3, 2020Updated 6 years ago
- Conversion of recurrent neural network language models to weighted finite state transducers☆58Jun 1, 2018Updated 7 years ago
- Tensorflow version of DFSMN☆49Jul 17, 2018Updated 7 years ago