aispeech-lab / w2v-cif-bertView external linksLinks
☆37Jun 28, 2021Updated 4 years ago
Alternatives and similar repositories for w2v-cif-bert
Users that are interested in w2v-cif-bert are comparing it to the libraries listed below
Sorting:
- Implementation of different noise embeddings for noise aware training of Kaldi acoustic models.☆13Feb 13, 2021Updated 5 years ago
- A bunch of scripts exploiting several tools to perform inverse text normalization (ITN)☆21Sep 27, 2017Updated 8 years ago
- [ICASSP 2020] CIF: Continuous Integrate-and-Fire for End-to-End Speech Recognition (A PyTorch implementation of Continuous Integrate-and-…☆80Jan 9, 2025Updated last year
- The project for speech translation☆12Sep 28, 2023Updated 2 years ago
- ☆48Jan 8, 2021Updated 5 years ago
- This is an extension of kaldi speech recognition software which allows to perform decoding of speech with hybrid word and phoneme graphs.…☆11Feb 4, 2020Updated 6 years ago
- A SPMI Lab toolkit for language models.☆11Apr 12, 2017Updated 8 years ago
- ☆76Oct 25, 2021Updated 4 years ago
- unsupervised ASR (mainly phone classifier) using EODM and GAN☆12Oct 22, 2020Updated 5 years ago
- Semi-supervised spoken language understanding (SLU) via self-supervised speech and language model pretraining☆12Mar 23, 2021Updated 4 years ago
- Phonetically-Oriented Word Error Rate☆36May 4, 2019Updated 6 years ago
- ☆16Nov 9, 2023Updated 2 years ago
- Mason-Alberta Phonetic Segmenter☆15Dec 16, 2025Updated last month
- ☆37Nov 22, 2025Updated 2 months ago
- Computes the MWER (minimum WER) Loss with CTC beam search. Knowledge distillation for CTC loss.☆59Sep 6, 2023Updated 2 years ago
- Code for ACL-IJCNLP 2021 paper "N-Best-ASR-Transformer: Enhancing SLU Performance using Multiple ASR Hypotheses."☆17Nov 30, 2021Updated 4 years ago
- This repository provides data and code for "Vox Populi, Vox DIY: Benchmark Dataset for Crowdsourced Audio Transcription" paper.☆16Jul 22, 2021Updated 4 years ago
- it's a train acoustics model code lib☆27May 20, 2020Updated 5 years ago
- This repository describes our reproducible framework for assessing self-supervised representation learning from speech☆51Oct 8, 2021Updated 4 years ago
- ☆17Nov 25, 2019Updated 6 years ago
- ☆18Mar 13, 2024Updated last year
- ☆29Jun 15, 2022Updated 3 years ago
- Punctuation generation for speech transcripts using lexical and prosodic features☆42Mar 5, 2019Updated 6 years ago
- ☆17Apr 14, 2023Updated 2 years ago
- Confidence Estimation for Black Box Automatic Speech Recognition Systems Using Lattice Recurrent Neural Networks https://arxiv.org/abs/19…☆15Apr 16, 2020Updated 5 years ago
- A fast and lightweight python-based CTC beam search decoder for speech recognition.☆467Jul 13, 2023Updated 2 years ago
- Repository for speech paper reading☆33Aug 19, 2021Updated 4 years ago
- This repository contains the code for our upcoming paper An Investigation of End-to-End Models for Robust Speech Recognition at ICASSP 20…☆49Dec 25, 2024Updated last year
- Artie Bias Corpus: an audio corpus + code for detecting demographic bias☆20Jul 21, 2020Updated 5 years ago
- ☆16Mar 7, 2019Updated 6 years ago
- Deploy Kaldi models using grpc for bidirectional streaming.☆17Sep 30, 2024Updated last year
- Source Code for the Paper "UNIFIED KEYWORD SPOTTING AND AUDIO TAGGING ON MOBILE DEVICES WITH TRANSFORMERS"☆23Mar 6, 2023Updated 2 years ago
- End-to-End Speech Processing Toolkit☆15Jan 20, 2025Updated last year
- ☆22Jun 30, 2021Updated 4 years ago
- ☆16Jun 13, 2022Updated 3 years ago
- Segment a given audio into utterances using a trained end-to-end ASR model.☆74Oct 9, 2020Updated 5 years ago
- wake word spotting with kaldi☆19Dec 3, 2020Updated 5 years ago
- [ICASSP'22] Integer-only Zero-shot Quantization for Efficient Speech Recognition☆34Oct 11, 2021Updated 4 years ago
- Properly handle position-dependent phones in a subword lexicon FST☆31Oct 26, 2020Updated 5 years ago