hfutami / distill-bert-for-seq2seq-asr
☆24Updated 4 years ago
Alternatives and similar repositories for distill-bert-for-seq2seq-asr:
Users that are interested in distill-bert-for-seq2seq-asr are comparing it to the libraries listed below
- Token-Level Supervised Contrastive Learning for Punctuation Restoration☆29Updated 3 years ago
- ☆16Updated 2 years ago
- End-to-end Speech Translation☆36Updated 3 years ago
- ☆36Updated 2 years ago
- This repository is for the paper Incorporating External POS Tagger for Punctuation Restoration. Proc. Interspeech 2021, 1987-1991, doi: 1…☆11Updated last year
- ☆20Updated 3 years ago
- mWER loss implementation in tensorflow☆31Updated 4 years ago
- ☆15Updated 2 years ago
- Learning ASR-Robust Contextualized Embeddings for Spoken Language Understanding☆24Updated 2 years ago
- The repo contains our code of ``Semantic Mask for Transformer based End-to-End Speech Recognition"☆38Updated 4 years ago
- Computes the MWER (minimum WER) Loss with CTC beam search. Knowledge distillation for CTC loss.☆57Updated last year
- Unsupervised spoken sentence embeddings☆14Updated 2 years ago
- ASCEND Chinese-English code-switching dataset☆24Updated 2 years ago
- CMU multilingual speech repository☆31Updated 2 years ago
- Implementation of CoBERT: Self-Supervised Speech Representation Learning Through Code Representation Learning☆46Updated last year
- An efficient implementation of RNN-T Prefix Beam Search in C++/CUDA.☆68Updated 4 years ago
- End-to-End Speech Processing Toolkit☆13Updated 2 months ago
- Recurrent Neural Aligner☆49Updated 4 years ago
- ☆28Updated 2 years ago
- [ICASSP 2022] Improving End-to-End Contextual Speech Recognition with Fine-Grained Contextual Knowledge Selection☆25Updated last year
- Python wrapper for kaldi's arpa2fst☆38Updated 3 months ago
- This is an implementation of paper "End-to-end Speech Translation via Cross-modal Progressive Training" (Interspeech2021)☆20Updated 2 years ago
- unsupervised ASR (mainly phone classifier) using EODM and GAN☆12Updated 4 years ago
- This is the source code of the paper "Neural grapheme-to-phoneme conversion with pretrained grapheme models☆46Updated 3 years ago
- Implementation of Imputer: Sequence Modelling via Imputation and Dynamic Programming in PyTorch☆59Updated 4 years ago
- A pytorch_lightning reimplementation of the Transducer module from ESPnet.☆76Updated 4 years ago
- Semi-supervised spoken language understanding (SLU) via self-supervised speech and language model pretraining☆12Updated 4 years ago
- This repository describes our reproducible framework for assessing self-supervised representation learning from speech☆51Updated 3 years ago
- This repository contains data used in the NAACL 2021 Paper - Proteno: Text Normalization with Limited Data for Fast Deployment in Text to…☆44Updated 3 years ago
- LLaST: Improved End-to-end Speech Translation System Leveraged by Large Language Models☆25Updated 7 months ago