[ICASSP 2020] CIF: Continuous Integrate-and-Fire for End-to-End Speech Recognition (A PyTorch implementation of Continuous Integrate-and-Fire mechanism).
☆79Jan 9, 2025Updated last year
Alternatives and similar repositories for CIF-PyTorch
Users that are interested in CIF-PyTorch are comparing it to the libraries listed below
Sorting:
- [INTERSPEECH 2023] Knowledge Transfer from Pre-trained Language Models to Cif-based Recognizers via Hierarchical Distillation☆41Sep 1, 2023Updated 2 years ago
- A fast parallel PyTorch implementation of the "CIF: Continuous Integrate-and-Fire for End-to-End Speech Recognition" https://arxiv.org/ab…☆36Feb 10, 2024Updated 2 years ago
- [ICASSP 2022] Improving End-to-End Contextual Speech Recognition with Fine-Grained Contextual Knowledge Selection☆25May 18, 2023Updated 2 years ago
- Cross-Speaker Encoding Network for Multi-talker Speech Recognition☆12Mar 14, 2025Updated last year
- ☆37Jun 28, 2021Updated 4 years ago
- PyTorch toolkit for streaming speech recognition, speech translation and simultaneous translation based on fairseq.☆25Oct 3, 2022Updated 3 years ago
- ☆16Nov 9, 2023Updated 2 years ago
- SLT 2024 Challenge: Post-ASR-Speaker-Tagging☆16Jun 16, 2024Updated last year
- Code for the paper: How Much Context Does My Attention-Based ASR System Need?☆10Mar 8, 2026Updated 2 weeks ago
- End-to-end Speech Translation☆35Apr 12, 2021Updated 4 years ago
- ☆37Mar 30, 2021Updated 4 years ago
- Speaker-aware CTC (SACTC) for multi-talker overlapped speech recognition.☆22May 26, 2025Updated 9 months ago
- The project for speech translation☆12Sep 28, 2023Updated 2 years ago
- The implementation for "Empowering Whisper as a Joint Multi-Talker and Target-Talker Speech Recognition System".☆30Aug 2, 2025Updated 7 months ago
- ☆13Sep 25, 2024Updated last year
- Pronunciation-assisted Subword Modeling☆31May 30, 2019Updated 6 years ago
- Computes the MWER (minimum WER) Loss with CTC beam search. Knowledge distillation for CTC loss.☆59Sep 6, 2023Updated 2 years ago
- Optimized loss based on cross-entropy (CE), like MWER (minimum WER) Loss with beam search and negative sampling strategy, Smoothed Max Po…☆24Oct 11, 2024Updated last year
- Implementaion RNN tranceducer☆23Jun 25, 2019Updated 6 years ago
- An extension of thu-spmi/CAT which contains a full-fledged implementation of CTC-CRF for Tensorflow.☆12Jul 5, 2021Updated 4 years ago
- NAR-BERT-ASR☆10Sep 27, 2021Updated 4 years ago
- NeMo: a toolkit for conversational AI☆13May 4, 2024Updated last year
- Implements of CTC, Speech-Transformer and CIF for end-to-end speech recognition with pytorch☆23Jul 28, 2020Updated 5 years ago
- ☆27Aug 31, 2022Updated 3 years ago
- A Python interface to OpenFst (fix FstDrawer interface issue for 1.6 version)☆17Apr 2, 2018Updated 7 years ago
- Scripts for data generation, scoring and data manifest preparation for CHiME-8 DASR task.☆24Feb 25, 2025Updated last year
- Test-time adaptation for speech recognition model by single utterance. The official implementation of "Listen, Adapt, Better WER: Source-…☆21Apr 1, 2022Updated 3 years ago
- End-to-End Speech Processing Toolkit☆15Jan 20, 2025Updated last year
- This repository is the official implementation of unimodal aggregation (UMA) for automaticspeech recognition (ASR).☆35Dec 17, 2024Updated last year
- ☆15Jul 4, 2024Updated last year
- open-source Mandarian biased word dataset☆14Sep 21, 2023Updated 2 years ago
- X-LLM: Bootstrapping Advanced Large Language Models by Treating Multi-Modalities as Foreign Languages☆316Aug 10, 2023Updated 2 years ago
- ☆16Jun 13, 2022Updated 3 years ago
- ☆13Oct 27, 2021Updated 4 years ago
- Contrastive Learning for Improving ASR Robustness in Spoken Language Understanding☆11May 19, 2023Updated 2 years ago
- PyTorch implementation of WASE described in our ICASSP 2021: "Wase: Learning When to Attend for Speaker Extraction in Cocktail Party Envi…☆27Jan 11, 2022Updated 4 years ago
- A streamable speech recognition model with transformer encoders and RNN-T loss☆11Mar 1, 2021Updated 5 years ago
- Repo for the FB AI Speech team.☆25Aug 24, 2021Updated 4 years ago
- A CRF-based ASR Toolkit☆366Feb 5, 2026Updated last month