[ICASSP 2020] CIF: Continuous Integrate-and-Fire for End-to-End Speech Recognition (A PyTorch implementation of Continuous Integrate-and-Fire mechanism).
☆80Jan 9, 2025Updated last year
Alternatives and similar repositories for CIF-PyTorch
Users that are interested in CIF-PyTorch are comparing it to the libraries listed below
Sorting:
- [INTERSPEECH 2023] Knowledge Transfer from Pre-trained Language Models to Cif-based Recognizers via Hierarchical Distillation☆41Sep 1, 2023Updated 2 years ago
- A fast parallel PyTorch implementation of the "CIF: Continuous Integrate-and-Fire for End-to-End Speech Recognition" https://arxiv.org/ab…☆36Feb 10, 2024Updated 2 years ago
- [ICASSP 2022] Improving End-to-End Contextual Speech Recognition with Fine-Grained Contextual Knowledge Selection☆25May 18, 2023Updated 2 years ago
- ☆37Jun 28, 2021Updated 4 years ago
- Cross-Speaker Encoding Network for Multi-talker Speech Recognition☆11Mar 14, 2025Updated 11 months ago
- ☆16Nov 9, 2023Updated 2 years ago
- The project for speech translation☆12Sep 28, 2023Updated 2 years ago
- ☆13Sep 25, 2024Updated last year
- PyTorch toolkit for streaming speech recognition, speech translation and simultaneous translation based on fairseq.☆25Oct 3, 2022Updated 3 years ago
- An extension of thu-spmi/CAT which contains a full-fledged implementation of CTC-CRF for Tensorflow.☆12Jul 5, 2021Updated 4 years ago
- Pronunciation-assisted Subword Modeling☆31May 30, 2019Updated 6 years ago
- End-to-end Speech Translation☆35Apr 12, 2021Updated 4 years ago
- SLT 2024 Challenge: Post-ASR-Speaker-Tagging☆16Jun 16, 2024Updated last year
- Computes the MWER (minimum WER) Loss with CTC beam search. Knowledge distillation for CTC loss.☆59Sep 6, 2023Updated 2 years ago
- ☆37Mar 30, 2021Updated 4 years ago
- Optimized loss based on cross-entropy (CE), like MWER (minimum WER) Loss with beam search and negative sampling strategy, Smoothed Max Po…☆24Oct 11, 2024Updated last year
- The implementation for "Empowering Whisper as a Joint Multi-Talker and Target-Talker Speech Recognition System".☆30Aug 2, 2025Updated 7 months ago
- ☆15Jul 4, 2024Updated last year
- ☆27Aug 31, 2022Updated 3 years ago
- This repository is the official implementation of unimodal aggregation (UMA) for automaticspeech recognition (ASR).☆35Dec 17, 2024Updated last year
- it's a train acoustics model code lib☆27May 20, 2020Updated 5 years ago
- A Python interface to OpenFst (fix FstDrawer interface issue for 1.6 version)☆17Apr 2, 2018Updated 7 years ago
- End-to-End Speech Processing Toolkit☆15Jan 20, 2025Updated last year
- ☆16Jun 13, 2022Updated 3 years ago
- Test-time adaptation for speech recognition model by single utterance. The official implementation of "Listen, Adapt, Better WER: Source-…☆20Apr 1, 2022Updated 3 years ago
- it's ASR decoder and make graph project☆33May 26, 2022Updated 3 years ago
- A SPMI Lab toolkit for language models.☆11Apr 12, 2017Updated 8 years ago
- NAR-BERT-ASR☆10Sep 27, 2021Updated 4 years ago
- ☆10Sep 19, 2022Updated 3 years ago
- Speaker-aware CTC (SACTC) for multi-talker overlapped speech recognition.☆21May 26, 2025Updated 9 months ago
- Scripts for data generation, scoring and data manifest preparation for CHiME-8 DASR task.☆24Feb 25, 2025Updated last year
- Towards Comprehensive Evaluation for End-to-End Spoken Dialogue Models☆50Sep 2, 2025Updated 6 months ago
- Simple Python package for fast DER computation☆35Jun 29, 2023Updated 2 years ago
- Memory efficient transducer loss computation☆69Jun 10, 2022Updated 3 years ago
- PyTorch implementation of WASE described in our ICASSP 2021: "Wase: Learning When to Attend for Speaker Extraction in Cocktail Party Envi…☆26Jan 11, 2022Updated 4 years ago
- E2E system with LF-MMI; word N-gram for Mandarin☆166Apr 29, 2022Updated 3 years ago
- Curriculum Vitae of Quan Wang☆15Dec 13, 2025Updated 2 months ago
- ☆13Oct 27, 2021Updated 4 years ago
- Code for the paper: How Much Context Does My Attention-Based ASR System Need?☆10Feb 18, 2026Updated last week