[ICASSP 2022] Improving End-to-End Contextual Speech Recognition with Fine-Grained Contextual Knowledge Selection
☆25May 18, 2023Updated 2 years ago
Alternatives and similar repositories for CIF-ColDec
Users that are interested in CIF-ColDec are comparing it to the libraries listed below
Sorting:
- ☆13Sep 25, 2024Updated last year
- [ICASSP 2020] CIF: Continuous Integrate-and-Fire for End-to-End Speech Recognition (A PyTorch implementation of Continuous Integrate-and-…☆80Jan 9, 2025Updated last year
- Toward Multi Modality Language Model - implementation of GPT-4o/Project Astra☆16Dec 10, 2024Updated last year
- Implementation of CoBERT: Self-Supervised Speech Representation Learning Through Code Representation Learning☆48Nov 8, 2023Updated 2 years ago
- Taiwanese Speech Synthesis with Tacotron2☆25Oct 2, 2022Updated 3 years ago
- ASR text preprocessing utility☆21Aug 5, 2024Updated last year
- [INTERSPEECH 2023] Knowledge Transfer from Pre-trained Language Models to Cif-based Recognizers via Hierarchical Distillation☆41Sep 1, 2023Updated 2 years ago
- Super Flappy Bird in p5.js☆10Mar 8, 2021Updated 4 years ago
- Code for the method proposed in the paper:- ccc-wav2vec 2.0: Clustering aided Cross-Contrastive learning of Self-Supervised speech repres…☆23Mar 18, 2024Updated last year
- The project for speech translation☆12Sep 28, 2023Updated 2 years ago
- Repository for reproducing result in journal "Self-supervised learning for Speech Emotion Recognition"☆10Mar 15, 2023Updated 2 years ago
- The Android application providing user with REST-based interface for utilizing built-in Android's TTS engine. The web service is highly c…☆11Jul 28, 2020Updated 5 years ago
- ☆10Apr 17, 2024Updated last year
- Explore different way to mix speech model(wav2vec2, hubert) and nlp model(BART,T5,GPT) together☆46Jul 3, 2025Updated 7 months ago
- Target speaker automatic speech recognition (TS-ASR)☆12Oct 14, 2023Updated 2 years ago
- Neural network sequence labeling model - some sloppy modifications to the original toolkit to enable punctuation restoration in unsegment…☆10Jan 8, 2017Updated 9 years ago
- Repo for the FB AI Speech team.☆25Aug 24, 2021Updated 4 years ago
- A curated list of awesome papers on contextualizing E2E ASR outputs☆80May 10, 2023Updated 2 years ago
- ☆10Sep 19, 2022Updated 3 years ago
- ☆14Nov 26, 2024Updated last year
- A SPMI Lab toolkit for language models.☆11Apr 12, 2017Updated 8 years ago
- PyTorch toolkit for streaming speech recognition, speech translation and simultaneous translation based on fairseq.☆25Oct 3, 2022Updated 3 years ago
- The implementation for "Empowering Whisper as a Joint Multi-Talker and Target-Talker Speech Recognition System".☆30Aug 2, 2025Updated 6 months ago
- This repository is the official implementation of unimodal aggregation (UMA) for automaticspeech recognition (ASR).☆35Dec 17, 2024Updated last year
- [ICASSP2023] Source code, model links and open test sets for paper SeACo-Paraformer.☆39Mar 15, 2024Updated last year
- ☆11Oct 14, 2023Updated 2 years ago
- Taiwanese Translation with BERT based model and RNN. Collection of Taiwanese text corpus☆13Oct 15, 2022Updated 3 years ago
- ☆17Oct 18, 2023Updated 2 years ago
- SpeechGLUE is a speech version of the GLUE benchmark, driven by text-to-speech.☆13Jun 2, 2023Updated 2 years ago
- ☆16Nov 9, 2023Updated 2 years ago
- Textless (ASR-transcript free) Spoken Question Answering. The official release of NMSQA dataset and the implementation of "DUAL: Textless…☆35Aug 10, 2023Updated 2 years ago
- ☆31Jul 13, 2023Updated 2 years ago
- ☆15Sep 9, 2021Updated 4 years ago
- ☆15Aug 25, 2022Updated 3 years ago
- **Interspeech 2022** 《SpeechPrompt: An Exploration of Prompt Tuning on Generative Spoken Language Model for Speech Processing Tasks》Speec…☆101Apr 10, 2025Updated 10 months ago
- ☆18Mar 13, 2024Updated last year
- A publishing website of a table collecting meta-learning-related papers in the area of human language processing.☆17Aug 2, 2021Updated 4 years ago
- ☆37Jul 4, 2024Updated last year
- ☆37Jun 28, 2021Updated 4 years ago