MingLunHan/CIF-HieraDist

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/MingLunHan/CIF-HieraDist)

MingLunHan / CIF-HieraDist

[INTERSPEECH 2023] Knowledge Transfer from Pre-trained Language Models to Cif-based Recognizers via Hierarchical Distillation

☆41

Alternatives and similar repositories for CIF-HieraDist

Users that are interested in CIF-HieraDist are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

MingLunHan / CIF-PyTorch
View on GitHub
[ICASSP 2020] CIF: Continuous Integrate-and-Fire for End-to-End Speech Recognition (A PyTorch implementation of Continuous Integrate-and-…
☆78Jul 14, 2026Updated last week
MingLunHan / CIF-ColDec
View on GitHub
[ICASSP 2022] Improving End-to-End Contextual Speech Recognition with Fine-Grained Contextual Knowledge Selection
☆25Jul 14, 2026Updated last week
Miamoto / Conformer-NTM
View on GitHub
☆16Nov 9, 2023Updated 2 years ago
MiuLab / SpokenCSE
View on GitHub
Contrastive Learning for Improving ASR Robustness in Spoken Language Understanding
☆11May 19, 2023Updated 3 years ago
tuanio / conformer-rnnt
View on GitHub
Conformer RNN-Transducer
☆14May 25, 2022Updated 4 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
xiaoxue1117 / speech-mamba-public
View on GitHub
☆15Nov 26, 2024Updated last year
praveena2j / RecurrentJointAttentionwithLSTMs
View on GitHub
ICASSP 2023: "Recursive Joint Attention for Audio-Visual Fusion in Regression Based Emotion Recognition"
☆14Nov 29, 2024Updated last year
mct10 / CoBERT
View on GitHub
Implementation of CoBERT: Self-Supervised Speech Representation Learning Through Code Representation Learning
☆48Nov 8, 2023Updated 2 years ago
stevenhillis / awesome-asr-contextualization
View on GitHub
A curated list of awesome papers on contextualizing E2E ASR outputs
☆81May 10, 2023Updated 3 years ago
s920128 / NAR-BERT-ASR
View on GitHub
NAR-BERT-ASR
☆10Sep 27, 2021Updated 4 years ago
LingweiMeng / Whisper-Sidecar
View on GitHub
The implementation for "Empowering Whisper as a Joint Multi-Talker and Target-Talker Speech Recognition System".
☆34Aug 2, 2025Updated 11 months ago
CoraJung / flexible-input-slu
View on GitHub
This setup allows to train end-to-end neural models for spoken language understanding (SLU).
☆11Jun 12, 2023Updated 3 years ago
Xianchao-Wu / wenet-deep-sparse-conformer
View on GitHub
☆15Aug 25, 2022Updated 3 years ago
flageval-baai / ChildMandarin
View on GitHub
[ACL 2025 Main] A Comprehensive Mandarin Speech Dataset for Young Children Aged 3-5
☆55Mar 19, 2025Updated last year
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
mubingshen / MLC-SLM-Baseline
View on GitHub
The project is associated with the recently-launched INTERSPEECH 2025 Workshop on Multilingual Conversational Speech Language Model (MLC-…
☆51May 14, 2025Updated last year
jctian98 / e2e_lfmmi
View on GitHub
E2E system with LF-MMI; word N-gram for Mandarin
☆167Apr 29, 2022Updated 4 years ago
idiap / contextual-biasing-on-gpus
View on GitHub
Implementation of the contextual biasing for ASR decoding on GPUs without lattice generation. The code supports submission to Interspeech…
☆21Sep 25, 2023Updated 2 years ago
Splend1d / T5lephone
View on GitHub
Code for T5lephone: Bridging Speech and Text Self-supervised Models for Spoken Language Understanding via Phoneme level T5
☆19Nov 29, 2022Updated 3 years ago
Guardianzc / DISCOpen-MedBox-DialoDiagnosis
View on GitHub
An Open-Source Tool for Automatic Disease Diagnosis..
☆20Feb 25, 2024Updated 2 years ago
YosukeHiguchi / espnet
View on GitHub
End-to-End Speech Processing Toolkit
☆16Jan 20, 2025Updated last year
aispeech-lab / WASE
View on GitHub
PyTorch implementation of WASE described in our ICASSP 2021: "Wase: Learning When to Attend for Speaker Extraction in Cocktail Party Envi…
☆27Jan 11, 2022Updated 4 years ago
foamliu / Speech-Transformer
View on GitHub
PyTorch re-implementation of Speech-Transformer
☆102Nov 19, 2021Updated 4 years ago
thu-spmi / CAT
View on GitHub
CAT is more than a CRF-based ASR toolkit: it provides a complete workflow for data-efficient end-to-end ASR, supporting CTC, CTC-CRF, RNN…
☆368Feb 5, 2026Updated 5 months ago
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
DataoceanAI / CNVSRC2023Baseline
View on GitHub
Baseline system for CNVSRC2023 (Chinese Continuous Visual Speech Recognition Challenge 2023)
☆23Apr 27, 2024Updated 2 years ago
lukerbs / forcealign
View on GitHub
ForceAlign is a Python library for forced alignment of English text to English audio. You can use ForceAlign to get word or phoneme level…
☆27Dec 4, 2024Updated last year
Mddct / simple-tts
View on GitHub
（WIP）long form speech generatoins
☆30Apr 2, 2025Updated last year
George0828Zhang / torch_cif
View on GitHub
A fast parallel PyTorch implementation of the "CIF: Continuous Integrate-and-Fire for End-to-End Speech Recognition" https://arxiv.org/ab…
☆37Feb 10, 2024Updated 2 years ago
thuhcsi / Contextual-Biasing-Dataset
View on GitHub
open-source Mandarian biased word dataset
☆14Sep 21, 2023Updated 2 years ago
zyascend / End-to-End-Speech-Recognition-Learning
View on GitHub
ASR, End-to-End, end2end, Speech Recognition, 端到端语音识别
☆12Oct 25, 2020Updated 5 years ago
pyf98 / DPHuBERT
View on GitHub
INTERSPEECH 2023: "DPHuBERT: Joint Distillation and Pruning of Self-Supervised Speech Models"
☆118Jan 26, 2024Updated 2 years ago
eai-lab / CAFO
View on GitHub
[KDD 2024] CAFO: Feature-Centric Explanation on Time Series Classification
☆16Jun 17, 2024Updated 2 years ago
wengzhiwen / xhs_note_generator
View on GitHub
一键将视频转换为优质小红书笔记，自动优化内容和配图；追加了可以读取本地视频的功能
☆12Dec 22, 2024Updated last year
Simple, predictable pricing with DigitalOcean hosting • Ad
Always know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
scutcsq / Neural-Transducers-for-Two-Stage-Text-to-Speech-via-Semantic-Token-Prediction
View on GitHub
Unofficial pytorch reproduction for the paper "Utilizing Neural Transducers for Two-Stage Text-to-Speech via Semantic Token Prediction" (…
☆60Apr 4, 2024Updated 2 years ago
felixkreuk / UnsupSeg
View on GitHub
Self-Supervised Contrastive Learning for Unsupervised Phoneme Segmentation (INTERSPEECH 2020)
☆146Aug 5, 2022Updated 3 years ago
skit-ai / slu-prosody
View on GitHub
Code repository for the paper "Improving End-to-End SLU performance with Prosodic Attention and Distillation" accepted at Interspeech 202…
☆27May 17, 2023Updated 3 years ago
cuhealthybrains / MT-LLM
View on GitHub
The implementation for "Large Language Model Can Transcribe Speech in Multi-Talker Scenarios with Versatile Instructions"
☆51Apr 7, 2025Updated last year
yxduir / LLM-SRT
View on GitHub
☆28Mar 11, 2026Updated 4 months ago
gengxuelong / wenet_LLM_from_ASLP
View on GitHub
wenet_LLM_from_ASLP
☆15Nov 26, 2024Updated last year
facebookresearch / fbai-speech
View on GitHub
Repo for the FB AI Speech team.
☆27Aug 24, 2021Updated 4 years ago