NKU-HLT/KNN-CTC

[ICASSP 2024] KNN-CTC: Enhancing ASR via Retrieval of CTC Pseudo Labels

☆42

Alternatives and similar repositories for KNN-CTC

Users that are interested in KNN-CTC are comparing it to the libraries listed below.

NKU-HLT / Fusion-Insider-threat-detection
[ICANN 2023] Anomaly-Based Insider Threat Detection via Hierarchical Information Fusion
☆18 · Nov 20, 2023 · Updated 2 years ago
NKU-HLT / Emotion-Recognition
Paper List
☆18 · Jul 2, 2025 · Updated last year
NKU-HLT / Role-Play-Prompting
[NAACL 2024] Better Zero-Shot Reasoning with Role-Play Prompting
☆36 · Nov 14, 2023 · Updated 2 years ago
NKU-HLT / RAMP_MOS
[IEEE TASLP] Retrieval-Augmented MOS Prediction with Prior Knowledge Integration
☆33 · Mar 23, 2025 · Updated last year
NKU-HLT / PromptRank
[ACL 2023] PromptRank: Unsupervised Keyphrase Extraction Using Prompt
☆51 · May 16, 2023 · Updated 3 years ago
NKU-HLT / DIFFA
[AAAI 2026 & ACL 2026] The official implementation of the DIFFA series for dLLM-based large audio language model
☆83 · Apr 7, 2026 · Updated 3 months ago
xuchennlp / S2T
The project for speech translation
☆12 · Sep 28, 2023 · Updated 2 years ago
NKU-HLT / MusicEval-baseline
☆12 · Apr 18, 2025 · Updated last year
NKU-HLT / PB-DSR
[Interspeech 2024] Enhancing Dysarthric Speech Recognition for Unseen Speakers via Prototype-Based Adaptation
☆14 · Nov 28, 2024 · Updated last year
NKU-HLT / AudioEditor
☆47 · Apr 2, 2025 · Updated last year
7Xin / DPI-TTS
☆13 · Sep 12, 2024 · Updated last year
NKU-HLT / SpeechLLM-as-Judges
[ACL 2026]
☆25 · Dec 6, 2025 · Updated 7 months ago
fyvo / WMT-Biomed-Test
☆13 · Aug 23, 2024 · Updated last year
uthree / ddsp-vocoder
☆12 · Nov 7, 2024 · Updated last year
NKU-HLT / DiffEditor
[NCMMSC]
☆16 · Feb 19, 2025 · Updated last year
tzyll / ChineseHP
Dataset for Pinyin Regularization in Error Correction for Chinese Speech Recognition with Large Language Models in Interspeech 2024.
☆16 · Jul 4, 2024 · Updated 2 years ago
ryota-komatsu / speaker_disentangled_hubert
Official repository of the IEEE OJSP paper "Speaker-Disentangled Chunk-Wise Regression for Syllabic Tokenization"
☆46 · Updated this week
pigzach / MagicSpeechASR
magicspeech competition recipe
☆18 · Jun 29, 2020 · Updated 6 years ago
adelacvg / DPTTS
An AR+AR TTS attempt.
☆18 · Jan 13, 2025 · Updated last year
tuanio / nextformer
PyTorch implementation of "Nextformer: A ConvNeXt Augmented Conformer For End-To-End Speech Recognition"
☆10 · Dec 15, 2022 · Updated 3 years ago
semanticVAD / testsets
Testing sets for semanticVAD
☆20 · Feb 18, 2025 · Updated last year
bagustris / s3prl-ser
S3PRL for Speech Emotion Recognition (see s3prl > downstream)
☆15 · Feb 28, 2026 · Updated 5 months ago
huutuongtu / Lightvoc
LightVoc: An Upsampling-Free GAN Vocoder Based on Conformer and Inverse Short-Time Fourier Transform
☆18 · May 17, 2024 · Updated 2 years ago
wonjune-kang / expressive-speech-retrieval
Expressive Speech Retrieval using Natural Language Descriptions of Speaking Style
☆15 · Aug 18, 2025 · Updated 11 months ago
reppy4620 / convnext_tts
Unofficial implementation of ConvNeXt-TTS powered by lightning
☆18 · Oct 20, 2024 · Updated last year
NKU-HLT / EmotionTalk
Dataset [ACL 2026]
☆35 · Jul 31, 2025 · Updated 11 months ago
Aisaka0v0 / TS-Whisper
☆33 · Jun 12, 2025 · Updated last year
wenet-e2e / wesr
We Speech Transcript based on LLM, in 300 lines of code.
☆182 · Jun 20, 2025 · Updated last year
caizexin / GenVC
Self-supervised Generative LM-based Voice Conversion
☆58 · Apr 24, 2025 · Updated last year
Mddct / WeUSM
☆13 · Mar 30, 2023 · Updated 3 years ago
MuyangDu / T5Voice
T5Voice is a lightweight PyTorch implementation of T5-based text-to-speech synthesis, supporting both streaming and non-streaming speech …
☆28 · Nov 7, 2025 · Updated 8 months ago
ZehuaKcrissLi / GTR-Voice
☆16 · Nov 11, 2024 · Updated last year
zqs01 / data2vecnoisy
☆11 · Oct 20, 2022 · Updated 3 years ago
p1an-lin-jung / wv_tts
☆19 · Mar 22, 2024 · Updated 2 years ago
lucadellalib / discrete-wavlm-codec
A neural speech codec based on discrete WavLM representations
☆26 · Aug 28, 2024 · Updated last year
YUCHEN005 / DPSL-ASR
Code for paper "Dual-Path Style Learning for End-to-End Noise-Robust Speech Recognition"
☆44 · May 23, 2023 · Updated 3 years ago
the-bird-F / GLM-Voice-RAG
[EMNLP 2025 Findings] A complete cross-modal RAG system for end-to-end speech-to-speech large models, including ASR-based Retrieval and E…
☆31 · Jul 11, 2025 · Updated last year
ssmlkl / MnTTS2
This is the experimental description of MnTTS2.
☆12 · Apr 11, 2024 · Updated 2 years ago
shengcanxu / canoSpeech
text to speech
☆10 · Mar 19, 2024 · Updated 2 years ago