10jqka-aicubes / code-switching-contestLinks

基于单语种语料的中英混合语音识别算法-同花顺算法挑战赛-2021年9-10月双月赛

☆14

Alternatives and similar repositories for code-switching-contest

Users that are interested in code-switching-contest are comparing it to the libraries listed below

Sorting:

VKW2021 / kaldi-baseline
kaldi cnn-tdnnf baseline
☆13Updated 3 years ago
jiay7 / wenet_onlinedecode
Went online decode demo
☆30Updated 4 years ago
NKU-HLT / KNN-CTC
[ICASSP 2024] KNN-CTC: Enhancing ASR via Retrieval of CTC Pseudo Labels
☆39Updated last year
R1ckShi / SeACo-Paraformer
[ICASSP2023] Source code, model links and open test sets for paper SeACo-Paraformer.
☆32Updated last year
pengzhendong / welm
One command to build TLG.fst for WeNet.
☆31Updated 2 years ago
tzyll / ChineseHP
☆14Updated last year
SSTC-Challenge / SSTC2024_baseline_system
☆10Updated last year
yuhangear / wenet-android
☆12Updated 3 years ago
Liangzheng-ZL / BEdit-TTS
Speech samples and code of BEdit-TTS
☆33Updated last year
mubingshen / MLC-SLM-Baseline
The project is associated with the recently-launched INTERSPEECH 2025 Workshop on Multilingual Conversational Speech Language Model (MLC-…
☆39Updated 2 months ago
yucongzh / online_speaker_diarization
☆14Updated 3 years ago
Ephrem-ETH / E2E-KWS
End-to-End Keyword Spotting (E2E-KWS) using a character level LSTM
☆41Updated 2 years ago
datemoon / ASR-decoder
it's ASR decoder and make graph project
☆32Updated 3 years ago
shaojinding / Adversarial-Many-to-Many-VC
[InterSpeech 2020] "Improving the Speaker Identity of Non-Parallel Many-to-Many VoiceConversion with Adversarial Speaker Recognition" by …
☆39Updated 2 years ago
frank613 / CTC-based-GOP
This repo related to the paper "A Framework for Phoneme-Level Pronunciation Assessment Using CTC" for INTERSPEECH2024
☆23Updated 8 months ago
Slyne / ctc_decoder
A ctc decoder for both online and offline asr model
☆64Updated last year
janson9192 / autokws2021
☆13Updated 4 years ago
mispchallenge / MISP2021-AVSR
repository for paper "Audio-Visual Speech Recognition in MISP2021 Challenge: Dataset Release and Deep Analysis"
☆17Updated 3 years ago
snsun / kaldi-decoder-code-reading
☆32Updated 2 years ago
LeonWlw / asr_blockformer
E2E ASR system
☆14Updated 2 years ago
Zeqiang-Lai / Prosody_Prediction
Predict prosody labels for Chinese sentences.
☆41Updated 3 years ago
Mddct / cosyvoice2-flow-optimized
faster inference
☆28Updated 6 months ago
gteu / realtime-ppg-vc
Voice conversion model for real-time speech synthesis using PPG (Phonetic PosteriorGram) as an intermediate feature, written in Pytorch.
☆28Updated 3 years ago
BakerBunker / SALT
[ASRU 2023] Code of paper SALT: Distinguishable Speaker Anonymization Through Latent Space Transformation
☆20Updated 11 months ago
MagicHub-io / MagicData-RAMC
MagicData-RAMC Dataset and Baseline
☆54Updated 2 years ago
pkufool / simple-wer
A simple command line tool to calculate WER for ASR.
☆14Updated 9 months ago
prairie-schooner / wav2vec-vc
☆11Updated 2 years ago
csukuangfj / kaldi-hmm-gmm
☆25Updated 9 months ago
ductuantruong / speaker_age_estimation_ssl_study
Official implementation of the APSIPA 2022 paper: Exploring Speaker Age Estimation on Different Self-Supervised Learning Models
☆14Updated 2 years ago
thu-spmi / SPMILM
A SPMI Lab toolkit for language models.
☆11Updated 8 years ago