zhejianglab / Data-Processing-Toolkit-for-LLMs

☆14

Related projects

Alternatives and complementary repositories for Data-Processing-Toolkit-for-LLMs

jctian98 / e2e_lfmmi
E2E system with LF-MMI; word N-gram for Mandarin
☆163 Updated 2 years ago
MagicHub-io / CSASR_Challenge
☆11 Updated 2 years ago
double22a / asr_nlp_paper_code
Papers of ASR, Tools of ASR
☆38 Updated last year
MingLunHan / CIF-HieraDist
[INTERSPEECH 2023] Knowledge Transfer from Pre-trained Language Models to Cif-based Recognizers via Hierarchical Distillation
☆35 Updated last year
THUsatlab / BERT-LID
Leveraging BERT to Improve Spoken Language Identification
☆14 Updated last year
billzyx / awesome-dementia-detection
Paper list of dementia detection
☆24 Updated last month
felixfuyihui / AISHELL-4
☆119 Updated 3 years ago
qinxiaoyi / Simple-Attention-Module-based-Speaker-Verification-with-Iterative-Noisy-Label-Detection
☆13 Updated 2 years ago
MingLunHan / CIF-PyTorch
[ICASSP 2020] CIF: Continuous Integrate-and-Fire for End-to-End Speech Recognition (A PyTorch implementation of Continuous Integrate-and-…
☆67 Updated last year
lenovo-voice / THE-2020-PERSONALIZED-VOICE-TRIGGER-CHALLENGE-BASELINE-SYSTEM
☆51 Updated 3 years ago
KeSpeech / KeSpeech
The repo provides information about KeSpeech dataset.
☆113 Updated 2 years ago
NKU-HLT / RAMP_MOS
Retrieval-Augmented MOS Prediction with Prior Knowledge Integration
☆11 Updated 2 weeks ago
YiwenShaoStephen / pychain
PyTorch implementation of LF-MMI for End-to-end ASR
☆216 Updated 3 years ago
thuhcsi / SECap
☆138 Updated 4 months ago
tzyll / kaldi
☆107 Updated 3 years ago
qinxiaoyi / Cross-Age_Speaker_Verification
☆25 Updated 2 years ago
zeroQiaoba / ivector-xvector
Extract xvector and ivector under kaldi
☆109 Updated 5 years ago
lawlict / ECAPA-TDNN
☆98 Updated 3 years ago
phonexiaresearch / VBx-training-recipe
☆29 Updated 2 years ago
bliunlpr / Robust_e2e_gan
PyTorch implementation of "Jointly Adversarial Enhancement Training for Robust End-to-End Speech Recognition"
☆19 Updated 5 years ago
SpeechColab / GigaSpeech2
An evolving, large-scale and multi-domain ASR corpus for low-resource languages with automated crawling, transcription and refinement
☆116 Updated last week
upskyy / Transformer-Transducer
PyTorch implementation of "Transformer Transducer: A Streamable Speech Recognition Model with Transformer Encoders and RNN-T Loss" (ICASS…
☆101 Updated 2 years ago
MrSupW / ICMC-ASR_Baseline
The baseline system for the ICASSP2024 ICMC-ASR Challenge.
☆46 Updated 11 months ago
mycrazycracy / speaker-embedding-with-phonetic-information
The code for the Interspeech paper "Speaker Embedding Extraction with Phonetic Information"
☆43 Updated 5 years ago
Liu-Feng-deeplearning / TTS-frontend
TTS-frontend with Bert and CRF/lstm (For Tacotron)
☆50 Updated 4 years ago
XiaoMi / dasheng
Official PyTorch code for Deep Audio-Signal Holistic Embeddings
☆55 Updated last month
ductuantruong / enskd
Official implementation of the ICASSP 2024 paper: Emphasized Non-Target Speaker Knowledge in Knowledge Distillation for Speaker Verificat…
☆16 Updated 7 months ago
tencent-ailab / 3m-asr
3M: Multi-loss, Multi-path and Multi-level Neural Networks for speech recognition
☆118 Updated 2 years ago
yufan-aslp / AliMeeting
The project is associated with the recently-launched ICASSP 2022 Multi-channel Multi-party Meeting Transcription Challenge (M2MeT) to pro…
☆113 Updated 2 years ago