gheyret / uyghur-asr-transformerLinks
Speech Recognition for Uyghur using Speech transformer
☆27Updated 4 years ago
Alternatives and similar repositories for uyghur-asr-transformer
Users that are interested in uyghur-asr-transformer are comparing it to the libraries listed below
Sorting:
- Speech Recognition for Uyghur using deep learning☆42Updated 4 years ago
- A enterprise-grade Chinese-English code switch punctuator from funasr.☆29Updated last year
- Python Wrapper of Silero VAD☆62Updated 7 months ago
- Code for paper "Dual-Path Style Learning for End-to-End Noise-Robust Speech Recognition"☆43Updated 2 years ago
- Implementation of the paper "Confidence estimation for attention based sequence to sequence models for speech recognition"☆16Updated 4 years ago
- Went online decode demo☆31Updated 4 years ago
- Rank 7th/1817 in the 2018 iFLYTEK AI Developer Challenge with acc 0.82 for the ten Chinese dialects classification task, this code was p…☆13Updated 2 years ago
- Code for the Interspeech 2023 paper "A Joint Model for Pronunciation Assessment and Mispronunciation Detection and Diagnosis with Multi-t…☆24Updated 2 years ago
- 将normalize过的中文文本,做逆向normalize。具体功能即实现 chinese_text_normalization的逆向版本。☆13Updated 4 years ago
- A library for adding punctuation into a text from ASR.☆19Updated 2 years ago
- [ICASSP2021] Data preperation scripts, training pipeline and baseline experiment results for the Interspeech 2020 Accented English Speech…☆56Updated 5 years ago
- End-to-End Keyword Spotting (E2E-KWS) using a character level LSTM☆42Updated 3 years ago
- 端到端语音识别实现;包含LAS、CTC、RNNT解码方式,模型SA(MHA)、LSTM、CNN、DFSMN等☆15Updated 4 years ago
- 说话人识别(声纹识别)算法的Python实现。包括GMM(已完成)、GMM-UBM、ivector、基于深度学习的声纹识别(self-attention已完成)。☆104Updated 2 years ago
- ☆96Updated last year
- Papers of ASR, Tools of ASR☆41Updated 10 months ago
- E2E ASR system☆14Updated 3 years ago
- MagicData-RAMC Dataset and Baseline☆56Updated 3 years ago
- Code for the Interspeech 2024 paper "MM-KWS: Multi-modal Prompts for Multilingual User-defined Keyword Spotting"☆41Updated 7 months ago
- ☆12Updated last year
- ☆41Updated 2 months ago
- fast SpecAugmentation code with numpy and scipy☆31Updated 6 years ago
- The repo provides information about KeSpeech dataset.☆164Updated 3 years ago
- Optimized loss based on cross-entropy (CE), like MWER (minimum WER) Loss with beam search and negative sampling strategy, Smoothed Max Po…☆23Updated last year
- The project is associated with the recently-launched ICASSP 2022 Multi-channel Multi-party Meeting Transcription Challenge (M2MeT) to pro…☆129Updated 3 years ago
- Speaker verification using ResnetSE (EER=0.0093) and ECAPA-TDNN☆96Updated 4 years ago
- [Tiny KWS] SparkNet: Sparse Binarization for Fast Keyword Spotting☆16Updated 3 months ago
- ☆18Updated last year
- 基于Kaldi的小词汇量汉语语音识别,使用DNN训练☆27Updated 6 years ago
- repository for paper "Audio-Visual Speech Recognition in MISP2021 Challenge: Dataset Release and Deep Analysis"☆18Updated 3 years ago