Xianchao-Wu / wenet-deep-sparse-conformer
☆15Aug 25, 2022Updated 3 years ago
Alternatives and similar repositories for wenet-deep-sparse-conformer
Users that are interested in wenet-deep-sparse-conformer are comparing it to the libraries listed below
- ☆10Oct 16, 2025Updated 4 months ago
- The project for speech translation☆12Sep 28, 2023Updated 2 years ago
- Evaluation of STT models for the German language☆15Jan 22, 2022Updated 4 years ago
- Cross-Speaker Encoding Network for Multi-talker Speech Recognition☆11Mar 14, 2025Updated 11 months ago
- ☆23Dec 6, 2025Updated 2 months ago
- kaldi cnn-tdnnf baseline☆13Aug 31, 2021Updated 4 years ago
- ☆14Nov 26, 2024Updated last year
- NAR-BERT-ASR☆10Sep 27, 2021Updated 4 years ago
- Speech recognition module for Python, supporting several engines and APIs, online and offline.☆13Mar 9, 2022Updated 3 years ago
- Code for Findings of ACL 2022 Paper "Sentiment Word Aware Multimodal Refinement for Multimodal Sentiment Analysis with ASR Errors"☆26Jun 15, 2022Updated 3 years ago
- ☆27Oct 25, 2024Updated last year
- [ICASSP2023] Source code, model links and open test sets for paper SeACo-Paraformer.☆39Mar 15, 2024Updated last year
- A repository for trainable multi-speaker TTS☆14Nov 28, 2021Updated 4 years ago
- An extension of thu-spmi/CAT which contains a full-fledged implementation of CTC-CRF for Tensorflow.☆12Jul 5, 2021Updated 4 years ago
- ☆14Jun 12, 2015Updated 10 years ago
- Implementation of the paper "Confidence estimation for attention based sequence to sequence models for speech recognition"☆16May 9, 2021Updated 4 years ago
- E2E ASR system☆14Oct 20, 2022Updated 3 years ago
- Contrastive Learning for Improving ASR Robustness in Spoken Language Understanding☆11May 19, 2023Updated 2 years ago
- ☆16Nov 9, 2023Updated 2 years ago
- phone inventory library☆17May 15, 2023Updated 2 years ago
- Code for ACL-IJCNLP 2021 paper "N-Best-ASR-Transformer: Enhancing SLU Performance using Multiple ASR Hypotheses."☆17Nov 30, 2021Updated 4 years ago
- WarpRNNT loss ported in Numba CPU/CUDA for Pytorch☆17Mar 11, 2022Updated 3 years ago
- Simple inference for Vits2 TTS Using ONNXRUNTIME and espeak-ng on C++☆18Apr 17, 2024Updated last year
- Python package of MP-SENet from Explicit Estimation of Magnitude and Phase Spectra in Parallel for High-Quality Speech Enhancement.☆21Nov 1, 2024Updated last year
- AsoSoft Speech Corpus for Central-Kurdish Text-To-Speech☆19Jun 24, 2022Updated 3 years ago
- wake-up word emotion recognition [APSIPA 2022]☆17Nov 11, 2022Updated 3 years ago
- ☆18Mar 13, 2024Updated last year
- Code for the winning solution in the SE&R 2022 Challenge - SER track.☆16Mar 28, 2023Updated 2 years ago
- Open source cross-platform implementation of MRCP protocol☆20Mar 3, 2022Updated 3 years ago
- MnTTS: An Open-Source Mongolian Text-to-Speech Synthesis Dataset and Accompanied Baseline. (Accepted by IALP'2022)☆22Dec 5, 2022Updated 3 years ago
- [SLT'24] Mamba-based Decoder-Only Approach for Speech Recognition☆18Dec 1, 2024Updated last year
- ☆15Jul 4, 2024Updated last year
- Error correction back-end for speaker diarization☆18Sep 26, 2023Updated 2 years ago
- ☆17Jul 22, 2024Updated last year
- [ASRU 2021] Efficient Conformer: Progressive Downsampling and Grouped Attention for Automatic Speech Recognition☆219Jun 22, 2023Updated 2 years ago
- Explore different ways to mix speech models (wav2vec2, hubert) and NLP models (BART, T5, GPT) together☆46Jul 3, 2025Updated 7 months ago
- Code for paper "Dual-Path Style Learning for End-to-End Noise-Robust Speech Recognition"☆43May 23, 2023Updated 2 years ago
- ☆22Jun 30, 2021Updated 4 years ago
- End-to-End Keyword Spotting (E2E-KWS) using a character level LSTM☆43Nov 18, 2022Updated 3 years ago