Xianchao-Wu / wenet-deep-sparse-conformerLinks
☆15Updated 3 years ago
Alternatives and similar repositories for wenet-deep-sparse-conformer
Users that are interested in wenet-deep-sparse-conformer are comparing it to the libraries listed below
Sorting:
- kaldi cnn-tdnnf baseline☆13Updated 4 years ago
- ☆14Updated last year
- The project for speech translation☆11Updated last year
- Code for paper "Dual-Path Style Learning for End-to-End Noise-Robust Speech Recognition"☆42Updated 2 years ago
- [ICASSP 2022] Improving End-to-End Contextual Speech Recognition with Fine-Grained Contextual Knowledge Selection☆25Updated 2 years ago
- Code for the Interspeech 2024 paper "MM-KWS: Multi-modal Prompts for Multilingual User-defined Keyword Spotting"☆36Updated 4 months ago
- One command to build TLG.fst for WeNet.☆31Updated 2 years ago
- This repository contains the code for our upcoming paper An Investigation of End-to-End Models for Robust Speech Recognition at ICASSP 20…☆48Updated 8 months ago
- End-to-end MOdeling of ASR (Automatic Speech Recognition)☆33Updated 2 years ago
- [ICASSP2023] Source code, model links and open test sets for paper SeACo-Paraformer.☆33Updated last year
- ☆11Updated last year
- Optimized loss based on cross-entropy (CE), like MWER (minimum WER) Loss with beam search and negative sampling strategy, Smoothed Max Po…☆22Updated 11 months ago
- Went online decode demo☆31Updated 4 years ago
- [ICASSP 2020] CIF: Continuous Integrate-and-Fire for End-to-End Speech Recognition (A PyTorch implementation of Continuous Integrate-and-…☆75Updated 8 months ago
- Repo for the FB AI Speech team.☆26Updated 4 years ago
- Computes the MWER (minimum WER) Loss with CTC beam search. Knowledge distillation for CTC loss.☆60Updated 2 years ago
- ☆43Updated 2 years ago
- ☆15Updated last year
- ☆15Updated 3 years ago
- repository for paper "Audio-Visual Speech Recognition in MISP2021 Challenge: Dataset Release and Deep Analysis"☆17Updated 3 years ago
- This repository contains code for applying Data2Vec to pretrain Keyword Transformer model as described in "Improving Label-Deficient Keyw…☆29Updated 6 months ago
- Speech samples and code of BEdit-TTS☆34Updated last year
- ☆29Updated 3 years ago
- ☆11Updated last year
- A fast parallel PyTorch implementation of the "CIF: Continuous Integrate-and-Fire for End-to-End Speech Recognition" https://arxiv.org/ab…☆33Updated last year
- End-to-End Keyword Spotting (E2E-KWS) using a character level LSTM☆42Updated 2 years ago
- AutoPrep: An Automatic Preprocessing Framework for In-the-Wild Speech Data☆32Updated last year
- ☆14Updated 3 years ago
- Streaming Audiotransformers for online Audio tagging☆47Updated last year
- End-to-End Speech Processing Toolkit☆16Updated 7 months ago