pengchengguo / espnet
End-to-End Speech Processing Toolkit
☆9Updated 5 months ago
Alternatives and similar repositories for espnet:
Users that are interested in espnet are comparing it to the libraries listed below
- ☆31Updated 2 years ago
- End-to-End Keyword Spotting (E2E-KWS) using a character level LSTM☆39Updated 2 years ago
- repository for paper "Audio-Visual Speech Recognition in MISP2021 Challenge: Dataset Release and Deep Analysis"☆16Updated 2 years ago
- Production first, nn-based on-device signal processing toolkit.☆64Updated last year
- The official repo of UL-UNAS, an ultra-lightweight SE model.☆38Updated last month
- ☆38Updated 8 months ago
- Went online decode demo☆29Updated 3 years ago
- The baseline system for the ICASSP2024 ICMC-ASR Challenge.☆50Updated last year
- List of NN based singal processing papers☆20Updated last year
- Utilizes ONNX Runtime for audio denoising.☆44Updated this week
- Code for the Interspeech 2024 paper "MM-KWS: Multi-modal Prompts for Multilingual User-defined Keyword Spotting"☆27Updated 4 months ago
- 为音频加混响的代码☆26Updated last year
- Tiny Transducer: A Highly-Efficient Speech Recognition Model on Edge Devices☆23Updated 2 years ago
- simple dnn based vad☆70Updated 6 years ago
- Python wrapper for OpenFST and its extensions from Kaldi. Also support reading/writing ark/scp files☆51Updated 9 months ago
- eran-shahar / Double-talk-Detection-aided-Residual-Echo-Suppression-via-Spectrogram-Masking-and-Refinement☆26Updated 2 years ago
- Keyword spotting, Speech wake_up, by pytorch, DNN, CNN, TDNN, DFSMN, LSTM☆47Updated 3 years ago
- We design a spectral compression mapping (SCM) for full-band speech enhancement, and propose a two-stage stream named MHA-DPCRN☆24Updated 2 years ago
- ☆13Updated last year
- 达摩fsmn vad c++推理服务☆13Updated 2 years ago
- The official repo of "HGCN: Harmonic Gated Compensation Network For Speech Enhancement"☆57Updated 3 years ago
- This is a mandarin version of speech separation dataset like WSJMix and LibriMix☆11Updated 2 years ago
- Apply Score diffusion to improve speech signals recorded under various adverse conditions and distortions, including noise, reverberation…☆61Updated 8 months ago
- Huawei Grad-TTS for Chinese☆50Updated last year
- ☆45Updated 2 years ago
- ☆26Updated 2 years ago
- The implementation of TaylorBeamformer, which is in submission to Interspeech2022☆40Updated 2 years ago
- A list of papers for child ASR☆39Updated 6 months ago
- ☆50Updated 4 years ago
- multi-scale time domain speaker extraction☆62Updated 3 years ago