Tzenthin / wenet_mnnView external linksLinks
语音识别模型pytorch转ONNX转MNN,C++实现部署
☆83Sep 1, 2022Updated 3 years ago
Alternatives and similar repositories for wenet_mnn
Users that are interested in wenet_mnn are comparing it to the libraries listed below
Sorting:
- ☆40Aug 15, 2021Updated 4 years ago
- ☆33Aug 6, 2021Updated 4 years ago
- An upgrade framework for train and validate compare with icefall using Lightning.☆15Mar 26, 2025Updated 10 months ago
- ☆32Oct 28, 2022Updated 3 years ago
- ☆29Feb 4, 2025Updated last year
- 这是一个用C++实现ASR推理的项目,它依赖很少,安装也很简单,推理速度很快,在树莓派4B等ARM平台也可以流畅的运行。 支持的模型是由Google的Transformer模型中优化而来,数据集是 开源wenetspeech(10000+小时)或阿里私有数据集(60000+小…☆545Mar 19, 2023Updated 2 years ago
- ☆28Oct 7, 2025Updated 4 months ago
- Went online decode demo☆31Apr 28, 2021Updated 4 years ago
- (WIP)long form speech generatoins☆31Apr 2, 2025Updated 10 months ago
- 将normalize过的中文文本,做逆向normalize。具体功能即实现 chinese_text_normalization的逆向版本。☆13Apr 7, 2021Updated 4 years ago
- ☆55Jan 13, 2023Updated 3 years ago
- Code associated with the paper: CTC-DRO: Robust Optimization for Reducing Language Disparities in Speech Recognition.☆15May 16, 2025Updated 9 months ago
- some ncnn demos of FunASR☆28Sep 23, 2024Updated last year
- E2E TTS using Conditional Flow Matching (Experimental*)☆71Nov 10, 2023Updated 2 years ago
- faster inference☆28Jan 20, 2025Updated last year
- A Tiny Project For ASR model training and Deployment☆26Oct 14, 2022Updated 3 years ago
- KWS demo based on CTC prefix beam search.☆17Oct 21, 2023Updated 2 years ago
- Megatts2 use HierSpeechpp's vocoder☆18Dec 2, 2024Updated last year
- One command to build TLG.fst for WeNet.☆30Oct 11, 2022Updated 3 years ago
- LIGHTVOC AN UPSAMPLING-FREE GAN VOCODER BASED ON CONFORMER AND INVERSE SHORT-TIME FOURIER TRANSFORM☆18May 17, 2024Updated last year
- Minimize kaldi nnet3 chain decoder☆45Jan 10, 2020Updated 6 years ago
- simple dnn based vad☆70Dec 2, 2018Updated 7 years ago
- 单独维护的中文TTS☆34Oct 28, 2022Updated 3 years ago
- A cross platform implementation of Text-to-Speech based on ONNXRuntime.☆32May 10, 2023Updated 2 years ago
- A library for adding punctuation into a text from ASR.☆19May 8, 2023Updated 2 years ago
- 来自于文章Paraformer-v2: An improved non-autoregressive transformer for noise-robust speech recognition☆27Nov 20, 2024Updated last year
- ☆16Jun 13, 2022Updated 3 years ago
- Open Source Speech/Text Data on AI☆19Sep 13, 2022Updated 3 years ago
- 使用ONNXRuntime部署LSTR基于Transformer的端到端实时车道线检测,包含C++和Python两个版本的程序☆21Jan 27, 2023Updated 3 years ago
- magicspeech competition recipe☆18Jun 29, 2020Updated 5 years ago
- simple energy vad☆19Jun 3, 2017Updated 8 years ago
- 使用OpenCV部署CoupledTPS,包含了肖像矫正,不规则边界的图像矩形化,旋转图像矫正,三个模型。依然是包含C++和Python两个版本的程序☆20Jul 4, 2024Updated last year
- A enterprise-grade Voice Activity Detector from modelscope and funasr.☆127Apr 26, 2023Updated 2 years ago
- singing voice conversion without f0☆23May 10, 2023Updated 2 years ago
- libvits-ncnn is an ncnn implementation of the VITS library that enables cross-platform GPU-accelerated speech synthesis.🎙️💻☆62May 6, 2023Updated 2 years ago
- Torch Audio Forced Aligner for Mixed Chinese (Mandarin or Cantonese) and English.☆62Sep 5, 2025Updated 5 months ago
- Computes the MWER (minimum WER) Loss with CTC beam search. Knowledge distillation for CTC loss.☆59Sep 6, 2023Updated 2 years ago
- ☆23Oct 17, 2024Updated last year
- g2p for english tts☆19Nov 10, 2022Updated 3 years ago