huismiling / wenet_trt8
☆75 Jun 27, 2022 Updated 3 years ago
Alternatives and similar repositories for wenet_trt8
Users that are interested in wenet_trt8 are comparing it to the libraries listed below
- ☆70 Dec 9, 2022 Updated 3 years ago
- Open Source Speech/Text Data on AI ☆19 Sep 13, 2022 Updated 3 years ago
- A SPMI Lab toolkit for language models. ☆11 Apr 12, 2017 Updated 8 years ago
- magicspeech competition recipe ☆18 Jun 29, 2020 Updated 5 years ago
- ☆19 Mar 22, 2024 Updated last year
- TensorRT 2022 second-round solution: TensorRT inference optimization of MST++, the first Transformer-based image reconstruction model ☆143 Jul 6, 2022 Updated 3 years ago
- ☆40 Aug 15, 2021 Updated 4 years ago
- ☆15 Apr 2, 2025 Updated 10 months ago
- Tacotron text to speech in C++ (synthesize only) ☆77 Oct 17, 2019 Updated 6 years ago
- ☆34 Jul 16, 2019 Updated 6 years ago
- ☆32 Oct 28, 2022 Updated 3 years ago
- Unofficial pytorch reproduction for the paper "Utilizing Neural Transducers for Two-Stage Text-to-Speech via Semantic Token Prediction" (… ☆61 Apr 4, 2024 Updated last year
- Semi-supervised Learning for Multi-speaker Text-to-speech Synthesis Using Discrete Speech Representation ☆39 Jul 16, 2020 Updated 5 years ago
- faster inference ☆28 Jan 20, 2025 Updated last year
- streaming attention networks for end-to-end automatic speech recognition ☆55 May 6, 2020 Updated 5 years ago
- Multispeaker Community Vocoder Model for DiffSinger ☆39 Aug 11, 2025 Updated 6 months ago
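- A C++ project for ASR inference. It has few dependencies, is simple to install, runs inference fast, and also runs smoothly on ARM platforms such as the Raspberry Pi 4B. The supported models are optimized from Google's Transformer model, and the datasets are the open-source wenetspeech (10000+ hours) or Alibaba's private dataset (60000+ hours… ☆545 Mar 19, 2023 Updated 2 years ago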
- ☆76 Mar 18, 2022 Updated 3 years ago
- ☆33 Nov 29, 2022 Updated 3 years ago
- A robust pitch tracker using synchro-squeezed fft and frequency domain autocorrelation ☆36 Jan 17, 2024 Updated 2 years ago
- ☆33 Jun 29, 2023 Updated 2 years ago
- ☆147 Aug 2, 2020 Updated 5 years ago
- PPSpeech: Phrase based Parallel End-to-End TTS System ☆35 Aug 31, 2020 Updated 5 years ago
- ☆276 Jan 15, 2021 Updated 5 years ago
- Standalone implementation of the CUDA-accelerated WFST Decoder available in Riva ☆91 Feb 18, 2025 Updated 11 months ago
- Real-time melgan based on cpu !!! ☆13 Dec 3, 2019 Updated 6 years ago
- Grapheme-to-Phoneme for Mixed Chinese (Mandarin or Cantonese) and English. ☆114 Dec 2, 2025 Updated 2 months ago
- ☆56 Jul 17, 2023 Updated 2 years ago
- ☆23 Oct 17, 2024 Updated last year
- PyTorch implementation of LF-MMI for End-to-end ASR ☆220 Jan 14, 2021 Updated 5 years ago
- Google's TPGST reimplementation. ☆34 Dec 11, 2019 Updated 6 years ago
- An extension of thu-spmi/CAT which contains a full-fledged implementation of CTC-CRF for Tensorflow. ☆12 Jul 5, 2021 Updated 4 years ago
- ☆13 Oct 27, 2021 Updated 4 years ago
- LLaST: Improved End-to-end Speech Translation System Leveraged by Large Language Models ☆25 Aug 11, 2024 Updated last year
- Based on https://github.com/fatchord/WaveRNN ☆24 May 3, 2020 Updated 5 years ago
- ICASSP 2024 - Generative De-Quantization for Neural Speech Codec via Latent Diffusion. ☆55 Nov 16, 2025 Updated 2 months ago
- ICASSP 2020 ESPnet-TTS: Merlin baseline system ☆36 Oct 28, 2019 Updated 6 years ago
- The RWTH ASR Toolkit. ☆58 Updated this week
- Towards hot directions in industrial end to end speech recognition ☆331 Nov 30, 2021 Updated 4 years ago