wenet-e2e / wenet
Production First and Production Ready End-to-End Speech Recognition Toolkit
☆4,268Updated last week
Alternatives and similar repositories for wenet:
Users that are interested in wenet are comparing it to the libraries listed below
- ☆979Updated last week
- An implementation of Microsoft's "FastSpeech 2: Fast and High-Quality End-to-End Text to Speech"☆1,906Updated last year
- FSA/FST algorithms, differentiable, with PyTorch compatibility.☆1,147Updated 2 weeks ago
- End-to-End Speech Processing Toolkit☆8,686Updated this week
- 基于PaddlePaddle实现端到端中文语音识别,从入门到实战,超简单的入门案例,超实用的企业项目。支持当前最流行的DeepSpeech2、Conformer、Squeezeformer模型☆832Updated last week
- chinese speech pretrained models☆1,058Updated 4 months ago
- 一个执着于让CPU\端侧-Model逼近GPU-Model性能的项目,CPU上的实时率(RTF)小于0.1☆469Updated 3 months ago
- Research and Production Oriented Speaker Verification, Recognition and Diarization Toolkit☆799Updated last week
- Pytorch实现的流式与非流式的自动语音识别框架,同时兼容在线和离线识别,目前支持Conformer、Squeezeformer、DeepSpeech2模型,支持多种数据增强方法。☆630Updated last week
- an open-source implementation of sequence-to-sequence based speech processing engine☆929Updated 2 years ago
- Fine-tune the Whisper speech recognition model to support training without timestamp data, training with timestamp data, and training wit…☆936Updated 3 weeks ago
- Speech-to-text server framework with next-gen Kaldi☆587Updated this week
- ☆1,405Updated 11 months ago
- TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for Tensorflow 2 (supported including English, French, Korean, Chinese, Germa…☆3,876Updated 6 months ago
- Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text fronten…☆11,391Updated this week
- A 10000+ hours dataset for Chinese speech recognition☆512Updated last year
- Self-Supervised Speech Pre-training and Representation Learning Toolkit☆2,310Updated 2 weeks ago
- Production First and Production Ready End-to-End Keyword Spotting Toolkit☆491Updated 5 months ago
- A PyTorch-based Speech Toolkit☆9,189Updated this week
- pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by pytorch,…☆2,374Updated 2 years ago
- Tools for handling speech data in machine learning projects.☆972Updated 3 weeks ago
- TensorFlowASR: Almost State-of-the-art Automatic Speech Recognition in Tensorflow 2. Supported languages that can use characters or subw…☆953Updated this week
- SpeechIO Leaderboard: a large, robust, comprehensive, benchmarking platform for Automatic Speech Recognition.☆463Updated last week
- Command line utility for forced alignment using Kaldi☆1,388Updated last month
- Text Normalization & Inverse Text Normalization☆512Updated 2 months ago
- Best practice TTS based on BERT and VITS with some Natural Speech Features Of Microsoft; Support ONNX streaming out!☆1,173Updated 11 months ago
- Chinese text normalization for speech processing☆645Updated last year
- The dataset of Speech Recognition☆399Updated 3 weeks ago
- Automatic Speech Recognition (ASR), Speaker Verification, Speech Synthesis, Text-to-Speech (TTS), Language Modelling, Singing Voice Synth…☆3,007Updated last year
- Open-Source Toolkit for End-to-End Speech Recognition leveraging PyTorch-Lightning and Hydra.☆689Updated last year