athena-team / athena
an open-source implementation of sequence-to-sequence based speech processing engine
☆949Updated last year
Related projects: ⓘ
- FSA/FST algorithms, differentiable, with PyTorch compatibility.☆1,108Updated last week
- 一个执着于让CPU\端侧-Model逼近GPU-Model性能的项目,CPU上的实时率(RTF)小于0.1☆460Updated last year
- Chinese text normalization for speech processing☆620Updated last year
- ☆894Updated last week
- A 10000+ hours dataset for Chinese speech recognition☆490Updated last year
- Production First and Production Ready End-to-End Keyword Spotting Toolkit☆436Updated last month
- The dataset of Speech Recognition☆382Updated 2 months ago
- a lightweight speech processing toolkit based on Pytorch and (Py)Kaldi☆338Updated 3 years ago
- A PyTorch implementation of Speech Transformer, an End-to-End ASR with Transformer network on Mandarin Chinese.☆768Updated last year
- Research and Production Oriented Speaker Verification, Recognition and Diarization Toolkit☆670Updated this week
- PAddle PARAllel text-to-speech toolKIT (supporting Tacotron2, Transformer TTS, FastSpeech2/FastPitch, SpeedySpeech, WaveFlow and Paralle…☆600Updated 2 years ago
- An Open Source Tools for Speaker Recognition☆590Updated last month
- Speech-to-text server framework with next-gen Kaldi☆524Updated this week
- Large, modern dataset for speech recognition☆629Updated 6 months ago
- SpeechIO Leaderboard: a large, robust, comprehensive, benchmarking platform for Automatic Speech Recognition.☆426Updated last week
- TensorFlowASR: Almost State-of-the-art Automatic Speech Recognition in Tensorflow 2. Supported languages that can use characters or subw…☆928Updated 3 weeks ago
- The Implementation of FastSpeech based on pytorch.☆856Updated last year
- A No-Recurrence Sequence-to-Sequence Model for Speech Recognition☆372Updated 2 years ago
- Tools for handling speech data in machine learning projects.☆932Updated this week
- End-to-end ASR/LM implementation with PyTorch☆592Updated 3 years ago
- A Python wrapper for Kaldi☆991Updated last month
- Chinese Mandarin tts text-to-speech 中文 (普通话) 语音 合成 , by fastspeech 2 , implemented in pytorch, using waveglow as vocoder, with biaobei …☆460Updated 2 years ago
- speech-aligner,是一个从“人声语音”及其“语言文本”,产生音素级别时间对齐标注的工具。speech-aligner, is a tool that generate phoneme-level alignment between human speech an…☆393Updated 4 years ago
- An Automatic Speech Recognition Frame ,一个中文语音识别的完整框架, 提供了多个模型☆244Updated 3 years ago
- Production First and Production Ready End-to-End Text-to-Speech Toolkit☆367Updated 3 months ago
- An End-to-End Architecture for Keyword Spotting and Voice Activity Detection☆373Updated last year
- speaker diarization by uis-rnn and speaker embedding by vgg-speaker-recognition☆464Updated 3 years ago
- PyTorch implementation of "Generalized End-to-End Loss for Speaker Verification" by Wan, Li et al.☆576Updated 2 years ago
- Kaldi model converter to ONNX☆236Updated last year
- (已过时)中文语音合成,改自 https://github.com/Rayhane-mamah/Tacotron-2 和 https://github.com/begeekmyfriend/Tacotron-2☆300Updated 2 years ago