PaddlePaddle / PaddleSpeech
Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.
☆11,506Updated this week
Alternatives and similar repositories for PaddleSpeech:
Users that are interested in PaddleSpeech are comparing it to the libraries listed below
- Production First and Production Ready End-to-End Speech Recognition Toolkit☆4,308Updated this week
- A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity…☆8,229Updated this week
- A Deep-Learning-Based Chinese Speech Recognition System 基于深度学习的中文语音识别系统☆7,980Updated 4 months ago
- An implementation of Microsoft's "FastSpeech 2: Fast and High-Quality End-to-End Text to Speech"☆1,943Updated last year
- This repository contains the codes of "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Mult…☆11,367Updated last week
- End-to-End Speech Processing Toolkit☆8,791Updated 2 weeks ago
- VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech☆7,151Updated last year
- 🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production☆37,794Updated 6 months ago
- 🚀AI拟声: 5秒内克隆您的声音并生成任意语音内容 Clone a voice in 5 seconds to generate arbitrary speech in real-time☆35,802Updated 3 months ago
- kaldi-asr/kaldi is the official location of the Kaldi project.☆14,567Updated 3 weeks ago
- 基于PaddlePaddle实现端到端中文语音识别,从入门到实战,超简单的入门案例,超实用的企业项目。支持当前最流行的DeepSpeech2、Conformer、Squeezeformer模型☆838Updated last week
- Facebook AI Research's Automatic Speech Recognition Toolkit☆6,406Updated 2 months ago
- Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)☆9,636Updated last year
- TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for Tensorflow 2 (supported including English, French, Korean, Chinese, Germa…☆3,892Updated 7 months ago
- Silero VAD: pre-trained enterprise-grade Voice Activity Detector☆5,026Updated this week
- 中文语音识别; Mandarin Automatic Speech Recognition;☆1,910Updated 6 months ago
- [SIGGRAPH Asia 2022] VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild☆6,881Updated 6 months ago
- A PyTorch-based Speech Toolkit☆9,371Updated last week
- Automatic Speech Recognition (ASR), Speaker Verification, Speech Synthesis, Text-to-Speech (TTS), Language Modelling, Singing Voice Synth…☆3,015Updated last year
- PyTorch implementation of VALL-E(Zero-Shot Text-To-Speech), Reproduced Demo https://lifeiteng.github.io/valle/index.html☆2,089Updated last year
- Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provi…☆46,558Updated this week
- ☆1,014Updated 2 weeks ago
- Use Microsoft Edge's online text-to-speech service from Python WITHOUT needing Microsoft Edge or Windows or an API key☆7,396Updated 2 weeks ago
- 🐸STT - The deep learning toolkit for Speech-to-Text. Training and deploying STT models has never been so easy.☆2,346Updated 11 months ago
- [CVPR 2023] SadTalker:Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation☆12,341Updated 7 months ago
- A treasure chest for visual classification and recognition powered by PaddlePaddle☆5,556Updated last week
- pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by pytorch,…☆2,378Updated 2 years ago
- http://www.facegood.cc☆1,851Updated 2 years ago
- PAddle PARAllel text-to-speech toolKIT (supporting Tacotron2, Transformer TTS, FastSpeech2/FastPitch, SpeedySpeech, WaveFlow and Paralle…☆599Updated 3 years ago
- Python interface to the WebRTC Voice Activity Detector☆2,145Updated 7 months ago