PaddlePaddle / PaddleSpeech
Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.
☆11,119Updated this week
Related projects ⓘ
Alternatives and complementary repositories for PaddleSpeech
- Production First and Production Ready End-to-End Speech Recognition Toolkit☆4,160Updated this week
- VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech☆6,829Updated 11 months ago
- A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity…☆6,853Updated this week
- End-to-End Speech Processing Toolkit☆8,470Updated last week
- Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provi…☆44,039Updated this week
- TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for Tensorflow 2 (supported including English, French, Korean, Chinese, Germa…☆3,836Updated 4 months ago
- A Deep-Learning-Based Chinese Speech Recognition System 基于深度学习的中文语音识别系统☆7,831Updated last month
- DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism (SVS & TTS); AAAI 2022; Official code☆4,303Updated last year
- 🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production☆35,253Updated 2 months ago
- SoftVC VITS Singing Voice Conversion☆25,827Updated 11 months ago
- Use Microsoft Edge's online text-to-speech service from Python WITHOUT needing Microsoft Edge or Windows or an API key☆6,189Updated last week
- 👑 Easy-to-use and powerful NLP and LLM library with 🤗 Awesome model zoo, supporting wide-range of NLP tasks from research to industrial…☆12,100Updated this week
- ModelScope: bring the notion of Model-as-a-Service to life.☆6,984Updated this week
- DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.☆35,365Updated this week
- Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)☆9,381Updated last year
- http://www.facegood.cc☆1,819Updated last year
- This repo is a pipeline of VITS finetuning for fast speaker adaptation TTS, and many-to-many voice conversion☆4,744Updated 4 months ago
- PAddle PARAllel text-to-speech toolKIT (supporting Tacotron2, Transformer TTS, FastSpeech2/FastPitch, SpeedySpeech, WaveFlow and Paralle…☆599Updated 2 years ago
- Awesome pre-trained models toolkit based on PaddlePaddle. (400+ models including Image, Text, Audio, Video and Cross-Modal with Easy Infe…☆12,721Updated 3 months ago
- This repository contains the codes of "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Mult…☆10,722Updated last week
- 基于飞桨开发的虚拟主播☆1,042Updated last year
- A PyTorch-based Speech Toolkit☆8,884Updated this week
- 基于PaddlePaddle实现的语音识别,中文语音识别。项目完善,识别效果好。支持Windows,Linux下训练和预测,支持Nvidia Jetson开发板预测。☆677Updated this week
- 🚀AI拟声: 5秒内克隆您的声音并生成任意语音内容 Clone a voice in 5 seconds to generate arbitrary speech in real-time☆35,268Updated last week
- Easily train a good VC model with voice data <= 10 mins!☆24,296Updated 2 months ago
- Clone a voice in 5 seconds to generate arbitrary speech in real-time☆52,641Updated 2 months ago
- vits2 backbone with multilingual-bert☆7,972Updated this week
- ChatGLM-6B: An Open Bilingual Dialogue Language Model | 开源双语对话语言模型☆40,649Updated 4 months ago
- PArallel Distributed Deep LEarning: Machine Learning Framework from Industrial Practice (『飞桨』核心框架,深度学习&机器学习高性能单机、分布式训练和跨平台部署)☆22,238Updated this week