PaddlePaddle / PaddleSpeechLinks
Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.
☆12,164Updated this week
Alternatives and similar repositories for PaddleSpeech
Users that are interested in PaddleSpeech are comparing it to the libraries listed below
Sorting:
- Production First and Production Ready End-to-End Speech Recognition Toolkit☆4,743Updated last month
- A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity…☆12,099Updated this week
- Use Microsoft Edge's online text-to-speech service from Python WITHOUT needing Microsoft Edge or Windows or an API key☆8,848Updated 2 weeks ago
- A Deep-Learning-Based Chinese Speech Recognition System 基于深度学习的中文语音识别系统☆8,213Updated 10 months ago
- End-to-End Speech Processing Toolkit☆9,380Updated this week
- Multilingual Voice Understanding Model☆6,424Updated this week
- TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for Tensorflow 2 (supported including English, French, Korean, Chinese, Germa…☆3,962Updated last year
- VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech☆7,611Updated last year
- Awesome multilingual OCR and Document Parsing toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languag…☆52,675Updated this week
- Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)☆9,960Updated last year
- A PyTorch-based Speech Toolkit☆10,272Updated last week
- Easy-to-use and powerful LLM and SLM library with awesome model zoo.☆12,731Updated this week
- Speech-to-text, text-to-speech, speaker diarization, speech enhancement, source separation, and VAD using next-gen Kaldi with onnxruntime…☆7,076Updated this week
- Automatic Speech Recognition (ASR), Speaker Verification, Speech Synthesis, Text-to-Speech (TTS), Language Modelling, Singing Voice Synth…☆3,060Updated last year
- Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.☆15,826Updated this week
- 🚀AI拟声: 5秒内克 隆您的声音并生成任意语音内容 Clone a voice in 5 seconds to generate arbitrary speech in real-time☆36,538Updated 9 months ago
- DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism (SVS & TTS); AAAI 2022; Official code☆4,573Updated 5 months ago
- kaldi-asr/kaldi is the official location of the Kaldi project.☆15,051Updated 3 weeks ago
- 🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production☆42,107Updated last year
- 一个简单的本地网页界面,使用ChatTTS将文字合成为语音,同时支持对外提供API接口。A simple native web interface that uses ChatTTS to synthesize text into speech, along with su…☆7,264Updated 3 months ago
- An implementation of Microsoft's "FastSpeech 2: Fast and High-Quality End-to-End Text to Speech"☆2,077Updated last year
- [SIGGRAPH Asia 2022] VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild☆7,120Updated last year
- A generative speech model for daily dialogue.☆37,552Updated last month
- Real-time speech recognition and voice activity detection (VAD) using next-gen Kaldi with ncnn without Internet connection. Support iOS, …☆1,452Updated this week
- This repo is a pipeline of VITS finetuning for fast speaker adaptation TTS, and many-to-many voice conversion☆4,964Updated 6 months ago
- A Repository for Single- and Multi-modal Speaker Verification, Speaker Recognition and Speaker Diarization☆2,285Updated last week
- Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junio…☆9,299Updated 2 months ago
- ☆1,445Updated last year
- 基于飞桨开发的虚拟主播☆1,069Updated 2 years ago
- 官方推荐的 ChatTTS 资源汇总项目,整理了全网相关资源和常见问题 || Officially recommended ChatTTS resource collection project☆1,764Updated last year