Tacotron speech synthesis implemented in TensorFlow, with samples and a pre-trained model
☆14Sep 13, 2017Updated 8 years ago
Alternatives and similar repositories for tacotron-3
Users that are interested in tacotron-3 are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- methods_lidar_3d☆12Jan 15, 2022Updated 4 years ago
- ☆12Jan 15, 2019Updated 7 years ago
- 깃북 글 저장소☆13Dec 14, 2020Updated 5 years ago
- Project for HIDING SPEAKER’S SEX IN SPEECH USING ZERO-EVIDENCE SPEAKER REPRESENTATION IN AN ANALYSIS/SYNTHESIS PIPELINE☆15Nov 30, 2022Updated 3 years ago
- Focal loss implemention by PyTorch☆11Dec 16, 2018Updated 7 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- KITTI Point Cloud Utilities☆12Jul 25, 2024Updated last year
- Deep network for joint line and point detection and description☆20Aug 28, 2022Updated 3 years ago
- segmentation and classification of lidar point cloud data☆13Apr 29, 2019Updated 6 years ago
- Aty-TTS: Improving fairness for spoken language understanding in atypical speech with Text-to-Speech☆11May 14, 2025Updated 11 months ago
- Export MMD model in editor or runtime☆17Aug 3, 2025Updated 8 months ago
- A repository of ELL models☆21Jan 16, 2026Updated 3 months ago
- Public source code of top5 ZaloAI LandMark challenge.☆15Sep 6, 2018Updated 7 years ago
- A python tool that converts Arabic diacritised text to a sequence of phonemes and creates a pronunciation dictionary. This code is based …☆16Sep 5, 2017Updated 8 years ago
- Rababa, the diacritization library for Arabic and Hebrew (Abjad scripts in general)☆13May 1, 2025Updated 11 months ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Render wav and convert it with [Diff-SVC](https://github.com/prophesier/diff-svc) model☆10Aug 24, 2025Updated 7 months ago
- ☆15Jun 4, 2021Updated 4 years ago
- A utility to read and write PDFs with Python☆12Apr 28, 2022Updated 3 years ago
- Arabic Phonetic Dictionary Generator Tool for Automatic Speech Recognition Applications☆12Oct 27, 2021Updated 4 years ago
- 22人で童謡を5曲ずつ歌ってつくった歌唱データベースです。☆14Aug 7, 2022Updated 3 years ago
- Cherokee Audio data☆11Dec 24, 2023Updated 2 years ago
- phone inventory library☆17May 15, 2023Updated 2 years ago
- Build 2019 Demos for Knowledge Mining Session☆10May 17, 2019Updated 6 years ago
- Indonesian speech/phoneme recognizer powered by Kaldi 2.0 (lhotse, icefall, sherpa).☆15Jun 30, 2023Updated 2 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- A lightweight Rust library for removing Arabic diacritics☆20Oct 16, 2022Updated 3 years ago
- ☆13Dec 7, 2022Updated 3 years ago
- This is a legacy repo. Dev occurs now on GitHub.☆11Mar 28, 2021Updated 5 years ago
- Tools for the automatic detection of speech-related inhalation events and characterisation of the speech respiratory cycle.☆11Feb 17, 2024Updated 2 years ago
- A program to synthesize Japanese vocal music☆15Jan 24, 2017Updated 9 years ago
- ☆95Updated this week
- [APSIPA'22] Exploring Speaker Age Estimation on Different Self-Supervised Learning Models☆14Oct 19, 2022Updated 3 years ago
- The grapheme to phoneme model converts Kazakh(Arab|Cyrillic) characters to phonemes.☆12Sep 30, 2019Updated 6 years ago
- Convert english/translit words to katakana☆13Sep 1, 2018Updated 7 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- MFA acoustic model training based on Opencpop☆15Sep 23, 2022Updated 3 years ago
- ☆10Nov 14, 2016Updated 9 years ago
- HAMSI (Hessian Approximated Multiple Subsets Iteration) is a parallel incremental optimization algorithm☆13Feb 10, 2020Updated 6 years ago
- ☆14Oct 11, 2024Updated last year
- Doing style transfer with linguistic features using OpenAI's CLIP.☆14May 4, 2021Updated 4 years ago
- Thai smart home corpus with "Gowajee" hotword☆18Jul 30, 2023Updated 2 years ago
- ☆20Jul 22, 2022Updated 3 years ago