This comprehensive guide provides a universal process for preparing your own speech datasets and training a custom Text-to-Speech (TTS) model.
☆28May 3, 2025Updated last year
Alternatives and similar repositories for Universal-TTS-Guide
Users that are interested in Universal-TTS-Guide are comparing it to the libraries listed below.
- Unofficial instructions for changing Python kernel version on Google Colab.☆25Apr 21, 2025Updated last year
- Analysis of XLS-R for Speech Quality Assessment☆15Feb 10, 2025Updated last year
- temporary files created by opensubtitles-scraper☆17Feb 3, 2026Updated 3 months ago
- (WIP) A retrain of F5-TTS on permissively-licensed data☆14Apr 6, 2025Updated last year
- Text Normalization utilities for normalizing text for TTS☆22Mar 4, 2026Updated 2 months ago
- Llama-Mimi is a speech language model that uses a unified tokenizer (Mimi) and a single Transformer decoder (Llama) to jointly model sequ…☆31Sep 20, 2025Updated 8 months ago
- ☆11Feb 22, 2025Updated last year
- dvbshout takes an MPEG transport stream from a DVB card, extracts audio channels from stream, and sends the audio to an Icecast / Shoutca…☆10Jul 29, 2021Updated 4 years ago
- KV Cache & LoRA for minGPT☆63Mar 4, 2026Updated 2 months ago
- ☆49Apr 20, 2026Updated last month
- Code for "Guiding Large Language Models to Post-Edit Machine Translation with Error Annotations" [NAACL Findings 2024]☆14Apr 3, 2026Updated last month
- poorman's ar-dit tts☆45Dec 31, 2025Updated 4 months ago
- Full End-to-End examples showing how to use First-gen Gaudi and Gaudi2 in common use cases☆13Dec 2, 2024Updated last year
- SpeechPlus: Small LLM-Based Text-to-Speech Library 🚀☆21May 20, 2025Updated last year
- CLIP is an open source, multimodal computer vision model and it's awesome!☆17Dec 16, 2024Updated last year
- Repository for NLP project. Name to be changed when we decide on a project☆16Apr 19, 2022Updated 4 years ago
- ☆33Feb 6, 2026Updated 3 months ago
- a character-ai like UI for LLM☆10Dec 3, 2024Updated last year
- Building and Running TypeScript projects efficiently with rollup + esbuild☆21May 14, 2026Updated last week
- Distributed Reinforcement Learning for LLM Fine-Tuning with multi-GPU utilization☆22Mar 12, 2025Updated last year
- Port of Nordic Semiconductor nRF24L01 transceiver's Hardware Abstraction Layer API (NRF HAL) for the Arduino platform.☆11Jan 18, 2023Updated 3 years ago
- Fancy scala wrapper for Apache POI☆24May 8, 2011Updated 15 years ago
- ☆33Sep 22, 2024Updated last year
- arduino library for Honeywell pressure sensors☆12May 4, 2020Updated 6 years ago
- The MOS system combines components from DNSMOS, NISQA, MOSSSL, and SIGMOS, using the librosa library to process audio waveforms.☆31Feb 16, 2024Updated 2 years ago
- Repository for ACM India Summer School on Generative AI for Text☆13Jul 11, 2024Updated last year
- Simple gaze tracking in Python that uses a gradient-based algorithm by Timm & Barth to locate iris centers☆11Jul 16, 2015Updated 10 years ago
- A PHP RQL Parsing Library☆17Jan 28, 2025Updated last year
- REMI 3 "All-in-One" electronic wind instrument (EWI) - based on Teensy 3.2 MCU - Arduino IDE☆16Dec 18, 2023Updated 2 years ago
- Iterate fast on your RAG pipelines☆24Jun 21, 2025Updated 11 months ago
- 🚀 Framework for seamless fine-tuning of Whisper model on a multi-lingual dataset and deployment to prod.☆38Feb 27, 2025Updated last year
- ComfyUI-HiggsAudio is now available in ComfyUI, Higgs Audio v2 is a text-audio foundation model from Boson AI.☆23Jul 26, 2025Updated 9 months ago
- Triton implementation of GPT/LLAMA☆21Aug 28, 2024Updated last year
- Syntax Highlighting for Textarea (HTML). Transform Textarea into code editor with shortcut keys supported.☆20Jun 9, 2023Updated 2 years ago
- ☆14Aug 6, 2021Updated 4 years ago
- MCP web research server (give Claude real-time info from the web)☆25Feb 18, 2025Updated last year
- Blog plugin for Winter CMS☆20Feb 8, 2026Updated 3 months ago
- convert number to arabic words☆19Mar 7, 2023Updated 3 years ago
- An MSBuild CodeTaskFactory that uses Roslyn compiler for cross platform compatibility☆26Aug 20, 2018Updated 7 years ago