This comprehensive guide provides a universal process for preparing your own speech datasets and training a custom Text-to-Speech (TTS) model.
☆32Jun 17, 2026Updated 2 weeks ago
Alternatives and similar repositories for Universal-TTS-Guide
Users that are interested in Universal-TTS-Guide are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Analysis of XLS-R for Speech Quality Assessment☆15Feb 10, 2025Updated last year
- temporary files created by opensubtitles-scraper☆17Feb 3, 2026Updated 4 months ago
- Text Normalization utilities for normalizing text for TTS☆22Mar 4, 2026Updated 3 months ago
- Llama-Mimi is a speech language model that uses a unified tokenizer (Mimi) and a single Transformer decoder (Llama) to jointly model sequ…☆31Sep 20, 2025Updated 9 months ago
- ☆11Feb 22, 2025Updated last year
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- dvbshout takes an MPEG transport stream from a DVB card, extracts audio channels from stream, and sends the audio to an Icecast / Shoutca…☆10Jul 29, 2021Updated 4 years ago
- KV Cache & LoRA for minGPT☆61Mar 4, 2026Updated 3 months ago
- ☆50Apr 20, 2026Updated 2 months ago
- Code for "Guiding Large Language Models to Post-Edit Machine Translation with Error Annotations" [NAACL Findings 2024]☆14Apr 3, 2026Updated 2 months ago
- Full End-to-End examples showing how to use First-gen Gaudi and Gaudi2 in common use cases☆13Dec 2, 2024Updated last year
- SpeechPlus: Small LLM-Based Text-to-Speech Library 🚀☆21May 20, 2025Updated last year
- CLIP is an open source, multimodal computer vision model and it's awesome!☆17Dec 16, 2024Updated last year
- ☆34Feb 6, 2026Updated 4 months ago
- a character-ai like UI for LLM☆10Dec 3, 2024Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Building and Running TypeScript projects efficiently with rollup + esbuild☆21Jun 19, 2026Updated last week
- extension for pdfmake☆13Jul 5, 2016Updated 9 years ago
- Distributed Reinforcement Learning for LLM Fine-Tuning with multi-GPU utilization☆22Mar 12, 2025Updated last year
- Fancy scala wrapper for Apache POI☆24May 8, 2011Updated 15 years ago
- ☆33Sep 22, 2024Updated last year
- Electronic saxophone firmware and PCB☆17Feb 22, 2024Updated 2 years ago
- The MOS system combines components from DNSMOS, NISQA, MOSSSL, and SIGMOS, using the librosa library to process audio waveforms.☆31Feb 16, 2024Updated 2 years ago
- Repository for ACM India Summer School on Generative AI for Text☆13Jul 11, 2024Updated last year
- Simple gaze tracking in python that uses an gradient-based algorithm by Timm & Barth to locate iris centers☆11Jul 16, 2015Updated 10 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- REMI 3 "All-in-One" electronic wind instrument (EWI) - based on Teensy 3.2 MCU - Arduino IDE☆16Dec 18, 2023Updated 2 years ago
- ☆15Aug 26, 2023Updated 2 years ago
- Iterate fast on your RAG pipelines☆24Jun 21, 2025Updated last year
- 🚀 Framework for seamless fine-tuning of Whisper model on a multi-lingual dataset and deployment to prod.☆38Feb 27, 2025Updated last year
- ComfyUI-HiggsAudio is now available in ComfyUI, Higgs Audio v2 is a text-audio foundation model from Boson AI.☆23Jul 26, 2025Updated 11 months ago
- Arduino library for communicating with Honeywell TruStability HSC or SSC digital pressure sensors over SPI☆15Nov 20, 2019Updated 6 years ago
- Syntax Highlightning for Textarea (HTML). Transform Textarea into code editor with shortcut keys supported.☆20Jun 9, 2023Updated 3 years ago
- Triton implementation of GPT/LLAMA☆22Aug 28, 2024Updated last year
- convert number to arabic words☆19Mar 7, 2023Updated 3 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- ☆37Feb 5, 2025Updated last year
- ☆44Jun 10, 2024Updated 2 years ago
- A comprehensive ComfyUI wrapper for HiggsAudio v2, enabling high-quality text-to-speech generation with advanced voice cloning capabiliti…☆27Jul 26, 2025Updated 11 months ago
- .NET 8 Aurelia App with Bootstrap☆21Nov 21, 2023Updated 2 years ago
- A PHP Search Query Parser☆25Nov 26, 2015Updated 10 years ago
- Streamlit Web UI for OCRmyPDF☆56May 26, 2023Updated 3 years ago
- Use AI to preview how garments look on you directly on product pages from Amazon and Coupang. Upload your photo, click "Try On," and see …☆23Apr 13, 2025Updated last year