This comprehensive guide provides a universal process for preparing your own speech datasets and training a custom Text-to-Speech (TTS) model.
☆31 · May 3, 2025 · Updated last year
Alternatives and similar repositories for Universal-TTS-Guide
Users that are interested in Universal-TTS-Guide are comparing it to the libraries listed below.
- Analysis of XLS-R for Speech Quality Assessment ☆15 · Feb 10, 2025 · Updated last year
- temporary files created by opensubtitles-scraper ☆17 · Feb 3, 2026 · Updated 4 months ago
- (WIP) A retrain of F5-TTS on permissively-licensed data ☆14 · Apr 6, 2025 · Updated last year
- Text Normalization utilities for normalizing text for TTS ☆22 · Mar 4, 2026 · Updated 3 months ago
- Lightblue LLM Eval Framework: tengu, elyza100, ja-mtbench, rakuda ☆18 · Apr 29, 2026 · Updated last month
- Llama-Mimi is a speech language model that uses a unified tokenizer (Mimi) and a single Transformer decoder (Llama) to jointly model sequ… ☆31 · Sep 20, 2025 · Updated 8 months ago
- ☆11 · Feb 22, 2025 · Updated last year
- dvbshout takes an MPEG transport stream from a DVB card, extracts audio channels from stream, and sends the audio to an Icecast / Shoutca… ☆10 · Jul 29, 2021 · Updated 4 years ago
- KV Cache & LoRA for minGPT ☆63 · Mar 4, 2026 · Updated 3 months ago
- ☆50 · Apr 20, 2026 · Updated last month
- Code for "Guiding Large Language Models to Post-Edit Machine Translation with Error Annotations" [NAACL Findings 2024] ☆14 · Apr 3, 2026 · Updated 2 months ago
- poorman's ar-dit tts ☆45 · Dec 31, 2025 · Updated 5 months ago
- Full End-to-End examples showing how to use First-gen Gaudi and Gaudi2 in common use cases ☆13 · Dec 2, 2024 · Updated last year
- SpeechPlus: Small LLM-Based Text-to-Speech Library 🚀 ☆21 · May 20, 2025 · Updated last year
- CLIP is an open source, multimodal computer vision model and it's awesome! ☆17 · Dec 16, 2024 · Updated last year
- A demo of presence cursors with json0 and ShareDB. ☆13 · Apr 16, 2019 · Updated 7 years ago
- Repository for NLP project. Name to be changed when we decide on a project ☆16 · Apr 19, 2022 · Updated 4 years ago
- ☆33 · Feb 6, 2026 · Updated 4 months ago
- a character-ai like UI for LLM ☆10 · Dec 3, 2024 · Updated last year
- Building and Running TypeScript projects efficiently with rollup + esbuild ☆21 · Updated this week
- Distributed Reinforcement Learning for LLM Fine-Tuning with multi-GPU utilization ☆22 · Mar 12, 2025 · Updated last year
- Port of Nordic Semiconductor nRF24L01 transceiver's Hardware Abstraction Layer API (NRF HAL) for the Arduino platform. ☆11 · Jan 18, 2023 · Updated 3 years ago
- ☆33 · Sep 22, 2024 · Updated last year
- The MOS system combines components from DNSMOS, NISQA, MOSSSL, and SIGMOS, using the librosa library to process audio waveforms. ☆31 · Feb 16, 2024 · Updated 2 years ago
- Repository for ACM India Summer School on Generative AI for Text ☆13 · Jul 11, 2024 · Updated last year
- A PHP RQL Parsing Library ☆17 · Jan 28, 2025 · Updated last year
- ☆15 · Aug 26, 2023 · Updated 2 years ago
- Iterate fast on your RAG pipelines ☆24 · Jun 21, 2025 · Updated 11 months ago
- 🚀 Framework for seamless fine-tuning of Whisper model on a multi-lingual dataset and deployment to prod. ☆38 · Feb 27, 2025 · Updated last year
- ComfyUI-HiggsAudio is now available in ComfyUI, Higgs Audio v2 is a text-audio foundation model from Boson AI. ☆23 · Jul 26, 2025 · Updated 10 months ago
- Jeff's phrasing system for Plover ☆44 · Apr 21, 2024 · Updated 2 years ago
- Generate Drizzle schema from Prisma schema ☆35 · Aug 13, 2024 · Updated last year
- MCP web research server (give Claude real-time info from the web) ☆25 · Feb 18, 2025 · Updated last year
- ☆23 · May 6, 2026 · Updated last month
- convert number to arabic words ☆19 · Mar 7, 2023 · Updated 3 years ago
- ☆37 · Feb 5, 2025 · Updated last year
- Roles (similar to traits) in C# ☆22 · Dec 24, 2017 · Updated 8 years ago
- ☆44 · Jun 10, 2024 · Updated 2 years ago
- A comprehensive ComfyUI wrapper for HiggsAudio v2, enabling high-quality text-to-speech generation with advanced voice cloning capabiliti… ☆27 · Jul 26, 2025 · Updated 10 months ago