☆27Nov 3, 2025Updated 5 months ago
Alternatives and similar repositories for KaniTTS-Finetune-pipeline
Users that are interested in KaniTTS-Finetune-pipeline are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Embarrassingly Easy Fully Non-Autoregressive Zero-Shot TTS (E2 TTS) in MLX☆29Oct 15, 2024Updated last year
- ESLTTS dataset☆16Feb 6, 2025Updated last year
- Arabic Grapheme-to-Phoneme (G2P) Conversion☆13Mar 15, 2025Updated last year
- ☆458Nov 2, 2025Updated 5 months ago
- Legible, Scalable, Reproducible Foundation Models with Named Tensors and Jax☆16Jun 16, 2024Updated last year
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- DiTTo-TTS: Diffusion Transformers for Scalable Text-to-Speech without Domain-Specific Factors☆38Feb 11, 2025Updated last year
- ☆13Oct 27, 2025Updated 5 months ago
- SpeechPlus: Small LLM-Based Text-to-Speech Library 🚀☆20May 20, 2025Updated 10 months ago
- Observer any UIScrollView without setting a delegate☆12Jun 5, 2025Updated 10 months ago
- Whisper Speaker Identification (WSI), a cutting-edge model for multilingual speaker identification.☆26Mar 17, 2025Updated last year
- ☆15Oct 11, 2023Updated 2 years ago
- This is a fork of the original fairseq repository (version 0.12.2) with added classes for training mHuBERT-147.☆21Nov 19, 2024Updated last year
- Fine-tuning toolkit for Chatterbox TTS & Chatterbox TURBO models. Supports 23 languages with smart vocabulary extension. Features offline…☆96Feb 20, 2026Updated last month
- poorman's ar-dit tts☆45Dec 31, 2025Updated 3 months ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- An AR+AR TTS attempt.☆18Jan 13, 2025Updated last year
- ☆43Nov 19, 2025Updated 5 months ago
- Generate audio datasets for training Text-To-Speech models, through smart audio splitting with silence detection, and transcription using…☆30May 27, 2023Updated 2 years ago
- An open source NLP as a service project focused on providing state of the art systems with ease. Training and inference by simple docker …☆20Sep 17, 2024Updated last year
- Create svg sprites using svgson and react☆14Apr 24, 2023Updated 2 years ago
- dynamic alias, enhanced-resolve plugin☆11Mar 29, 2026Updated 3 weeks ago
- Visualizer to display the data logged with YarpRobotLoggerDevice☆32Feb 25, 2026Updated last month
- ☆25Mar 6, 2024Updated 2 years ago
- ☆26Mar 20, 2024Updated 2 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Code accompanying the paper "A Language Model's Guide Through Latent Space". It contains functionality for training and using concept vec…☆21Feb 23, 2024Updated 2 years ago
- NVIDIA Parakeet speech recognition for the browser (WebGPU/WASM) powered by ONNX Runtime Web☆54Updated this week
- Official Repository for Task-Circuit Quantization☆24Jun 1, 2025Updated 10 months ago
- Get an answer to a question from multiple backend engine like Google, wolframalpha or DuckDuckGo☆11Dec 9, 2020Updated 5 years ago
- ☆46Mar 11, 2026Updated last month
- Audiobook creation tool supporting multiple TTS models (Qwen3-TTS, IndexTTS2, VibeVoice, Chatterbox, Fish S2-Pro, Higgs Audio V2, etc), f…☆100Updated this week
- SoTA open-source TTS☆26Jul 8, 2025Updated 9 months ago
- ☆68Dec 30, 2025Updated 3 months ago
- FastAPI Implementation of Orpheus TTS streaming Chatbot☆28Jun 19, 2025Updated 10 months ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- My vocoder experiments☆31Jul 26, 2025Updated 8 months ago
- ☆24Sep 28, 2020Updated 5 years ago
- Local runner for Microsoft VibeVoice Realtime TTS Fully compatible with Open-Webui Plug and Play. OpenAI api endpoint .Run the Colab note…☆40Mar 13, 2026Updated last month
- ☆10Oct 24, 2024Updated last year
- Accelerate Whisper tasks such as transcription, by multiprocesing through parallelization☆25Oct 29, 2022Updated 3 years ago
- A simple, but performant framework for mapping speech directly to categories and intents.☆25Aug 8, 2024Updated last year
- [AAAI 2026 & ACL 2026] The official implementation of the DIFFA series for dLLM-based large audio language model☆76Apr 7, 2026Updated last week