Fine-tuning toolkit for Chatterbox TTS & Chatterbox TURBO models. Supports 23 languages with smart vocabulary extension. Features offline preprocessing, automatic VAD trimming, and voice cloning capabilities. Train custom TTS models with your own dataset in LJSpeech and file-based formats.
☆79 · Feb 20, 2026 · Updated 2 weeks ago
Alternatives and similar repositories for chatterbox-finetuning
Users that are interested in chatterbox-finetuning are comparing it to the libraries listed below
- ☆26 · Nov 3, 2025 · Updated 4 months ago
- SpeechPlus: Small LLM-Based Text-to-Speech Library ☆20 · May 20, 2025 · Updated 9 months ago
- Real-time voice conversation system with Sesame CSM, featuring web-based audio visualization and GPU acceleration. Educational implementa… ☆18 · Mar 18, 2025 · Updated 11 months ago
- High quality text-to-speech based on StyleTTS 2. ☆73 · Feb 25, 2026 · Updated last week
- This is an unofficial implementation of universal melgan according to https://arxiv.org/abs/2011.09631 ☆23 · Aug 15, 2022 · Updated 3 years ago
- Official implementation of "Unsupervised Pre-training for Data-Efficient Text-to-Speech on Low Resource Languages", ICASSP 2023 ☆27 · Apr 27, 2023 · Updated 2 years ago
- Code for reproducing the paper "Neural Networks Fail to Learn Periodic Functions and How to Fix It" as part of the ML Reproducibility Cha… ☆11 · Apr 16, 2021 · Updated 4 years ago
- SoTA open-source TTS ☆136 · Jun 7, 2025 · Updated 9 months ago
- Code to reproduce the experiments from the paper "Self-Compatibility: Evaluating Causal Discovery without Ground Truth" ☆12 · Mar 9, 2024 · Updated 2 years ago
- Demo for DART, Audio Imagination workshop submission in NeurIPS 2024 ☆12 · Apr 15, 2025 · Updated 10 months ago
- Original VinVL visual backbone with simplified APIs to easily extract features, boxes, object detections, in a few lines of Python code. ☆11 · Nov 27, 2022 · Updated 3 years ago
- ☆12 · Jun 29, 2025 · Updated 8 months ago
- VS Code tools for NextBASIC ☆12 · Apr 22, 2025 · Updated 10 months ago
- Examples of using PyTorch hooks, as covered in my YouTube tutorial video. ☆38 · Oct 25, 2023 · Updated 2 years ago
- ☆10 · Apr 8, 2024 · Updated last year
- Character-level Recurrent Neural Network Language Model (rnnlm) implemented in PyTorch. ☆12 · Oct 4, 2020 · Updated 5 years ago
- Reproduction of the paper SFSRNet: Super-resolution for single-channel Audio Source Separation by me (@arda-num) and @dritx16. Navigate P… ☆11 · Jul 7, 2022 · Updated 3 years ago
- [ICASSP 2025] AnCoGen: Analysis, Control and Generation of Speech with a Masked Autoencoder ☆12 · Mar 11, 2025 · Updated 11 months ago
- QuadTree Compression for ComputerCraft Videos ☆15 · Jan 4, 2022 · Updated 4 years ago
- Simple Pygame application for Particle Effect showcasing (Tutorial) ☆10 · Nov 11, 2023 · Updated 2 years ago
- Arabic Grapheme-to-Phoneme (G2P) Conversion ☆13 · Mar 15, 2025 · Updated 11 months ago
- Get an answer to a question from multiple backend engines like Google, WolframAlpha, or DuckDuckGo ☆11 · Dec 9, 2020 · Updated 5 years ago
- This Python script retrieves and analyzes scientific literature from PubMed related to specific genes, creates a word cloud visualization… ☆11 · Nov 15, 2023 · Updated 2 years ago
- Logo detection in images using SSD ☆10 · Jul 13, 2018 · Updated 7 years ago
- MMM 2021: Crossed-Time Delay Neural Network for Speaker Recognition ☆11 · Dec 4, 2021 · Updated 4 years ago
- text to speech ☆10 · Mar 19, 2024 · Updated last year
- A straightforward implementation for Progressive Growing of GANs ☆10 · Jun 20, 2018 · Updated 7 years ago
- Embedded Tajweed annotation for the Qur'an ☆11 · Nov 30, 2025 · Updated 3 months ago
- code for Towards Data Science article on prompt-loss-weight ☆11 · Jun 4, 2025 · Updated 9 months ago
- ☆41 · Nov 19, 2025 · Updated 3 months ago
- A collection of Z80 assembler related projects developed as both an educational and personal resource. ☆14 · May 16, 2018 · Updated 7 years ago
- Onset-and-Offset-Aware Sound Event Detection ☆21 · Feb 10, 2025 · Updated last year
- Joint magnitude estimation and phase recovery using Cycle-in-Cycle GAN for non-parallel speech enhancement ☆10 · Jan 24, 2022 · Updated 4 years ago
- A 3D library for the ZX Spectrum Next ☆27 · Nov 30, 2025 · Updated 3 months ago
- MelGAN and Tacotron 2 in PyTorch ☆11 · Oct 22, 2019 · Updated 6 years ago
- TEAD : Large Scale Arabic Dataset for Sentiment Analysis ☆12 · Oct 16, 2018 · Updated 7 years ago
- TTSAudioNormalizer is a specialized tool for TTS data production, featuring descriptive statistical analysis of audio loudness and loud… ☆111 · Dec 20, 2024 · Updated last year
- ☆14 · Aug 1, 2025 · Updated 7 months ago
- ☆11 · Oct 29, 2019 · Updated 6 years ago