gokhaneraslan / chatterbox-finetuningView external linksLinks
Fine-tuning toolkit for Chatterbox TTS & Chatterbox TURBO models. Supports 23 languages with smart vocabulary extension. Features offline preprocessing, automatic VAD trimming, and voice cloning capabilities. Train custom TTS models with your own dataset in LJSpeech and file-based format.
β70Jan 11, 2026Updated last month
Alternatives and similar repositories for chatterbox-finetuning
Users that are interested in chatterbox-finetuning are comparing it to the libraries listed below
Sorting:
- β23Nov 3, 2025Updated 3 months ago
- SpeechPlus: Small LLM-Based Text-to-Speech Library πβ20May 20, 2025Updated 8 months ago
- Real-time voice conversation system with Sesame CSM, featuring web-based audio visualization and GPU acceleration. Educational implementaβ¦β18Mar 18, 2025Updated 11 months ago
- High quality text-to-speech based on StyleTTS 2.β72Updated this week
- This is an unofficial implementation of universal melgan according to https://arxiv.org/abs/2011.09631β23Aug 15, 2022Updated 3 years ago
- Official implementation of "Unsupervised Pre-training for Data-Efficient Text-to-Speech on Low Resource Languages", ICASSP 2023β27Apr 27, 2023Updated 2 years ago
- Code for reproducing the paper "Neural Networks Fail to Learn Periodic Functions and How to Fix It" as part of the ML Reproducibility Chaβ¦β11Apr 16, 2021Updated 4 years ago
- β40Nov 19, 2025Updated 2 months ago
- Original VinVL visual backbone with simplified APIs to easily extract features, boxes, object detections, in a few lines of Python code.β11Nov 27, 2022Updated 3 years ago
- Demo for DART, Audio Imagination workshop submission in NeurIPS 2024β12Apr 15, 2025Updated 10 months ago
- VS Code tools for NextBASICβ12Apr 22, 2025Updated 9 months ago
- β12Jun 29, 2025Updated 7 months ago
- Code to reproduce the experiments from the paper "Self-Compatibility: Evaluating Causal Discovery without Ground Truth"β12Mar 9, 2024Updated last year
- code for Towards Data Science article on prompt-loss-weightβ11Jun 4, 2025Updated 8 months ago
- Simple Pygame application for Particle Effect showcasing (Tutorial)β10Nov 11, 2023Updated 2 years ago
- [ICASSP 2025] AnCoGen: Analysis, Control and Generation of Speech with a Masked Autoencoderβ12Mar 11, 2025Updated 11 months ago
- Arabic Grapheme-to-Phoneme (G2P) Conversionβ13Mar 15, 2025Updated 11 months ago
- Character-level Recurrent Neural Network Language Model (rnnlm) implement in Pytorch.β12Oct 4, 2020Updated 5 years ago
- TEAD : Large Scale Arabic Dataset for Sentiment Analysisβ12Oct 16, 2018Updated 7 years ago
- Logo detection in images using SSDβ10Jul 13, 2018Updated 7 years ago
- text to speechβ10Mar 19, 2024Updated last year
- A user-friendly interface built on top of Thinking Machines Tinker API that lets you fine-tune LLMs, chat with your trained model, and deβ¦β26Jan 31, 2026Updated 2 weeks ago
- MMM 2021: Crossed-Time Delay Neural Network for Speaker Recognitionβ11Dec 4, 2021Updated 4 years ago
- Joint magnitude estimation and phase recovery using Cycle-in-Cycle GAN for non-parallel speech enhancementβ10Jan 24, 2022Updated 4 years ago
- MelGAN and Tacotron 2 in PyTorchβ11Oct 22, 2019Updated 6 years ago
- A 3D library for the ZX Spectrum Nextβ26Nov 30, 2025Updated 2 months ago
- A collection of Z80 assembler related projects developed as both an educational and personal resource.β14May 16, 2018Updated 7 years ago
- Reproduction of the paper SFSRNet: Super-resolution for single-channel Audio Source Separation by me (@arda-num) and @dritx16. Navigate Pβ¦β11Jul 7, 2022Updated 3 years ago
- A straightforward implementation for Progressive Growing of GANsβ10Jun 20, 2018Updated 7 years ago
- QuadTree Compression for ComputerCraft Videosβ15Jan 4, 2022Updated 4 years ago
- β10Apr 8, 2024Updated last year
- Get an answer to a question from multiple backend engine like Google, wolframalpha or DuckDuckGoβ11Dec 9, 2020Updated 5 years ago
- Onset-and-Offset-Aware Sound Event Detectionβ20Feb 10, 2025Updated last year
- This Python script retrieves and analyzes scientific literature from PubMed related to specific genes, creates a word cloud visualizationβ¦β11Nov 15, 2023Updated 2 years ago
- TTSAudioNormalizer is a specialized tool for TTS data production, featuring descriptive statistical analysis of audio loudness and loudβ¦β111Dec 20, 2024Updated last year
- β11Mar 22, 2023Updated 2 years ago
- FINALLY: Fast and universal speech enhancement model delivering studio-quality audio for a wide range of recordings.β25Dec 11, 2025Updated 2 months ago
- Automatically exported from code.google.com/p/transducersaurusβ11Apr 1, 2015Updated 10 years ago
- β14Aug 1, 2025Updated 6 months ago