TigreGotico / chatterbox-onnxLinks
chatterbox TTS + Voice Clone using onnx
☆26Updated last month
Alternatives and similar repositories for chatterbox-onnx
Users that are interested in chatterbox-onnx are comparing it to the libraries listed below
Sorting:
- StyleTTS2 + Vocos as a Decoder☆13Updated 8 months ago
- Code associated with the paper: CTC-DRO: Robust Optimization for Reducing Language Disparities in Speech Recognition.☆15Updated 7 months ago
- SpeechPlus: Small LLM-Based Text-to-Speech Library 🚀☆16Updated 6 months ago
- (WIP) A retrain of F5-TTS on permissively-licensed data☆13Updated 8 months ago
- Arabic Grapheme-to-Phoneme (G2P) Conversion☆13Updated 9 months ago
- StyleTTS 2 Optimized Training Fork☆34Updated 10 months ago
- ☆50Updated last week
- A composition of offline tools to achieve high quality multilingual speech to text transcription☆23Updated last week
- Open TTS models, built for streaming on the edge☆44Updated 8 months ago
- ☆19Updated 9 months ago
- Audio tokenization, in the fastest way possible!☆53Updated last year
- A package for NeuCodec: a 50hz, 0.8kbps, 24kHz audio codec.☆127Updated 2 months ago
- 🌼 Daisy-TTS: Simulating Wider Spectrum of Emotions via Prosody Embedding Decomposition☆14Updated last month
- 🎙️ Automatically transcribe audio/video into high-quality, speaker-specific Text-To-Speech datasets ✨☆129Updated 4 months ago
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster.☆69Updated last month
- Embarrassingly Easy Fully Non-Autoregressive Zero-Shot TTS (E2 TTS) in MLX☆29Updated last year
- LoRA-based phoneme/prosody control for LLM-based TTS with no G2P - Lightweight adapter for edit and control the target language's phoneme…☆21Updated 4 months ago
- a Neural Vocoder supporting Ring Attention, Conformer and NSF.☆23Updated 4 months ago
- This is a fork of the original fairseq repository (version 0.12.2) with added classes for training mHuBERT-147.☆19Updated last year
- Very fast, accurate speaker diarization☆186Updated last week
- ☆29Updated last month
- ☆47Updated 5 months ago
- A collection of all our phonemeizers for dataset construction and inference☆27Updated 9 months ago
- Conformer block with Rotary Position Embedding, modified from lucidrains' implement☆16Updated last year
- Implementation of 'Vocos: Closing the gap between time-domain and Fourier-based neural vocoders for high-quality audio synthesis', in MLX☆22Updated last year
- Supervoice diffusion enhance☆27Updated last year
- Speaker diarization service☆25Updated 5 months ago
- A lightweight Python library for running TTS models with a unified API.☆21Updated 9 months ago
- ☆17Updated 4 years ago
- Trying to build an all in one speech-text language model - a bit like GPT-4o☆22Updated last year