afiaka87 / tortoise-ttsLinks
A multi-voice TTS system trained with an emphasis on quality
β136Updated 3 years ago
Alternatives and similar repositories for tortoise-tts
Users that are interested in tortoise-tts are comparing it to the libraries listed below
Sorting:
- TTS with The Massively Multilingual Speech (MMS) projectβ235Updated last year
- π Text-prompted Generative Audio Modelβ235Updated 2 years ago
- Fast TorToiSe inference (5x or your money back!)β825Updated last year
- Site for sharing Bark voicesβ51Updated 5 months ago
- Full python wrapper for the elevenlabs API.β158Updated 2 months ago
- Audio datasets, easier.β84Updated 2 years ago
- Oobabooga extension for Bark TTSβ118Updated last year
- β149Updated 2 years ago
- Made slight modifications to the Tortoise API, provided 3 additional scripts to make using Tortoise easier. Less focus on cloning makes sβ¦β52Updated last year
- Fine tune SDXL on YouTube videosβ176Updated last year
- π BARK INFINITY GUI CMD πΆ Powered Up Bark Text-prompted Generative Audio Modelβ1,011Updated last year
- Let us control diffusion modelsβ203Updated 2 years ago
- XTTSv2 Extension for oobabooga text-generation-webuiβ155Updated last year
- β101Updated last year
- so-vits-svcβ179Updated last month
- β87Updated 2 years ago
- The code for the bark-voicecloning model. Training and inference.β704Updated last year
- Add caption to any videoβ202Updated last year
- β111Updated 2 years ago
- Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressorβ¦β122Updated 2 months ago
- A curated list of amazing RunPod projects, libraries, and resourcesβ120Updated last year
- Basic framework for training Dreambooth Stable Diffusion v1.5 on Banana's v1.0 serverless GPU platformβ37Updated 2 years ago
- Diffusers / Stable Diffusion in docker with a REST API, supporting various models, pipelines & schedulers.β202Updated last year
- TorToiSe fine-tuning with DLASβ224Updated last year
- Transcription with speaker diarization pipelineβ94Updated 2 years ago
- Kandinsky 2 β multilingual text2image latent diffusion modelβ87Updated last year
- One-click launcher for Audiocraft MusicGen + AudioGen Gradio Web UIβ69Updated 2 years ago
- Long-Inference, High Quality Synthetic Speaker (AI avatar/ AI presenter)β263Updated 2 years ago
- β166Updated 2 years ago
- GradioUI for TortoiseTTS voice generationβ34Updated last year