afiaka87 / tortoise-ttsLinks
A multi-voice TTS system trained with an emphasis on quality
β134Updated 2 years ago
Alternatives and similar repositories for tortoise-tts
Users that are interested in tortoise-tts are comparing it to the libraries listed below
Sorting:
- π Text-prompted Generative Audio Modelβ235Updated 2 years ago
- Audio datasets, easier.β84Updated last year
- TTS with The Massively Multilingual Speech (MMS) projectβ233Updated last year
- Oobabooga extension for Bark TTSβ119Updated last year
- XTTSv2 Extension for oobabooga text-generation-webuiβ154Updated last year
- The code for the bark-voicecloning model. Training and inference.β703Updated last year
- Add caption to any videoβ198Updated last year
- Fast TorToiSe inference (5x or your money back!)β827Updated last year
- Made slight modifications to the Tortoise API, provided 3 additional scripts to make using Tortoise easier. Less focus on cloning makes sβ¦β52Updated last year
- Site for sharing Bark voicesβ51Updated 3 months ago
- Let us control diffusion modelsβ202Updated 2 years ago
- Fine tune SDXL on YouTube videosβ174Updated 10 months ago
- Simplified installers for suno-ai/bark, musicgen, tortoise, RVC, demucs and vocosβ46Updated last year
- β149Updated 2 years ago
- β101Updated 10 months ago
- A curated list of amazing RunPod projects, libraries, and resourcesβ117Updated 10 months ago
- Kandinsky 2 β multilingual text2image latent diffusion modelβ87Updated last year
- Full python wrapper for the elevenlabs API.β157Updated last month
- Transcription with speaker diarization pipelineβ94Updated 2 years ago
- Diffusers / Stable Diffusion in docker with a REST API, supporting various models, pipelines & schedulers.β203Updated last year
- Long-Inference, High Quality Synthetic Speaker (AI avatar/ AI presenter)β261Updated 2 years ago
- β111Updated last year
- βοΈ | REPLACED BY https://github.com/runpod-workers | Official set of serverless worker provided by RunPod as endpoints.β60Updated last month
- π BARK INFINITY GUI CMD πΆ Powered Up Bark Text-prompted Generative Audio Modelβ1,010Updated last year
- Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressorβ¦β122Updated 3 weeks ago
- A front-end GUI for interacting with the AI Horde / Stable Diffusion distributed clusterβ182Updated 2 weeks ago
- β89Updated 2 years ago
- β97Updated last year
- Basic framework for training Dreambooth Stable Diffusion v1.5 on Banana's v1.0 serverless GPU platformβ37Updated 2 years ago
- The code for some apps built with Sieve.β81Updated 7 months ago