neonbjb / tortoise-tts
A multi-voice TTS system trained with an emphasis on quality
☆13,707 · Updated 3 months ago
Alternatives and similar repositories for tortoise-tts:
Users who are interested in tortoise-tts compare it to the libraries listed below.
- 🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production ☆37,794 · Updated 6 months ago
- 🔊 Text-Prompted Generative Audio Model ☆36,988 · Updated 6 months ago
- StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models ☆5,465 · Updated 6 months ago
- 🔊 Text-prompted Generative Audio Model - With the ability to clone voices ☆3,245 · Updated 8 months ago
- An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io/vallex/ ☆7,788 · Updated last year
- so-vits-svc fork with realtime support, improved interface and more features. ☆8,894 · Updated this week
- JAX implementation of OpenAI's Whisper model for up to 70x speed-up on TPU. ☆4,542 · Updated 10 months ago
- An unofficial PyTorch implementation of the audio LM VALL-E ☆2,985 · Updated last year
- Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts) ☆9,636 · Updated last year
- Invoke is a leading creative engine for Stable Diffusion models, empowering professionals, artists, and enthusiasts to generate and creat… ☆24,484 · Updated this week
- [CVPR 2023] SadTalker: Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation ☆12,341 · Updated 7 months ago
- Stable diffusion for real-time music generation ☆3,538 · Updated 6 months ago
- Faster Whisper transcription with CTranslate2 ☆14,234 · Updated last month
- StableLM: Stability AI Language Models ☆15,831 · Updated 10 months ago
- An Open Source text-to-speech system built by inverting Whisper. ☆4,120 · Updated 2 months ago
- A Gradio web UI for Large Language Models with support for multiple inference backends. ☆42,540 · Updated this week
- This repository contains the codes of "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Mult… ☆11,367 · Updated last week
- Community interface for generative AI ☆8,930 · Updated 9 months ago
- OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamical… ☆37,238 · Updated 6 months ago
- State-of-the-art deep learning based audio codec supporting both mono 24 kHz audio and stereo 48 kHz audio. ☆3,598 · Updated last year
- Silero Models: pre-trained speech-to-text, text-to-speech and text-enhancement models made embarrassingly simple ☆5,144 · Updated last year
- TTS Generation Web UI (Bark, MusicGen + AudioGen, Tortoise, RVC, Vocos, Demucs, SeamlessM4T, MAGNet, StyleTTS2, MMS, Stable Audio, Mars5,… ☆2,016 · Updated this week
- Foundational model for human-like, expressive TTS ☆4,035 · Updated 6 months ago
- Implementation of AudioLM, a SOTA Language Modeling Approach to Audio Generation out of Google Research, in Pytorch ☆2,502 · Updated last month
- AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Head ☆10,096 · Updated 7 months ago
- Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor… ☆21,505 · Updated last month
- ☆7,751 · Updated 10 months ago
- WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization) ☆13,990 · Updated this week
- Port of OpenAI's Whisper model in C/C++ ☆37,876 · Updated this week
- A fast, local neural text to speech system ☆7,906 · Updated 4 months ago