NabuCasa / voice-datasets
Public voice datasets used for our Text-to-Speech voices.
☆30Updated 3 months ago
Related projects ⓘ
Alternatives and complementary repositories for voice-datasets
- Coqui STT Model Manager - install, manage and try out Coqui STT models from the Model Zoo☆24Updated last year
- TTS Client for Coqui TTS server☆13Updated last year
- Coqui AI TTS plugin☆69Updated 2 months ago
- Streamlit app to visualize and edit TTS datasets☆14Updated 2 years ago
- ☆20Updated 2 months ago
- A free & open tool for transcribing audio interviews with offline ASR support☆24Updated 11 months ago
- Convert native orthographies to the International Phonetic Alphabet☆13Updated 2 years ago
- Interface for using TTS and vocoder models in the form of a text editor☆19Updated 2 years ago
- Versatile AI-driven audio upscaler to enhance the quality of any audio.☆60Updated 2 months ago
- A universal version of TheLastBen's fast-stable-diffusion - no longer maintained☆11Updated last month
- Platform to transcribe Youtube links☆12Updated 3 months ago
- SEPIA server to support open-source speech recognition via WebSocket connection.☆120Updated 2 weeks ago
- C++ library for converting text to phonemes for Piper☆89Updated 8 months ago
- Generate audio datasets for training Text-To-Speech models, through smart audio splitting with silence detection, and transcription using…☆28Updated last year
- Open Voice OS container images and docker-compose.yml files for x86_64 and aarch64 CPU architectures.☆41Updated 3 weeks ago
- 🐸Coqui Dialogue Audio Pack contains more than 2000 audio files of synthetic human voices over dialogue created specifically for video ga…☆39Updated last year
- A fast MP3 decoder for python, using minimp3☆26Updated 2 years ago
- Voice models for Mimic 3 text to speech system☆131Updated 5 months ago
- Creates video from TTS output and viseme images.☆11Updated 2 years ago
- ☆87Updated 6 months ago
- Automatically generate and overlay subtitles for any video using OpenAi Whisper☆16Updated 2 years ago
- ☆18Updated 2 years ago
- Multivoice: Enhance your foreign-language movie and TV show experience with personalized dubbed versions. Our project uses voice cloning …☆24Updated last year
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster.☆45Updated 2 weeks ago
- A simple extension that uses Bark Text-to-Speech for audio output☆35Updated last year
- Text prompt steered synthetic audio generators☆45Updated 11 months ago
- Whatsapp Web Speech To Text☆53Updated last year
- streaming speech to text server using Whisper☆83Updated last year
- ☆68Updated 8 months ago
- 💬 "Realtime" voice transcription and cloning using ElevenLabs's API.☆49Updated last year