OHF-Voice / voice-datasetsLinks
Public voice datasets used for our Text-to-Speech voices.
☆46Updated 7 months ago
Alternatives and similar repositories for voice-datasets
Users that are interested in voice-datasets are comparing it to the libraries listed below
Sorting:
- Voice models for Mimic 3 text to speech system☆161Updated last year
- A fast, local neural text to speech system☆17Updated 11 months ago
- Coqui STT Model Manager - install, manage and try out Coqui STT models from the Model Zoo☆26Updated 2 years ago
- Interface for using TTS and vocoder models in the form of a text editor☆19Updated 2 months ago
- Coqui AI TTS plugin☆85Updated 6 months ago
- Faster distil-whisper transcription with CTranslate2☆14Updated 2 years ago
- Toolkit for training/converting LibreTranslate compatible language models 🚂☆77Updated 7 months ago
- Provides Piper neural text-to-speech voices as a browser extension☆72Updated 3 weeks ago
- TTS Client for Coqui TTS server☆13Updated 3 years ago
- Karaoke Player / Editor with automatic clip creation from any song file using vocals and lyrics extraction (Speech-to-Text)☆87Updated 2 years ago
- Go from raw audio files to a text-audio dataset automatically with OpenAI's Whisper.☆137Updated 2 years ago
- Local voice recording for creating Piper datasets☆201Updated 7 months ago
- A deep-learning powered accessibility application which turns pdfs into audio files. Featuring ocr improvement and tts with inflection!☆25Updated 11 months ago
- SEPIA server to support open-source speech recognition via WebSocket connection.☆135Updated last year
- A free & open tool for transcribing audio interviews with offline ASR support☆25Updated 2 years ago
- Neural Text to speech model that is a perfect voice for a home assistant, audiobooks or for screen readers on Linux, Mac and Windows. A f…☆40Updated 2 years ago
- ez audio transcription tool with flexible processing and post-processing options☆161Updated last year
- streaming speech to text server using Whisper☆101Updated 2 years ago
- A highly optimized engine for neutts-air model to generate minutes of audio in seconds. Over 200x realtime on modern hardware!☆105Updated 2 months ago
- On-device streaming text-to-speech engine powered by deep learning☆127Updated this week
- Piper Tray is a lightweight system tray utility written in C# for use with Piper TTS.☆32Updated 2 months ago
- C++ library for converting text to phonemes for Piper☆137Updated 6 months ago
- 🔊 Create labeled datasets, enhance audio quality, identify speakers, support diverse dataset types. 🎧👥📊 Advanced audio processing.☆257Updated last year
- 🐍 🤖 Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloning☆161Updated last year
- Creates video from TTS output and viseme images.☆15Updated 3 years ago
- Open models for Coqui STT☆149Updated 2 years ago
- Wake word detection with custom phrases without model training☆24Updated 5 months ago
- A tokenizer, text cleaner, and phonemizer for many human languages.☆332Updated last year
- ☆170Updated last year
- zero-shot realtime TTS system, fully offline, free and open source☆50Updated 9 months ago