OHF-Voice / voice-datasetsLinks
Public voice datasets used for our Text-to-Speech voices.
☆45Updated 6 months ago
Alternatives and similar repositories for voice-datasets
Users that are interested in voice-datasets are comparing it to the libraries listed below
Sorting:
- Voice models for Mimic 3 text to speech system☆158Updated last year
- Coqui AI TTS plugin☆85Updated 5 months ago
- Coqui STT Model Manager - install, manage and try out Coqui STT models from the Model Zoo☆25Updated 2 years ago
- ez audio transcription tool with flexible processing and post-processing options☆160Updated last year
- C++ library for converting text to phonemes for Piper☆135Updated 5 months ago
- A fast, local neural text to speech system☆16Updated 9 months ago
- zero-shot realtime TTS system, fully offline, free and open source☆48Updated 7 months ago
- Provides Piper neural text-to-speech voices as a browser extension☆69Updated last week
- streaming speech to text server using Whisper☆98Updated 2 years ago
- Interface for using TTS and vocoder models in the form of a text editor☆19Updated 2 weeks ago
- On-device streaming text-to-speech engine powered by deep learning☆121Updated 3 weeks ago
- An even smaller speech recognizer / force aligner☆37Updated 11 months ago
- web based editor for subtitles and transcripts☆141Updated last year
- TTS Client for Coqui TTS server☆13Updated 2 years ago
- ☆100Updated last year
- SEPIA server to support open-source speech recognition via WebSocket connection.☆134Updated last year
- Faster distil-whisper transcription with CTranslate2☆14Updated last year
- Turn any common eBook file into an HQ Audiobook with F5-TTS (Easy Install)☆29Updated this week
- A random walk voice style cloning application for Kokoro text to speech☆184Updated 5 months ago
- A simple, but performant framework for mapping speech directly to categories and intents.☆22Updated last year
- Local voice recording for creating Piper datasets☆188Updated 6 months ago
- Go from raw audio files to a text-audio dataset automatically with OpenAI's Whisper.☆137Updated 2 years ago
- 🔊 Create labeled datasets, enhance audio quality, identify speakers, support diverse dataset types. 🎧👥📊 Advanced audio processing.☆255Updated last year
- A highly optimized engine for neutts-air model to generate minutes of audio in seconds. Over 200x realtime on modern hardware!☆57Updated 2 weeks ago
- Karaoke Player / Editor with automatic clip creation from any song file using vocals and lyrics extraction (Speech-to-Text)☆82Updated 2 years ago
- Real-Time Whisper Voice Recognition with vosk model feedback.☆121Updated 2 years ago
- The EveryVoice TTS Toolkit - Text To Speech for your language☆41Updated this week
- 🐍 🤖 Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloning☆161Updated last year
- ☕🇧🇷 Scripts para o Kaldi em Português Brasileiro☆59Updated 3 years ago
- This public GitHub repository contains code for a fully self-hosted, on-premise transcription solution.☆52Updated last year