coqui-ai / coqui-voice-packLinks
πΈCoqui Dialogue Audio Pack contains more than 2000 audio files of synthetic human voices over dialogue created specifically for video games. The pack includes both male and female voices from >30 different voices, and all of the files can be used for commercial purposes (royalty free).
β42Updated 2 years ago
Alternatives and similar repositories for coqui-voice-pack
Users that are interested in coqui-voice-pack are comparing it to the libraries listed below
Sorting:
- πΈ - A general purpose model trainer, as flexible as it getsβ227Updated last year
- Coqui AI TTS pluginβ87Updated 4 months ago
- This public GitHub repository contains code for a fully self-hosted, on-premise transcription solution.β54Updated 11 months ago
- Conversational Language model toolkit for training against human preferences.β42Updated last year
- β75Updated last year
- Babylon.cpp is a C and C++ library for grapheme to phoneme conversion and text to speech synthesis. For phonemization a ONNX runtime portβ¦β25Updated 2 months ago
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster.β68Updated last month
- π« check your data, before you wreck your modelβ16Updated 3 years ago
- Fork of AudioLDM as a TuneFlow pluginβ41Updated 2 years ago
- convert a saved pytorch model to gguf and generate as much corresponding ggml c code as possibleβ15Updated last year
- π π€ Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloningβ160Updated last year
- β62Updated last year
- A huggingface pipeline to train a gpt model based on the transcript obtained byt the Open AI whisper modelβ17Updated 2 years ago
- A python library to find differences between audio and transcriptionsβ19Updated 2 years ago
- A curated list of awesome OpenAI's Whisperβ98Updated 2 years ago
- Coqui STT Model Manager - install, manage and try out Coqui STT models from the Model Zooβ25Updated 2 years ago
- β83Updated last year
- Implementation of Sesame's Conversational Speech Model for Hugging Face Transformersβ57Updated 6 months ago
- πΈTTS recipes for different datasetsβ86Updated 3 years ago
- π Text-prompted Generative Audio Model - With the ability to clone voicesβ19Updated 2 years ago
- text-to-audio-latent-diffusionβ37Updated 2 years ago
- Joint speech-language model - respond directly to audio!β30Updated last year
- A cog implementation of MosaicML's MPT-7B-StoryWriter-65k+ Large Language Modelβ57Updated 2 years ago
- Sing an idea β‘οΈ AI music sampleπ₯πΆβ118Updated last year
- Experimental sampler to make LLMs more creativeβ31Updated 2 years ago
- β18Updated 3 years ago
- Voxtral: Convert Mistral into a end2end SpeechLM. No information bottleneck, preserves prosody, learns interruptions from data. Unlike GPβ¦β36Updated 8 months ago
- C++ library for converting text to phonemes for Piperβ134Updated 4 months ago
- Heteronym to Phoneme Parserβ18Updated 2 years ago
- Site for sharing Bark voicesβ51Updated 7 months ago