coqui-ai / coqui-voice-packLinks
πΈCoqui Dialogue Audio Pack contains more than 2000 audio files of synthetic human voices over dialogue created specifically for video games. The pack includes both male and female voices from >30 different voices, and all of the files can be used for commercial purposes (royalty free).
β42Updated 2 years ago
Alternatives and similar repositories for coqui-voice-pack
Users that are interested in coqui-voice-pack are comparing it to the libraries listed below
Sorting:
- Coqui AI TTS pluginβ87Updated 3 months ago
- β74Updated last year
- A python library to find differences between audio and transcriptionsβ19Updated last year
- πΈ - A general purpose model trainer, as flexible as it getsβ226Updated last year
- π π€ Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloningβ157Updated last year
- Experimental sampler to make LLMs more creativeβ31Updated 2 years ago
- Run Vision LLMs, TTS and STT APIs. Website and API for https://text-generator.ioβ36Updated last month
- This public GitHub repository contains code for a fully self-hosted, on-premise transcription solution.β54Updated 10 months ago
- convert a saved pytorch model to gguf and generate as much corresponding ggml c code as possibleβ15Updated last year
- β99Updated last year
- A curated list of awesome OpenAI's Whisperβ98Updated 2 years ago
- Dynamic parameter modulation for oobabooga's text-generation-webui that adjusts generation parameters to better mirror user affect.β34Updated 2 years ago
- Conversational Language model toolkit for training against human preferences.β41Updated last year
- Code for OpenAI Whisper Web App Demoβ93Updated 3 years ago
- Automatically generates TTS dataset using audio and associated text. Make cuts under a custom length. Uses Google Speech to text API to pβ¦β52Updated 3 years ago
- Sing an idea β‘οΈ AI music sampleπ₯πΆβ117Updated last year
- text-to-audio-latent-diffusionβ37Updated 2 years ago
- β18Updated 3 years ago
- Auto-Video maker handling many AI'sβ10Updated last year
- β83Updated last year
- β20Updated 2 months ago
- Fork of AudioLDM as a TuneFlow pluginβ41Updated 2 years ago
- β62Updated last year
- A collection of notebooks for the Hugging Face blog series (https://huggingface.co/blog).β45Updated last year
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster.β68Updated last month
- Voxtral: Convert Mistral into a end2end SpeechLM. No information bottleneck, preserves prosody, learns interruptions from data. Unlike GPβ¦β35Updated 7 months ago
- Site for sharing Bark voicesβ51Updated 6 months ago
- BlinkDL's RWKV-v4 running in the browserβ46Updated 2 years ago
- AI 3D avatar voice interface in browser. VAD -> STT -> LLM -> TTS -> VRM (Prototype/Proof-of-Concept)β71Updated 2 years ago
- C++ library for converting text to phonemes for Piperβ134Updated 3 months ago