coqui-ai / coqui-voice-packLinks
πΈCoqui Dialogue Audio Pack contains more than 2000 audio files of synthetic human voices over dialogue created specifically for video games. The pack includes both male and female voices from >30 different voices, and all of the files can be used for commercial purposes (royalty free).
β42Updated 2 years ago
Alternatives and similar repositories for coqui-voice-pack
Users that are interested in coqui-voice-pack are comparing it to the libraries listed below
Sorting:
- Coqui AI TTS pluginβ85Updated 5 months ago
- Sing an idea β‘οΈ AI music sampleπ₯πΆβ119Updated last year
- πΈ - A general purpose model trainer, as flexible as it getsβ230Updated last year
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster.β69Updated last month
- This public GitHub repository contains code for a fully self-hosted, on-premise transcription solution.β53Updated last year
- β18Updated 3 years ago
- C++ library for converting text to phonemes for Piperβ137Updated 5 months ago
- β75Updated last year
- π π€ Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloningβ161Updated last year
- Code for OpenAI Whisper Web App Demoβ93Updated 3 years ago
- β107Updated 2 years ago
- Engage in conversation with your virtual self using AI techniques like NLP, voice cloning, and computer vision. Get accurate answers withβ¦β84Updated 2 years ago
- A python library to find differences between audio and transcriptionsβ19Updated 2 years ago
- Interface for using TTS and vocoder models in the form of a text editorβ19Updated 3 weeks ago
- Site for sharing Bark voicesβ51Updated 8 months ago
- convert a saved pytorch model to gguf and generate as much corresponding ggml c code as possibleβ15Updated 2 years ago
- AI 3D avatar voice interface in browser. VAD -> STT -> LLM -> TTS -> VRM (Prototype/Proof-of-Concept)β72Updated 2 years ago
- Conversational Language model toolkit for training against human preferences.β42Updated last year
- A cog implementation of MosaicML's MPT-7B-StoryWriter-65k+ Large Language Modelβ57Updated 2 years ago
- text-to-audio-latent-diffusionβ37Updated 2 years ago
- Voxtral: Convert Mistral into a end2end SpeechLM. No information bottleneck, preserves prosody, learns interruptions from data. Unlike GPβ¦β37Updated 9 months ago
- Experimental sampler to make LLMs more creativeβ31Updated 2 years ago
- Fork of AudioLDM as a TuneFlow pluginβ41Updated 2 years ago
- β15Updated last year
- β62Updated last year
- A collection of notebooks for the Hugging Face blog series (https://huggingface.co/blog).β46Updated last year
- Coqui STT Model Manager - install, manage and try out Coqui STT models from the Model Zooβ25Updated 2 years ago
- A lightweight Python library for running TTS models with a unified API.β21Updated 10 months ago
- π¬ "Realtime" voice transcription and cloning using ElevenLabs's API.β54Updated 2 years ago
- Cog wrapper for collabora/WhisperSpeechβ25Updated last year