austin-bowen / voicebox
Python text-to-speech library with built-in voice effects and support for multiple TTS engines
☆23Updated 3 months ago
Alternatives and similar repositories for voicebox
Users who are interested in voicebox compare it to the libraries listed below.
- An open source NLP as a service project focused on providing state of the art systems with ease. Training and inference by simple docker …☆20Updated 9 months ago
- Rust bindings for CTranslate2☆14Updated 2 years ago
- Turning films into structured data to unlock the vast wealth of emotional knowledge within.☆30Updated 3 years ago
- Monkey Island fine-tune of Stable Diffusion☆10Updated 2 years ago
- Creates video from TTS output and viseme images.☆12Updated 3 years ago
- WebRTC-based real-time audio streaming with Faster Whisper ASR integration for live speech-to-text transcription.☆12Updated 9 months ago
- Demo Python script app to interact with llama.cpp server using whisper API, microphone and webcam devices.☆46Updated last year
- audio, NLP, ML with huggingface, nvidia/nemo, speechbrain☆11Updated last year
- A repo with scripts to test and play around with Facebook's recent llama models! 🤗☆28Updated last year
- A corpus of speech from the Joe Rogan Experience podcast, consisting of 8.43 million words. It includes aligned TextGrids with phonetic a…☆18Updated 5 years ago
- Implementation of 'Vocos: Closing the gap between time-domain and Fourier-based neural vocoders for high-quality audio synthesis', in MLX☆20Updated 7 months ago
- YouTube Assistant☆12Updated 2 years ago
- Fast and accurate natural language detection. Detector written in Python. Nito-ELD, ELD.☆17Updated last year
- ☆15Updated 3 months ago
- A bidirectional recurrent neural network model with attention mechanism for restoring missing punctuation in unsegmented text☆36Updated 4 years ago
- This will hold the crowdsourcing platform to be used to store voice data from various speakers which will act as input dataset for speech…☆17Updated 2 years ago
- GreenLIT: Using GPT-J with Multi-Task Learning to Create New Screenplays☆17Updated 2 years ago
- Generate audio datasets for training Text-To-Speech models, through smart audio splitting with silence detection, and transcription using…☆28Updated 2 years ago
- ☆18Updated 2 years ago
- Tool for the Automatic Assessment of Lexical Diversity☆12Updated 4 years ago
- A composition of offline tools to achieve high quality multilingual speech to text transcription☆18Updated 3 weeks ago
- A Python library to find differences between audio and transcriptions☆20Updated last year
- A lightweight Python library for running TTS models with a unified API.☆20Updated 4 months ago
- Parallelized automatic corpus collection for ASR. Forked from https://github.com/EgorLakomkin/KTSpeechCrawler☆24Updated 4 years ago
- The simplest, fastest repository for training/finetuning medium-sized GPTs.☆19Updated 2 years ago
- Stable diffusion dataset editor made with NiceGUI☆9Updated 2 years ago
- Unofficial Bark API☆9Updated last year
- Minimal, clean code for video/image "patchnization" - a process commonly used in tokenizing visual data for use in a Transformer encoder.…☆11Updated last year
- GGML implementation of BERT model with Python bindings and quantization.☆55Updated last year
- TTS Client for Coqui TTS server☆13Updated 2 years ago