austin-bowen / voicebox
Python text-to-speech library with built-in voice effects and support for multiple TTS engines
☆23 Updated 2 months ago
Alternatives and similar repositories for voicebox
Users who are interested in voicebox are comparing it to the libraries listed below.
- An open source NLP as a service project focused on providing state of the art systems with ease. Training and inference by simple docker … ☆20 Updated 8 months ago
- CogNetX is an advanced, multimodal neural network architecture inspired by human cognition. It integrates speech, vision, and video proce… ☆14 Updated last week
- A pipeline to generate user-preferred photo-realistic avatars using stable-diffusion and bayesian-optimization. ☆18 Updated 3 weeks ago
- Implementation of SoundStream from the paper: "SoundStream: An End-to-End Neural Audio Codec" ☆12 Updated 4 months ago
- GreenLIT: Using GPT-J with Multi-Task Learning to Create New Screenplays ☆17 Updated 2 years ago
- Voice cloning using coqui-TTS ☆11 Updated last year
- My personal implementation of the model from "Qwen-VL: A Frontier Large Vision-Language Model with Versatile Abilities", they haven't rel… ☆12 Updated last year
- Rust bindings for CTranslate2 ☆14 Updated last year
- Train a production grade GPT in less than 400 lines of code. Better than Karpathy's version and GIGAGPT ☆15 Updated last week
- Easily turn large sets of audio urls to an audio dataset. ☆21 Updated 2 years ago
- Implementation of 'Vocos: Closing the gap between time-domain and Fourier-based neural vocoders for high-quality audio synthesis', in MLX ☆19 Updated 7 months ago
- A python library to find differences between audio and transcriptions ☆20 Updated last year
- ☆13 Updated last year
- GPT-jax based on the official huggingface library ☆13 Updated 3 years ago
- Lyra V2 (SoundStream) running in the browser ☆18 Updated last year
- A Full-Duplex Open-Domain Dialogue Agent with Continuous Turn-Taking Behavior ☆31 Updated last year
- Cog wrapper for collabora/WhisperSpeech ☆25 Updated last year
- The simplest, fastest repository for training/finetuning medium-sized GPTs. ☆19 Updated 2 years ago
- A simple package for leveraging Falcon 180B and the HF ecosystem's tools, including training/inference scripts, safetensors, integrations… ☆13 Updated last year
- audio, NLP, ML with huggingface, nvidia/nemo, speechbrain ☆11 Updated last year
- [NCMMSC'2024] Emotion-Aware Prosodic Phrasing for Expressive Text-to-Speech ☆22 Updated 9 months ago
- ☆15 Updated 2 months ago
- Integrate an LLM copilot within your Keras model development workflow ☆28 Updated last year
- Implementation of the LDP module block in PyTorch and Zeta from the paper: "MobileVLM: A Fast, Strong and Open Vision Language Assistant … ☆15 Updated last year
- Cantonese Grapheme-to-Phoneme Converter based on GitYCC/g2pW ☆13 Updated 5 months ago
- implementation of https://arxiv.org/pdf/2312.09299 ☆20 Updated 11 months ago
- Simple text to phonemes converter for multiple languages ☆20 Updated 2 years ago
- Universal text classifier for generative models ☆24 Updated 10 months ago
- ☆11 Updated last year
- Zeta implementation of a reusable and plug in and play feedforward from the paper "Exponentially Faster Language Modeling" ☆16 Updated 6 months ago