austin-bowen / voiceboxLinks
Python text-to-speech library with built-in voice effects and support for multiple TTS engines
☆25Updated 8 months ago
Alternatives and similar repositories for voicebox
Users that are interested in voicebox are comparing it to the libraries listed below
Sorting:
- An open source NLP as a service project focused on providing state of the art systems with ease. Training and inference by simple docker …☆20Updated last year
- WebRTC-based real-time audio streaming with Faster Whisper ASR integration for live speech-to-text transcription.☆13Updated last year
- Implementation of SoundtStream from the paper: "SoundStream: An End-to-End Neural Audio Codec"☆12Updated 9 months ago
- ☆19Updated 8 months ago
- Rust bindings for CTranslate2☆14Updated 2 years ago
- [NCMMSC'2024] Emotion-Aware Prosodic Phrasing for Expressive Text-to-Speech☆22Updated last year
- HuggingChat like UI in Gradio☆70Updated 2 years ago
- Legible, Scalable, Reproducible Foundation Models with Named Tensors and Jax☆15Updated last year
- Open TTS models, built for streaming on the edge☆44Updated 8 months ago
- A lightweight Python library for running TTS models with a unified API.☆21Updated 9 months ago
- The YouTube Text-To-Speech dataset is comprised of waveform audio extracted from YouTube videos alongside their English transcriptions☆52Updated 4 years ago
- Demo combining Whisper for speech recognition and Google TTS for speech synthesis to interact with Alpaca-LoRA.☆19Updated last year
- Demo python script app to interact with llama.cpp server using whisper API, microphone and webcam devices.☆46Updated 2 years ago
- Sentence Embedding as a Service☆15Updated 4 months ago
- Creates video from TTS output and viseme images.☆15Updated 3 years ago
- Multimodal Open Source Framework for Conversational Agent Research and Development.☆21Updated 9 months ago
- Modify-Anything is based on yolov5,yolov8 for video and image detection. Segment-anything,lama_cleaner is applied to segment, modify, era…☆17Updated 2 years ago
- Trying to deconstruct RWKV in understandable terms☆14Updated 2 years ago
- Finetune any model on HF in less than 30 seconds☆55Updated last month
- ☆11Updated 2 years ago
- Simple Python library, distributed via binary wheels with few direct dependencies, for easily using wav2vec 2.0 models for speech recogni…☆24Updated 4 years ago
- BUD-E (Buddy) is an open-source voice assistant framework that facilitates seamless interaction with AI models and APIs, enabling the cre…☆23Updated last year
- Zero-Shot Foreign Accent Conversion without a Native Reference☆35Updated last year
- ☆62Updated last year
- ☆22Updated 2 months ago
- GGML implementation of BERT model with Python bindings and quantization.☆56Updated last year
- ☆44Updated last year
- 🦁 Nala is an agile open-source voice assistant framework (20+ actions).☆35Updated 2 years ago
- Audio tokenization, in the fastest way possible!☆53Updated last year
- ☆16Updated 4 years ago