fakerybakery / simpletts
A lightweight Python library for running TTS models with a unified API.
☆16 Updated this week
Alternatives and similar repositories for simpletts:
Users who are interested in simpletts are comparing it to the libraries listed below.
- Speech to Speech conversation using the OpenAI RealTime API in Python ☆22 Updated 3 months ago
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster. ☆57 Updated last week
- ☆62 Updated 6 months ago
- BUD-E (Buddy) is an open-source voice assistant framework that facilitates seamless interaction with AI models and APIs, enabling the cre… ☆17 Updated 4 months ago
- Legible, Scalable, Reproducible Foundation Models with Named Tensors and Jax ☆12 Updated 8 months ago
- A simple system for 2-way interruptible voice interactions between human and LLM ☆22 Updated last year
- 🍳 AyaMCooking is a Voice-to-Voice Multilingual RAG Agent that makes a perfect sous chef for your kitchen, in up to 10 languages 🧑‍🍳 ☆21 Updated 3 months ago
- Trying to build an all in one speech-text language model - a bit like GPT-4o ☆22 Updated 8 months ago
- An open source NLP as a service project focused on providing state of the art systems with ease. Training and inference by simple docker … ☆20 Updated 5 months ago
- Cog wrapper for collabora/WhisperSpeech ☆25 Updated 11 months ago
- This is a fork of the original fairseq repository (version 0.12.2) with added classes for training mHuBERT-147. ☆15 Updated 3 months ago
- Joint speech-language model - respond directly to audio! ☆30 Updated 9 months ago
- Tools for formatting large language model prompts. ☆12 Updated last year
- Speaker diarization service ☆21 Updated this week
- Apps that run on modal.com ☆12 Updated 8 months ago
- Acoustic Neighbor Embeddings ☆21 Updated 2 months ago
- Seamless Voice Interactions with LLMs ☆11 Updated last year
- Convert your PDFs into audiobooks effortlessly. Features intelligent text extraction, customizable text-to-speech settings, and efficient… ☆44 Updated last week
- StyleTTS 2 Optimized Training Fork ☆22 Updated 2 weeks ago
- Dippy Synthetic Speech Subnet ☆15 Updated this week
- A collection of templates for Hugging Face Spaces ☆35 Updated last year
- Implementation of 'Vocos: Closing the gap between time-domain and Fourier-based neural vocoders for high-quality audio synthesis', in MLX ☆16 Updated 3 months ago
- Generate audio datasets for training Text-To-Speech models, through smart audio splitting with silence detection, and transcription using… ☆28 Updated last year
- Generate visual podcasts about novels using open source models ☆25 Updated 2 years ago
- Audio tokenization, in the fastest way possible! ☆48 Updated 5 months ago
- Embarrassingly Easy Fully Non-Autoregressive Zero-Shot TTS (E2 TTS) in MLX ☆26 Updated 4 months ago
- ☆16 Updated last year
- A clone of OpenAI's Tokenizer page for HuggingFace Models ☆44 Updated last year
- ☆30 Updated last year
- This app is intended to automatically create a corpus for ASR systems using pseudo-labeling. ☆27 Updated last year