fakerybakery / simpletts
A lightweight Python library for running TTS models with a unified API.
★17 · Updated last month
Alternatives and similar repositories for simpletts:
Users who are interested in simpletts are comparing it to the libraries listed below.
- AyaMCooking is a Voice-to-Voice Multilingual RAG Agent that makes a perfect sous chef for your kitchen, in up to 10 languages. ★21 · Updated 5 months ago
- Speech to Speech conversation using the OpenAI RealTime API in Python ★23 · Updated 4 months ago
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster. ★60 · Updated last week
- Cog wrapper for collabora/WhisperSpeech ★25 · Updated last year
- ★20 · Updated last year
- [WIP] AI Try-On plugin for Chrome ★27 · Updated last year
- An open-source NLP-as-a-service project focused on providing state-of-the-art systems with ease. Training and inference by simple docker … ★20 · Updated 6 months ago
- Zeta implementation of a reusable, plug-and-play feedforward from the paper "Exponentially Faster Language Modeling" ★15 · Updated 4 months ago
- Convert your PDFs and EPUBs into audiobooks effortlessly. Features intelligent text extraction, customizable text-to-speech settings, and… ★56 · Updated last week
- ★12 · Updated last year
- BUD-E (Buddy) is an open-source voice assistant framework that facilitates seamless interaction with AI models and APIs, enabling the cre… ★17 · Updated 5 months ago
- The Benefits of a Concise Chain of Thought on Problem Solving in Large Language Models ★21 · Updated 3 months ago
- ★62 · Updated 7 months ago
- ★17 · Updated last year
- This library supports evaluating disparities in generated image quality, diversity, and consistency between geographic regions. ★20 · Updated 9 months ago
- A clone of OpenAI's Tokenizer page for HuggingFace Models ★45 · Updated last year
- Seamless Voice Interactions with LLMs ★12 · Updated last year
- A Python library to find differences between audio and transcriptions ★16 · Updated last year
- Embarrassingly Easy Fully Non-Autoregressive Zero-Shot TTS (E2 TTS) in MLX ★27 · Updated 5 months ago
- A list of language models with permissive licenses such as MIT or Apache 2.0 ★24 · Updated 3 weeks ago
- Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor… ★57 · Updated 11 months ago
- Speaker diarization service ★21 · Updated last month
- Apps that run on modal.com ★12 · Updated 9 months ago
- ★26 · Updated last year
- Implementation of 'Vocos: Closing the gap between time-domain and Fourier-based neural vocoders for high-quality audio synthesis', in MLX ★16 · Updated 4 months ago
- ★29 · Updated last year