fakerybakery / simplettsLinks
A lightweight Python library for running TTS models with a unified API.
☆18Updated 3 months ago
Alternatives and similar repositories for simpletts
Users that are interested in simpletts are comparing it to the libraries listed below
Sorting:
- Open TTS models, built for streaming on the edge☆43Updated 2 months ago
- a simple system for 2-way interruptible voice interactions between human and LLM☆29Updated last year
- Open-source and reproducible benchmarks for Speaker Diarization☆25Updated last month
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster.☆62Updated this week
- BUD-E (Buddy) is an open-source voice assistant framework that facilitates seamless interaction with AI models and APIs, enabling the cre…☆19Updated 7 months ago
- Cog wrapper for collabora/WhisperSpeech☆24Updated last year
- ☆15Updated 2 months ago
- Audio tokenization, in the fastest way possible!☆52Updated 9 months ago
- Joint speech-language model - respond directly to audio!☆30Updated last year
- ☆62Updated 10 months ago
- proof of concept conversation orchestrator with a speech-language model☆20Updated 7 months ago
- ☆43Updated 3 months ago
- An open source NLP as a service project focused on providing state of the art systems with ease. Training and inference by simple docker …☆20Updated 8 months ago
- Speech to Speech conversation using the OpenAI RealTime API in Python 🐍☆26Updated 6 months ago
- Embarrassingly Easy Fully Non-Autoregressive Zero-Shot TTS (E2 TTS) in MLX☆27Updated 7 months ago
- This is a fork of the original fairseq repository (version 0.12.2) with added classes for training mHuBERT-147.☆17Updated 6 months ago
- Using modal.com to process FineWeb-edu data☆20Updated last month
- Generate audio datasets for training Text-To-Speech models, through smart audio splitting with silence detection, and transcription using…☆28Updated 2 years ago
- Official code for "F5R-TTS: Improving Flow-Matching based Text-to-Speech with Group Relative Policy Optimization"☆38Updated this week
- ☆22Updated last year
- Code associated with the paper: CTC-DRO: Robust Optimization for Reducing Language Disparities in Speech Recognition.☆15Updated 2 weeks ago
- Apps that run on modal.com☆12Updated last year
- Trying to build an all in one speech-text language model - a bit like GPT-4o☆22Updated last year
- Tools for formatting large language model prompts.☆13Updated last year
- [WIP] AI Try-On plugin for Chrome☆27Updated last year
- ☆224Updated this week
- Whisper Speaker Identification (WSI), a cutting-edge model for multilingual speaker identification.☆19Updated 2 months ago
- Public reports detailing responses to sets of prompts by Large Language Models.☆30Updated 5 months ago
- ☆11Updated last month
- A clone of OpenAI's Tokenizer page for HuggingFace Models☆45Updated last year