fakerybakery / simpletts
A lightweight Python library for running TTS models with a unified API.
☆17Updated 2 months ago
Alternatives and similar repositories for simpletts:
Users that are interested in simpletts are comparing it to the libraries listed below
- Open TTS models, built for streaming on the edge☆39Updated last month
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster.☆62Updated last week
- a simple system for 2-way interruptible voice interactions between human and LLM☆25Updated last year
- Cog wrapper for collabora/WhisperSpeech☆24Updated last year
- ☆11Updated last month
- Whisper Speaker Identification (WSI), a cutting-edge model for multilingual speaker identification.☆14Updated last month
- Implementation of 'Vocos: Closing the gap between time-domain and Fourier-based neural vocoders for high-quality audio synthesis', in MLX☆17Updated 5 months ago
- ☆62Updated 8 months ago
- Using modal.com to process FineWeb-edu data☆20Updated 2 weeks ago
- A clone of OpenAI's Tokenizer page for HuggingFace Models☆45Updated last year
- Speech to Speech conversation using the OpenAI RealTime API in Python 🐍☆24Updated 5 months ago
- ANE accelerated embedding models!☆16Updated 4 months ago
- BUD-E (Buddy) is an open-source voice assistant framework that facilitates seamless interaction with AI models and APIs, enabling the cre…☆19Updated 6 months ago
- Joint speech-language model - respond directly to audio!☆30Updated 11 months ago
- Code associated with the paper: CTC-DRO: Robust Optimization for Reducing Language Disparities in Speech Recognition.☆14Updated last month
- Apps that run on modal.com☆12Updated 10 months ago
- DSPy program/pipeline inspector widget for Jupyter/VSCode Notebooks.☆34Updated last year
- This is a fork of the original fairseq repository (version 0.12.2) with added classes for training mHuBERT-147.☆17Updated 5 months ago
- Legible, Scalable, Reproducible Foundation Models with Named Tensors and Jax☆14Updated 10 months ago
- Zeta implementation of a reusable and plug in and play feedforward from the paper "Exponentially Faster Language Modeling"☆16Updated 5 months ago
- VoiceStar: Robust, Duration-controllable TTS that can Extrapolate☆103Updated 2 weeks ago
- Seamless Voice Interactions with LLMs☆12Updated last year
- Audio tokenization, in the fastest way possible!☆51Updated 7 months ago
- ☆41Updated 2 months ago
- Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor…☆57Updated last year
- Speaker diarization service☆21Updated last week
- ☆19Updated last month
- Tools for formatting large language model prompts.☆12Updated last year
- Visualize multi-model embedding spaces. The first goal is to quickly get a lay of the land of any embedding space. Then be able to scroll…☆27Updated 11 months ago
- Using short models to classify long texts☆21Updated 2 years ago