lucasnewman / vocos-mlx
Implementation of 'Vocos: Closing the gap between time-domain and Fourier-based neural vocoders for high-quality audio synthesis', in MLX
☆16Updated 5 months ago
Alternatives and similar repositories for vocos-mlx:
Users that are interested in vocos-mlx are comparing it to the libraries listed below
- Implementation of E2-TTS, "Embarrassingly Easy Fully Non-Autoregressive Zero-Shot TTS", in MLX☆20Updated 5 months ago
- Training hybrid models for dummies.☆20Updated 2 months ago
- BUD-E (Buddy) is an open-source voice assistant framework that facilitates seamless interaction with AI models and APIs, enabling the cre…☆18Updated 5 months ago
- ☆15Updated last year
- implementation of https://arxiv.org/pdf/2312.09299☆20Updated 8 months ago
- Embarrassingly Easy Fully Non-Autoregressive Zero-Shot TTS (E2 TTS) in MLX☆27Updated 5 months ago
- A python command-line tool to download & manage MLX AI models from Hugging Face.☆17Updated 7 months ago
- ANE accelerated embedding models!☆17Updated 3 months ago
- Shared personal notes created while working with the Apple MLX machine learning framework☆22Updated 9 months ago
- Cog wrapper for collabora/WhisperSpeech☆25Updated last year
- A list of language models with permissive licenses such as MIT or Apache 2.0☆24Updated last month
- ☆22Updated 9 months ago
- Example of finetuning CLIP to identify plants.☆11Updated 8 months ago
- The Benefits of a Concise Chain of Thought on Problem Solving in Large Language Models☆21Updated 4 months ago
- Profile your CoreML models directly from Python 🐍☆27Updated 5 months ago
- A lightweight Python library for running TTS models with a unified API.☆17Updated last month
- Zeta implementation of a reusable and plug in and play feedforward from the paper "Exponentially Faster Language Modeling"☆15Updated 4 months ago
- Using multiple LLMs for ensemble Forecasting☆16Updated last year
- This repository shows how to use Q8 kernels with `diffusers` to optimize inference of LTX-Video on ADA GPUs.☆15Updated 2 months ago
- Github repo for Peifeng's internship project☆14Updated last year
- ☆32Updated 9 months ago
- Open TTS models, built for streaming on the edge☆39Updated 2 weeks ago
- An open source replication of the stawberry method that leverages Monte Carlo Search with PPO and or DPO☆28Updated 3 weeks ago
- a simple system for 2-way interruptible voice interactions between human and LLM☆25Updated last year
- ☆28Updated last year
- ☆16Updated last year
- Rust bindings for CTranslate2☆14Updated last year
- This library supports evaluating disparities in generated image quality, diversity, and consistency between geographic regions.☆20Updated 9 months ago
- ☆26Updated last year
- Fine-tune of Florence-2 for shot categorization.☆22Updated 3 weeks ago