lucasnewman / vocos-mlx
Implementation of 'Vocos: Closing the gap between time-domain and Fourier-based neural vocoders for high-quality audio synthesis', in MLX
☆16Updated 2 months ago
Alternatives and similar repositories for vocos-mlx:
Users that are interested in vocos-mlx are comparing it to the libraries listed below
- Implementation of E2-TTS, "Embarrassingly Easy Fully Non-Autoregressive Zero-Shot TTS", in MLX☆19Updated 3 months ago
- Embarrassingly Easy Fully Non-Autoregressive Zero-Shot TTS (E2 TTS) in MLX☆26Updated 3 months ago
- implementation of https://arxiv.org/pdf/2312.09299☆20Updated 6 months ago
- Convert your PDFs into audiobooks effortlessly. Features intelligent text extraction, customizable text-to-speech settings, and efficient…☆37Updated 2 weeks ago
- StyleTTS 2 Optimized Training Fork☆18Updated this week
- Shared personal notes created while working with the Apple MLX machine learning framework☆21Updated 7 months ago
- An open source replication of the stawberry method that leverages Monte Carlo Search with PPO and or DPO☆26Updated last week
- Implementation of "Audio xLSTMs: Learning Self-supervised audio representations with xLSTMs" in PyTorch☆17Updated this week
- G2P☆35Updated this week
- This library supports evaluating disparities in generated image quality, diversity, and consistency between geographic regions.☆20Updated 7 months ago
- A python command-line tool to download & manage MLX AI models from Hugging Face.☆17Updated 5 months ago
- BUD-E (Buddy) is an open-source voice assistant framework that facilitates seamless interaction with AI models and APIs, enabling the cre…☆17Updated 3 months ago
- Training hybrid models for dummies.☆18Updated 2 weeks ago
- Minimal, clean code implementation of RAG with mlx using gguf model weights☆46Updated 9 months ago
- Example of finetuning CLIP to identify plants.☆10Updated 6 months ago
- Legible, Scalable, Reproducible Foundation Models with Named Tensors and Jax☆12Updated 7 months ago
- A list of language models with permissive licenses such as MIT or Apache 2.0☆24Updated 2 months ago
- The Benefits of a Concise Chain of Thought on Problem Solving in Large Language Models☆21Updated 2 months ago
- A lightweight Python library for running TTS models with a unified API.☆16Updated 2 weeks ago
- [WIP] Transformer to embed Danbooru labelsets☆13Updated 9 months ago
- A collection of optimizers for MLX☆29Updated this week
- A no-string framework for reasoning over your tabular data rows with any provided LLM☆13Updated this week
- Implementation of SoundtStream from the paper: "SoundStream: An End-to-End Neural Audio Codec"☆12Updated this week
- Measuring and Controlling Persona Drift in Language Model Dialogs☆15Updated 11 months ago
- Profile your CoreML models directly from Python 🐍☆26Updated 3 months ago
- ☆27Updated 5 months ago
- Easy to use, High Performant Knowledge Distillation for LLMs☆40Updated 2 weeks ago
- An example implementation of RLHF (or, more accurately, RLAIF) built on MLX and HuggingFace.☆22Updated 7 months ago
- mlx image models for Apple Silicon machines☆70Updated 2 months ago