lucasnewman / vocos-mlxLinks
Implementation of 'Vocos: Closing the gap between time-domain and Fourier-based neural vocoders for high-quality audio synthesis', in MLX
☆19Updated 7 months ago
Alternatives and similar repositories for vocos-mlx
Users that are interested in vocos-mlx are comparing it to the libraries listed below
Sorting:
- Implementation of E2-TTS, "Embarrassingly Easy Fully Non-Autoregressive Zero-Shot TTS", in MLX☆20Updated 7 months ago
- Embarrassingly Easy Fully Non-Autoregressive Zero-Shot TTS (E2 TTS) in MLX☆27Updated 7 months ago
- Open TTS models, built for streaming on the edge☆43Updated 2 months ago
- Implementation of "Audio xLSTMs: Learning Self-supervised audio representations with xLSTMs" in PyTorch☆18Updated last week
- Acoustic Neighbor Embeddings☆23Updated 5 months ago
- Shared personal notes created while working with the Apple MLX machine learning framework☆24Updated 2 weeks ago
- Thin wrapper around GGML to make life easier☆34Updated this week
- Open-source and reproducible benchmarks for Speaker Diarization☆26Updated last month
- BUD-E (Buddy) is an open-source voice assistant framework that facilitates seamless interaction with AI models and APIs, enabling the cre…☆19Updated 7 months ago
- Find out why your CoreML model isn't running on the Neural Engine!☆25Updated 11 months ago
- implementation of https://arxiv.org/pdf/2312.09299☆20Updated 11 months ago
- Legible, Scalable, Reproducible Foundation Models with Named Tensors and Jax☆14Updated 11 months ago
- Cog wrapper for collabora/WhisperSpeech☆25Updated last year
- Profile your CoreML models directly from Python 🐍☆27Updated 7 months ago
- Training hybrid models for dummies.☆21Updated 4 months ago
- A python command-line tool to download & manage MLX AI models from Hugging Face.☆17Updated 9 months ago
- Rust crate for some audio utilities☆23Updated 2 months ago
- 🎙️ Automatically transcribe audio/video into high-quality, speaker-specific Text-To-Speech datasets ✨☆38Updated 2 weeks ago
- Babylon.cpp is a C and C++ library for grapheme to phoneme conversion and text to speech synthesis. For phonemization a ONNX runtime port…☆21Updated 9 months ago
- ☆27Updated last year
- Minimal, clean code implementation of RAG with mlx using gguf model weights☆50Updated last year
- ANE accelerated embedding models!☆17Updated 5 months ago
- ☆62Updated 10 months ago
- A lightweight Python library for running TTS models with a unified API.☆18Updated 3 months ago
- Ultra-minimal autoregressive diffusion model for image generation☆19Updated 8 months ago
- MLX Swift implementation of Andrej Karpathy's Let's build GPT video☆58Updated last year
- ☆20Updated 2 weeks ago
- mlx image models for Apple Silicon machines☆80Updated last month
- ☆22Updated last year
- Example of finetuning CLIP to identify plants.☆11Updated 10 months ago