lucasnewman / vocos-mlx
Implementation of 'Vocos: Closing the gap between time-domain and Fourier-based neural vocoders for high-quality audio synthesis', in MLX
☆18Updated 6 months ago
Alternatives and similar repositories for vocos-mlx
Users that are interested in vocos-mlx are comparing it to the libraries listed below
Sorting:
- Implementation of E2-TTS, "Embarrassingly Easy Fully Non-Autoregressive Zero-Shot TTS", in MLX☆20Updated 7 months ago
- Shared personal notes created while working with the Apple MLX machine learning framework☆23Updated 10 months ago
- Embarrassingly Easy Fully Non-Autoregressive Zero-Shot TTS (E2 TTS) in MLX☆27Updated 7 months ago
- Training hybrid models for dummies.☆21Updated 4 months ago
- Implementation of "Audio xLSTMs: Learning Self-supervised audio representations with xLSTMs" in PyTorch☆18Updated 3 weeks ago
- Thin wrapper around GGML to make life easier☆29Updated this week
- Acoustic Neighbor Embeddings☆22Updated 5 months ago
- Example of finetuning CLIP to identify plants.☆11Updated 10 months ago
- Open TTS models, built for streaming on the edge☆41Updated 2 months ago
- Open-source and reproducible benchmarks for Speaker Diarization☆24Updated 3 weeks ago
- Generate audio datasets for training Text-To-Speech models, through smart audio splitting with silence detection, and transcription using…☆28Updated last year
- This library supports evaluating disparities in generated image quality, diversity, and consistency between geographic regions.☆20Updated 11 months ago
- ☆19Updated last month
- A list of language models with permissive licenses such as MIT or Apache 2.0☆24Updated 2 months ago
- ☆23Updated last year
- ANE accelerated embedding models!☆16Updated 5 months ago
- A python command-line tool to download & manage MLX AI models from Hugging Face.☆17Updated 8 months ago
- Profile your CoreML models directly from Python 🐍☆27Updated 7 months ago
- StyleTTS 2 Optimized Training Fork☆28Updated 3 months ago
- Find out why your CoreML model isn't running on the Neural Engine!☆25Updated 10 months ago
- Minimal, clean code implementation of RAG with mlx using gguf model weights☆50Updated last year
- A lightweight Python library for running TTS models with a unified API.☆18Updated 2 months ago
- convert a saved pytorch model to gguf and generate as much corresponding ggml c code as possible☆14Updated last year
- implementation of https://arxiv.org/pdf/2312.09299☆20Updated 10 months ago
- BUD-E (Buddy) is an open-source voice assistant framework that facilitates seamless interaction with AI models and APIs, enabling the cre…☆19Updated 7 months ago
- Legible, Scalable, Reproducible Foundation Models with Named Tensors and Jax☆14Updated 11 months ago
- Hanasu is a human-like TTS model based on the multilingual Himitsu V1 transformer-based encoder and VITS architecture☆28Updated last month
- mlx image models for Apple Silicon machines☆78Updated last month
- Repository for "TESS-2: A Large-Scale, Generalist Diffusion Language Model"☆34Updated 2 months ago
- Supervoice diffusion enhance☆26Updated 10 months ago