mbotsu / mlx_speech2textLinks
Audio transcription using mlx whisper and vad silence processing
☆17Updated last year
Alternatives and similar repositories for mlx_speech2text
Users that are interested in mlx_speech2text are comparing it to the libraries listed below
Sorting:
- Embarrassingly Easy Fully Non-Autoregressive Zero-Shot TTS (E2 TTS) in MLX☆29Updated last year
- cli tool to quantize gguf, gptq, awq, hqq and exl2 models☆78Updated last year
- Ultra-minimal autoregressive diffusion model for image generation☆21Updated 2 weeks ago
- Lightweight continuous batching OpenAI compatibility using HuggingFace Transformers include T5 and Whisper.☆29Updated 9 months ago
- A collection of optimizers for MLX☆54Updated last month
- All the world is a play, we are but actors in it.☆49Updated 5 months ago
- ☆22Updated last year
- A python command-line tool to download & manage MLX AI models from Hugging Face.☆19Updated last year
- Implementation of E2-TTS, "Embarrassingly Easy Fully Non-Autoregressive Zero-Shot TTS", in MLX☆20Updated last year
- Gradio based tool to run opensource LLM models directly from Huggingface☆96Updated last year
- Examples for using the SiLLM framework for training and running Large Language Models (LLMs) on Apple Silicon☆16Updated 8 months ago
- ☆19Updated last year
- Advanced Coding AI Assistant that uses a Gradio interface to stream coding related responses. ChatRAG supports local and API inference an…☆23Updated 8 months ago
- ☆61Updated 7 months ago
- MCP Server implementation for Claude☆26Updated last year
- Adding a multi-text multi-speaker script (diffe) that is based on a script from asiff00 on issue 61 for Sesame: A Conversational Speech G…☆25Updated 9 months ago
- This public GitHub repository contains code for a fully self-hosted, on-premise transcription solution.☆53Updated last year
- A highly optimized engine for neutts-air model to generate minutes of audio in seconds. Over 200x realtime on modern hardware!☆95Updated last month
- Yet Another (LLM) Web UI, made with Gemini☆12Updated last year
- Implementation of 'Vocos: Closing the gap between time-domain and Fourier-based neural vocoders for high-quality audio synthesis', in MLX☆23Updated last year
- A little file for doing LLM-assisted prompt expansion and image generation using Flux.schnell - complete with prompt history, prompt queu…☆26Updated last year
- ☆18Updated last year
- Very basic framework for composable parameterized large language model (Q)LoRA / (Q)Dora fine-tuning using mlx, mlx_lm, and OgbujiPT.☆43Updated 6 months ago
- BUD-E (Buddy) is an open-source voice assistant framework that facilitates seamless interaction with AI models and APIs, enabling the cre…☆23Updated last year
- Video+code lecture on building nanoGPT from scratch☆68Updated last year
- ☆15Updated 3 weeks ago
- Transplants vocabulary between language models, enabling the creation of draft models for speculative decoding WITHOUT retraining.☆47Updated 2 months ago
- ☆24Updated 11 months ago
- entropix style sampling + GUI☆27Updated last year
- Cog wrapper for collabora/WhisperSpeech☆25Updated last year