lucasnewman / f5-tts-mlxLinks
Implementation of F5-TTS in MLX
☆554Updated 3 months ago
Alternatives and similar repositories for f5-tts-mlx
Users that are interested in f5-tts-mlx are comparing it to the libraries listed below
Sorting:
- Interface for OuteTTS models.☆1,318Updated this week
- A Fast TTS Engine☆514Updated 5 months ago
- Open source inference code for Rev's model☆407Updated 2 months ago
- first base model for full-duplex conversational audio☆1,749Updated 5 months ago
- Blazing fast whisper turbo for ASR (speech-to-text) tasks☆211Updated 8 months ago
- Whisper with Medusa heads☆842Updated 3 weeks ago
- An extremely fast implementation of whisper optimized for Apple Silicon using MLX.☆730Updated last year
- Run Orpheus 3B Locally With LM Studio☆428Updated 3 months ago
- A real-time speech-to-speech chatbot powered by Whisper Small, Llama 3.2, and Kokoro-82M.☆229Updated 5 months ago
- Local SRT/LLM/TTS Voicechat☆692Updated 8 months ago
- An implementation of the CSM(Conversation Speech Model) for Apple Silicon using MLX.☆359Updated last month
- Verbatim Automatic Speech Recognition with improved word-level timestamps and filler detection☆748Updated 3 weeks ago
- FastMLX is a high performance production ready API to host MLX models.☆308Updated 3 months ago
- Examples for Cerebrium Serverless GPUs☆492Updated last week
- ☆480Updated last week
- Inference code for the paper "Spirit-LM Interleaved Spoken and Written Language Model".☆911Updated 7 months ago
- An implementation of the Nvidia's Parakeet models for Apple Silicon using MLX.☆259Updated this week
- MLX Omni Server is a local inference server powered by Apple's MLX framework, specifically designed for Apple Silicon (M-series) chips. I…☆424Updated 2 weeks ago
- G2P☆262Updated last month
- LLaSA: Scaling Train-time and Inference-time Compute for LLaMA-based Speech Synthesis☆577Updated 2 months ago
- On-device Image Generation for Apple Silicon☆626Updated 2 months ago
- Omni SenseVoice: High-Speed Speech Recognition with words timestamps 🗣️🎯☆851Updated 3 months ago
- Hibiki is a model for streaming speech translation (also known as simultaneous translation). Unlike offline translation—where one waits f…☆1,125Updated 2 months ago
- Joint speech-language model - respond directly to audio!☆369Updated 11 months ago
- ☆754Updated 2 months ago
- A CLI text-to-speech tool using the Kokoro model, supporting multiple languages, voices (with blending), and various input formats includ…☆512Updated 3 weeks ago
- 📋 NotebookMLX - An Open Source version of NotebookLM (Ported NotebookLlama)☆296Updated 3 months ago
- Python tools for WhisperKit: Model conversion, optimization and evaluation☆219Updated last week
- High-performance Text-to-Speech server with OpenAI-compatible API, 8 voices, emotion tags, and modern web UI. Optimized for RTX GPUs.☆440Updated 2 months ago
- Phi-3.5 for Mac: Locally-run Vision and Language Models for Apple Silicon☆268Updated 9 months ago