lucasnewman / f5-tts-mlx
Implementation of F5-TTS in MLX
☆329Updated 3 weeks ago
Related projects ⓘ
Alternatives and complementary repositories for f5-tts-mlx
- Interface for OuteTTS models.☆409Updated 2 weeks ago
- FastMLX is a high performance production ready API to host MLX models.☆219Updated this week
- Generate accurate transcripts using Apple's MLX framework☆324Updated last week
- 📋 NotebookMLX - An Open Source version of NotebookLM (Ported NotebookLlama)☆184Updated 3 weeks ago
- An extremely fast implementation of whisper optimized for Apple Silicon using MLX.☆585Updated 6 months ago
- On-device Inference of Diffusion Models for Apple Silicon☆510Updated 3 weeks ago
- Blazing fast whisper turbo for ASR (speech-to-text) tasks☆162Updated last month
- MLX-VLM is a package for running Vision LLMs locally on your Mac using MLX.☆498Updated this week
- ☆446Updated this week
- Local SRT/LLM/TTS Voicechat☆546Updated last month
- Phi-3.5 for Mac: Locally-run Vision and Language Models for Apple Silicon☆237Updated 2 months ago
- Joint speech-language model - respond directly to audio!☆356Updated 4 months ago
- Whisper with Medusa heads☆799Updated 3 weeks ago
- SiLLM simplifies the process of training and running Large Language Models (LLMs) on Apple Silicon by leveraging the MLX framework.☆226Updated this week
- An experiment in meeting transcription and diarization with just an LLM. Maybe I went a little overboard though☆323Updated 2 weeks ago
- Open source inference code for Rev's model☆335Updated last week
- ScribeWizard: Generate organized notes from audio using Groq, Whisper, and Llama3☆457Updated 2 months ago
- Inference code for the paper "Spirit-LM Interleaved Spoken and Written Language Model".☆781Updated 3 weeks ago
- podcastfy.ai gradio demo app☆312Updated last week
- Local voice chatbot for engaging conversations, powered by Ollama, Hugging Face Transformers, and Coqui TTS Toolkit☆718Updated 3 months ago
- A fast multimodal LLM for real-time voice☆1,366Updated this week
- ☆278Updated 5 months ago
- Omni SenseVoice: High-Speed Speech Recognition with words timestamps 🗣️🎯☆737Updated 2 weeks ago
- Python tools for WhisperKit: Model conversion, optimization and evaluation☆171Updated 2 weeks ago
- LLM, MultiModal, and Agent tools for ComfyUI☆316Updated 3 months ago
- first base model for full-duplex conversational audio☆1,574Updated last week
- ☆286Updated last month
- The AI assistant for computer control.☆265Updated 2 months ago
- ☆170Updated 3 months ago
- AI Device Template Featuring Whisper, TTS, Groq, Llama3, OpenAI and more☆281Updated 4 months ago