RayFernando1337 / MLX-Auto-Subtitled-Video-GeneratorLinks
Generate accurate transcripts using Apple's MLX framework
☆430Updated 3 months ago
Alternatives and similar repositories for MLX-Auto-Subtitled-Video-Generator
Users that are interested in MLX-Auto-Subtitled-Video-Generator are comparing it to the libraries listed below
Sorting:
- A powerful Python tool that leverages Claude 3.5 Sonnet Vision API to detect and visualize objects in images. The script automatically dr…☆209Updated 8 months ago
- podcastfy.ai gradio demo app☆335Updated 8 months ago
- 📋 NotebookMLX - An Open Source version of NotebookLM (Ported NotebookLlama)☆305Updated 4 months ago
- The easiest way to run the fastest MLX-based LLMs locally☆293Updated 9 months ago
- ☆287Updated last year
- Pipecat voice AI agents running locally on macOS☆75Updated this week
- 🤖✨ChatMLX is a modern, open-source, high-performance chat application for MacOS based on large language models.☆799Updated 4 months ago
- Blazing fast whisper turbo for ASR (speech-to-text) tasks☆212Updated 9 months ago
- Realtime API with Firecrawl Tool - Forked from the OpenAI Realtime Console☆159Updated 9 months ago
- Use OpenAI's realtime API for a chatting with your documents☆331Updated 9 months ago
- AI Device Template Featuring Whisper, TTS, Groq, Llama3, OpenAI and more☆293Updated last year
- Gemini Multimodal Live + WebRTC in a single `app.ts`☆210Updated 7 months ago
- ScribeWizard: Generate organized notes from audio using Groq, Whisper, and Llama3☆494Updated 6 months ago
- Real-Time Voice Inference Web SDK☆265Updated last week
- The AI assistant for computer control.☆318Updated 10 months ago
- Implementation of F5-TTS in MLX☆567Updated 4 months ago
- open-source browser extension that leverages the power of the AI to generate engaging replies for social media growth.☆239Updated last year
- Build realtime voice and video agents with Google's new Gemini 2.0 (API is free for now)☆297Updated 2 weeks ago
- Chat Application Starter Kit — Gemini Multimodal Live API + Pipecat☆206Updated 4 months ago
- An experiment in meeting transcription and diarization with just an LLM. Maybe I went a little overboard though☆555Updated 2 months ago
- ☆247Updated 6 months ago
- An implementation of the CSM(Conversation Speech Model) for Apple Silicon using MLX.☆367Updated 2 months ago
- An opensource implementation of NotebookLM using Deepseek-V3 and PlayHT TTS.☆283Updated 6 months ago
- ☆469Updated 2 weeks ago
- A NextJS/Langflow based app that takes a PDF and converts it into a podcast.☆221Updated 9 months ago
- Daily Bots Web Demo showcasing how to build real-time voice AI agents☆244Updated 9 months ago
- SearchGPT / Perplexity Pages clone, but personalised for you.☆244Updated 11 months ago
- ☆442Updated last month
- Get started with native image generation and editing using Gemini 2.0 and Next.js☆480Updated 2 months ago
- mind map generator☆72Updated 7 months ago