RayFernando1337 / MLX-Auto-Subtitled-Video-Generator
Generate accurate transcripts using Apple's MLX framework
β368Updated 2 months ago
Alternatives and similar repositories for MLX-Auto-Subtitled-Video-Generator:
Users that are interested in MLX-Auto-Subtitled-Video-Generator are comparing it to the libraries listed below
- Gemini Multimodal Live + WebRTC in a single `app.ts`β182Updated last month
- Use OpenAI's realtime API for a chatting with your documentsβ314Updated 4 months ago
- π NotebookMLX - An Open Source version of NotebookLM (Ported NotebookLlama)β237Updated 3 months ago
- π€β¨ChatMLX is a modern, open-source, high-performance chat application for MacOS based on large language models.β698Updated 3 months ago
- Realtime API with Firecrawl Tool - Forked from the OpenAI Realtime Consoleβ153Updated 4 months ago
- A Model Context Protocol server for converting almost anything to Markdownβ137Updated 3 weeks ago
- β249Updated last week
- podcastfy.ai gradio demo appβ327Updated 2 months ago
- MLX Omni Server is a local inference server powered by Apple's MLX framework, specifically designed for Apple Silicon (M-series) chips. Iβ¦β254Updated this week
- Blazing fast whisper turbo for ASR (speech-to-text) tasksβ193Updated 4 months ago
- A powerful Python tool that leverages Claude 3.5 Sonnet Vision API to detect and visualize objects in images. The script automatically drβ¦β189Updated 3 months ago
- AI Device Template Featuring Whisper, TTS, Groq, Llama3, OpenAI and moreβ286Updated 6 months ago
- Building Blocks for Multi-Modal Gradio Powered by Groq Appsβ102Updated 3 months ago
- An experiment in meeting transcription and diarization with just an LLM. Maybe I went a little overboard thoughβ393Updated last week
- An opensource implementation of NotebookLM using Deepseek-V3 and PlayHT TTS.β237Updated last month
- Example UI implementing the RTVI web clientβ474Updated 2 months ago
- open-source browser extension that leverages the power of the AI to generate engaging replies for social media growth.β229Updated 9 months ago
- Implementation of F5-TTS in MLXβ475Updated 2 weeks ago
- This repository contains the code for a virtual try-on application built using Flask, Twilio's WhatsApp API, and Gradio's virtual try-on β¦β332Updated 3 months ago
- The AI assistant for computer control.β298Updated 4 months ago
- Turn local files into a prompt for an LLMβ164Updated last month
- AI-powered dictation toolβ350Updated 2 months ago
- SearchGPT / Perplexity Pages clone, but personalised for you.β235Updated 5 months ago
- Generate descriptions from product images in multiple languages with AIβ272Updated 3 weeks ago
- Claude can perform Web Search | Exa with MCP (Model Context Protocol)β225Updated last month
- Daily Bots Web Demo showcasing how to build real-time voice AI agentsΒβ203Updated 3 months ago
- Build realtime voice and video agents with Google's new Gemini 2.0 (API is free for now)β255Updated 2 weeks ago
- β281Updated 8 months ago