RayFernando1337 / MLX-Auto-Subtitled-Video-Generator
Generate accurate transcripts using Apple's MLX framework
☆443 · Updated 7 months ago
Alternatives and similar repositories for MLX-Auto-Subtitled-Video-Generator
Users interested in MLX-Auto-Subtitled-Video-Generator are comparing it to the libraries listed below.
- 🤖✨ChatMLX is a modern, open-source, high-performance chat application for MacOS based on large language models. ☆813 · Updated 8 months ago
- A powerful Python tool that leverages Claude 3.5 Sonnet Vision API to detect and visualize objects in images. The script automatically dr… ☆218 · Updated last year
- podcastfy.ai gradio demo app ☆334 · Updated last year
- AI Device Template Featuring Whisper, TTS, Groq, Llama3, OpenAI and more ☆293 · Updated last year
- NotebookMLX - An Open Source version of NotebookLM (Ported NotebookLlama) ☆324 · Updated 8 months ago
- Use OpenAI's realtime API for chatting with your documents ☆331 · Updated last year
- ScribeWizard: Generate organized notes from audio using Groq, Whisper, and Llama3 ☆500 · Updated 3 months ago
- Realtime API with Firecrawl Tool - Forked from the OpenAI Realtime Console ☆160 · Updated last year
- The AI assistant for computer control. ☆321 · Updated last year
- Gemini Multimodal Live + WebRTC in a single `app.ts` ☆211 · Updated last month
- Open-source browser extension that leverages the power of AI to generate engaging replies for social media growth. ☆240 · Updated last year
- Blazing fast whisper turbo for ASR (speech-to-text) tasks ☆218 · Updated 2 weeks ago
- ☆285 · Updated last year
- The easiest way to run the fastest MLX-based LLMs locally ☆306 · Updated last year
- Implementation of F5-TTS in MLX ☆596 · Updated 8 months ago
- Real-Time Voice Inference Web SDK ☆291 · Updated last week
- Daily Bots Web Demo showcasing how to build real-time voice AI agents ☆246 · Updated 2 months ago
- SearchGPT / Perplexity Pages clone, but personalised for you. ☆245 · Updated last year
- ☆251 · Updated 10 months ago
- Building Blocks for Multi-Modal Gradio Powered by Groq Apps ☆115 · Updated last year
- An open-source implementation of NotebookLM using Deepseek-V3 and PlayHT TTS. ☆289 · Updated 10 months ago
- A NextJS/Langflow based app that takes a PDF and converts it into a podcast. ☆226 · Updated last year
- An experiment in meeting transcription and diarization with just an LLM. Maybe I went a little overboard though ☆562 · Updated last week
- Example UI implementing the RTVI web client ☆476 · Updated 11 months ago
- This repository contains the code for a virtual try-on application built using Flask, Twilio's WhatsApp API, and Gradio's virtual try-on … ☆348 · Updated last year
- Chat Application Starter Kit - Gemini Multimodal Live API + Pipecat ☆221 · Updated last month
- Filter X content using LLM API requests, configurable, based on Groq API ☆132 · Updated last year
- ☆545 · Updated 2 months ago
- AI Meeting Minutes analysis App built with NextJS, Langflow, Groq, and OpenAI ☆489 · Updated 11 months ago
- ☆814 · Updated last year