RayFernando1337 / MLX-Auto-Subtitled-Video-Generator

Generate accurate transcripts using Apple's MLX framework

☆430

Alternatives and similar repositories for MLX-Auto-Subtitled-Video-Generator

Users who are interested in MLX-Auto-Subtitled-Video-Generator are comparing it to the repositories listed below.

Doriandarko / Claude-Vision-Object-Detection
A powerful Python tool that leverages Claude 3.5 Sonnet Vision API to detect and visualize objects in images. The script automatically dr…
☆209 Updated 8 months ago
souzatharsis / podcastfy-demo
podcastfy.ai Gradio demo app
☆335 Updated 8 months ago
johnmai-dev / NotebookMLX
📋 NotebookMLX - An Open Source version of NotebookLM (Ported NotebookLlama)
☆305 Updated 4 months ago
PicoMLX / PicoMLXServer
The easiest way to run the fastest MLX-based LLMs locally
☆293 Updated 9 months ago
siddrrsh / ambientGPT
☆287 Updated last year
kwindla / macos-local-voice-agents
Pipecat voice AI agents running locally on macOS
☆75 Updated this week
johnmai-dev / ChatMLX
🤖✨ ChatMLX is a modern, open-source, high-performance chat application for macOS based on large language models.
☆799 Updated 4 months ago
JosefAlbers / whisper-turbo-mlx
Blazing-fast Whisper Turbo for ASR (speech-to-text) tasks
☆212 Updated 9 months ago
nickscamara / firecrawl-openai-realtime
Realtime API with Firecrawl Tool - Forked from the OpenAI Realtime Console
☆159 Updated 9 months ago
run-llama / voice-chat-pdf
Use OpenAI's Realtime API for chatting with your documents
☆331 Updated 9 months ago
developersdigest / ai-devices
AI Device Template Featuring Whisper, TTS, Groq, Llama3, OpenAI and more
☆293 Updated last year
pipecat-ai / gemini-webrtc-web-simple
Gemini Multimodal Live + WebRTC in a single `app.ts`
☆210 Updated 7 months ago
Bklieger / ScribeWizard
ScribeWizard: Generate organized notes from audio using Groq, Whisper, and Llama3
☆494 Updated 6 months ago
pipecat-ai / pipecat-client-web
Real-Time Voice Inference Web SDK
☆265 Updated last week
openinterpreter / 01-app
The AI assistant for computer control.
☆318 Updated 10 months ago
lucasnewman / f5-tts-mlx
Implementation of F5-TTS in MLX
☆567 Updated 4 months ago
jose-donato / ollama-reply
Open-source browser extension that leverages the power of AI to generate engaging replies for social media growth.
☆239 Updated last year
saharmor / gemini-multimodal-playground
Build realtime voice and video agents with Google's new Gemini 2.0 (API is free for now)
☆297 Updated 2 weeks ago
pipecat-ai / gemini-multimodal-live-demo
Chat Application Starter Kit — Gemini Multimodal Live API + Pipecat
☆206 Updated 4 months ago
SouthBridgeAI / offmute
An experiment in meeting transcription and diarization with just an LLM. Maybe I went a little overboard though
☆555 Updated 2 months ago
CognosysAI / browser
☆247 Updated 6 months ago
senstella / csm-mlx
An implementation of the CSM (Conversation Speech Model) for Apple Silicon using MLX.
☆367 Updated 2 months ago
satvik314 / opensource_notebooklm
An open-source implementation of NotebookLM using Deepseek-V3 and PlayHT TTS.
☆283 Updated 6 months ago
elevenlabs / elevenlabs-examples
☆469 Updated 2 weeks ago
misbahsy / Doc2Podcast
A NextJS/Langflow based app that takes a PDF and converts it into a podcast.
☆221 Updated 9 months ago
daily-demos / daily-bots-web-demo
Daily Bots Web Demo showcasing how to build real-time voice AI agents
☆244 Updated 9 months ago
alexfazio / OpenPlexity-Pages
SearchGPT / Perplexity Pages clone, but personalised for you.
☆244 Updated 11 months ago
mendableai / llmstxt-generator
☆442 Updated last month
google-gemini / gemini-image-editing-nextjs-quickstart
Get started with native image generation and editing using Gemini 2.0 and Next.js
☆480 Updated 2 months ago
ericciarla / mind-map
mind map generator
☆72 Updated 7 months ago