federicotorrielli / BetterWhisperX
Better WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
☆391Updated last week
Alternatives and similar repositories for BetterWhisperX:
Users that are interested in BetterWhisperX are comparing it to the libraries listed below
- A Fast TTS Engine☆405Updated last week
- An experiment in meeting transcription and diarization with just an LLM. Maybe I went a little overboard though☆368Updated 2 months ago
- Open source inference code for Rev's model☆361Updated this week
- Generate accurate transcripts using Apple's MLX framework☆354Updated last month
- Interface for OuteTTS models.☆859Updated this week
- Implementation of F5-TTS in MLX☆429Updated last week
- Use OpenAI's realtime API for a chatting with your documents☆302Updated 3 months ago
- Turn local files into a prompt for an LLM☆156Updated this week
- Examples for Cerebrium Serverless GPUs☆454Updated this week
- Parse PDFs into markdown using Vision LLMs☆194Updated last week
- ☆250Updated 4 months ago
- Gradio-powered application that converts audio recordings of meetings into transcripts and provides concise summaries using whisper.☆78Updated 3 months ago
- AI reads books: Page-by-Page PDF Knowledge Extractor & Summarizer. script performs an intelligent page-by-page analysis of PDF books, met…☆743Updated this week
- An opensource implementation of NotebookLM using Deepseek-V3 and PlayHT TTS.☆187Updated 2 weeks ago
- 📋 NotebookMLX - An Open Source version of NotebookLM (Ported NotebookLlama)☆222Updated 2 months ago
- On-premises conversational RAG with configurable containers☆293Updated this week
- Verbatim Automatic Speech Recognition with improved word-level timestamps and filler detection☆518Updated 3 weeks ago
- Omni SenseVoice: High-Speed Speech Recognition with words timestamps 🗣️🎯☆789Updated 3 weeks ago
- Local SRT/LLM/TTS Voicechat☆590Updated 3 months ago
- TTS with kokoro and onnx runtime☆953Updated this week
- AI-powered assistant to help you with your daily tasks, powered by Llama 3.2. It can recognize your voice, process natural language, and …☆466Updated last month
- podcastfy.ai gradio demo app☆326Updated last month
- A Gradio app that transcribes YouTube videos using audio extraction and OpenAI’s Whisper model.☆334Updated 3 months ago
- ⚡ Insanely fast AI voice assistant with <500ms response times☆357Updated last month
- ScribeWizard: Generate organized notes from audio using Groq, Whisper, and Llama3☆471Updated 4 months ago
- Gemini Multimodal Live + WebRTC in a single `app.ts`☆149Updated 3 weeks ago
- A powerful Python tool that leverages Claude 3.5 Sonnet Vision API to detect and visualize objects in images. The script automatically dr…☆186Updated 2 months ago
- ☆151Updated last month
- Trans Router☆149Updated this week
- SearchGPT / Perplexity Pages clone, but personalised for you.☆228Updated 4 months ago