gradient-ai / Whisper-AutoCaption
☆93Updated last year
Alternatives and similar repositories for Whisper-AutoCaption:
Users that are interested in Whisper-AutoCaption are comparing it to the libraries listed below
- web based editor for subtitles and transcripts☆130Updated 8 months ago
- faster-whisper as serverless endpoint☆95Updated this week
- Chat to Compose Video☆185Updated last year
- An JS web client for connecting to Pipecat bots with voice and vision☆44Updated 4 months ago
- Script that takes any long form video or podcast and outputs clips for social media☆114Updated last year
- Generate visual podcasts about novels using open source models☆25Updated 2 years ago
- Shush is an app that deploys a WhisperV3 model with Flash Attention v2 on Modal and makes requests to it via a NextJS app☆203Updated 10 months ago
- Podalize: Podcast Transcription and Analysis☆155Updated 7 months ago
- Transcribe and summarize videos using whisper and llms on apple mlx framework☆74Updated last year
- A curated list of awesome OpenAI's Whisper☆101Updated last year
- Transcription with speaker diarization pipeline☆92Updated last year
- 💬 ASR FastAPI server using faster-whisper and Multi-Scale Auto-Tuning Spectral Clustering for diarization.☆208Updated 5 months ago
- This package is the Python implementation of Deepgram's WebVTT and SRT formatting. Given a transcription, this package can return a valid…☆19Updated 6 months ago
- Fast Audio/Video transcribe using Openai's Whisper and Modal, an hour audio/video file can be transcribed in ~1 minute☆78Updated last year
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines☆94Updated 11 months ago
- Powered by OpenAI Whisper & Gradio☆30Updated 2 years ago
- Langchain tools to search/extract/transcribe text transcripts of Youtube videos. Some of this has been integrated into LangChain main bra…☆69Updated last year
- The agentic video editing framework☆114Updated 2 months ago
- ☆204Updated 10 months ago
- chatbot framework that allows for the creation of highly customized models using structured prompts against the base text-davinci models.…☆31Updated last year
- ☆156Updated last year
- Python tools for WhisperKit: Model conversion, optimization and evaluation☆212Updated 3 months ago
- The fastest Whisper optimization for automatic speech recognition as a command-line interface ⚡️☆349Updated 10 months ago
- Open-source Rewind.ai clone written in Rust and Vue running 100% locally with whisper.cpp☆50Updated last year
- Falcon40B and 7B (Instruct) with streaming, top-k, and beam search☆40Updated last year
- Input a YouTube video link or upload a video file and get a video with subtitles.☆118Updated 8 months ago
- 🦙 Inference code for LLaMA models (modified for cpu)☆12Updated 2 years ago
- Example of calling OpenRouter from a Streamit app☆94Updated last year
- OpenAI Whisper + davinci for podcast summarization☆71Updated last year
- AI assistant that Intuitively Adapts to You☆82Updated last year