gradient-ai / Whisper-AutoCaption
☆93Updated last year
Alternatives and similar repositories for Whisper-AutoCaption
Users who are interested in Whisper-AutoCaption are comparing it to the libraries listed below.
- Shush is an app that deploys a WhisperV3 model with Flash Attention v2 on Modal and makes requests to it via a NextJS app☆203Updated 11 months ago
- Chat to Compose Video☆186Updated last year
- faster-whisper as serverless endpoint☆98Updated last week
- Chatbot framework that allows for the creation of highly customized models using structured prompts against the base text-davinci models.…☆31Updated last year
- Record a sample of your own voice and let AI narrate the text in your own voice.☆80Updated last year
- Web-based editor for subtitles and transcripts☆130Updated 9 months ago
- Fast audio/video transcription using OpenAI's Whisper and Modal; an hour-long audio/video file can be transcribed in ~1 minute☆79Updated last year
- Production-ready audio and video transcription app that can run on your laptop or in the cloud.☆72Updated last year
- A JS web client for connecting to Pipecat bots with voice and vision☆44Updated 4 months ago
- Generate visual podcasts about novels using open source models☆25Updated 2 years ago
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines☆94Updated last year
- Powered by OpenAI Whisper & Gradio☆30Updated 2 years ago
- ☆68Updated last year
- This package is the Python implementation of Deepgram's WebVTT and SRT formatting. Given a transcription, this package can return a valid…☆19Updated 7 months ago
- Input a YouTube video link or upload a video file and get a video with subtitles.☆119Updated 8 months ago
- Falcon40B and 7B (Instruct) with streaming, top-k, and beam search☆40Updated last year
- ☆72Updated 2 years ago
- Generate meaningful quotes from books, articles, or literally anything that can be turned into a PDF.☆33Updated 2 years ago
- LLM chatbot server with ChatGPT plugins☆38Updated 2 years ago
- Transcribe and summarize videos using Whisper and LLMs on the Apple MLX framework☆74Updated last year
- Podalize: Podcast Transcription and Analysis☆155Updated 8 months ago
- LangChain tools to search/extract/transcribe text transcripts of YouTube videos. Some of this has been integrated into LangChain main bra…☆69Updated last year
- A feed of trending repos/models from GitHub, Replicate, HuggingFace, and Reddit.☆133Updated 8 months ago
- The fastest Whisper optimization for automatic speech recognition as a command-line interface ⚡️☆350Updated 11 months ago
- ChatGPT-based voice Telegram bot☆19Updated 5 months ago
- Extract information, summarize, ask questions, and search videos using OpenAI's Vision API 🚀🎦☆62Updated last year
- Application to take in video urls and stream either transcripts or tokens.☆74Updated last year
- FastHTML app that makes other FastHTML apps with LLMs☆17Updated 8 months ago
- Uses LangChain to run semantic search over a chat conversation☆38Updated 2 years ago
- A backend API to perform search over Wikipedia using LangChain, Cohere and Weaviate☆105Updated 2 years ago