zawawiAI / BLIP_CAMLinks
BLIP Live Image Captioning with Real-Time Video Stream This repository provides a Python-based implementation for real-time image captioning using the BLIP (Bootstrapped Language-Image Pretraining) model. The program captures live video from a webcam.
☆41Updated 10 months ago
Alternatives and similar repositories for BLIP_CAM
Users that are interested in BLIP_CAM are comparing it to the libraries listed below
Sorting:
- Text Behind Video. Enjoy it is completely free.☆31Updated 8 months ago
- Hector RAG is a modular RAG framework built on PostgreSQL, offering advanced retrieval methods and fusion techniques for AI-driven applic…☆59Updated 8 months ago
- ☆20Updated 2 months ago
- AI agent to automatically check grammar and spelling on documentation files☆93Updated 3 months ago
- IntelliJ Plugin that offers an infinite canvas to organize code bookmarks☆17Updated 5 months ago
- Talk to YouTube☆41Updated 2 years ago
- AI-powered text compression tool that condenses content while preserving meaning across multiple formats.☆22Updated last year
- Trim and timestamp audio, in the terminal☆14Updated last year
- Converts all website content into a text file for uploading to a custom GPT☆37Updated 9 months ago
- Swap your face in real-time☆75Updated 7 months ago
- A real-time, instant dictation desktop application built on Electron that uses Whisper and GROQ under the hood☆59Updated last year
- 🐝 Create powerful, collaborative AI applications.☆64Updated last year
- ☆101Updated 8 months ago
- Straighten up your workday | Posture Monitoring using AirPods Motion Sensors☆40Updated 5 months ago
- A tool for summarizing dialogues from videos or audio☆83Updated 2 years ago
- Flux and Stable Diffusion WebUI with MCP☆36Updated 5 months ago
- Turn any document into ready-to-use AI image prompts.☆54Updated 2 months ago
- rec-all: A Time Machine for the Everyday☆17Updated 11 months ago
- yt-chat is a tool designed to help you summarize any Youtube video.☆46Updated last year
- YouTube History Analyzer☆31Updated 5 months ago
- ☆20Updated 3 months ago
- A Lightweight Library for LLM I/O☆117Updated 6 months ago
- Organize and classify files based on their content using NLP☆70Updated last month
- An open-source alternative to automations and AI workers☆51Updated 4 months ago
- A browser-based tool for comparing and combining before/after images. No server needed, runs entirely in your browser.☆18Updated 9 months ago
- Deidentify people's names and gender specific pronouns☆43Updated 6 months ago
- Convolutional Neural Network that classifies voice clips as human or AI with 94% accuracy.☆66Updated 4 months ago
- LLaMa 3.2 Multimodal Web UI is a user-friendly interface for interacting with the Ollama platform.☆34Updated last year
- Converting Google Maps Screenshot to 3D Model☆21Updated 4 months ago
- LLMDog is a command-line tool that helps developers share code with Large Language Models like Claude and ChatGPT.☆77Updated 7 months ago