zawawiAI / BLIP_CAM
BLIP Live Image Captioning with Real-Time Video Stream. This repository provides a Python-based implementation for real-time image captioning using the BLIP (Bootstrapping Language-Image Pre-training) model. The program captures live video from a webcam.
☆21 · Updated 2 weeks ago
Alternatives and similar repositories for BLIP_CAM:
Users who are interested in BLIP_CAM are comparing it to the repositories listed below:
- AI-powered text compression tool that condenses content while preserving meaning across multiple formats. ☆20 · Updated 3 months ago
- Talk to YouTube ☆41 · Updated last year
- Create powerful, collaborative AI applications. ☆63 · Updated 2 months ago
- Trim and timestamp audio, in the terminal ☆13 · Updated 3 months ago
- ☆9 · Updated last month
- This project utilizes the KlingAI API to provide a virtual try-on experience using images of people and garments. ☆26 · Updated 2 months ago
- A tool for summarizing dialogues from videos or audio ☆80 · Updated last year
- rec-all: A Time Machine for the Everyday ☆17 · Updated last month
- Converts all website content into a text file for uploading to a custom GPT ☆30 · Updated last week
- ☆34 · Updated 3 months ago
- Dialoqbase Lite is a Chrome extension that offers a web-based UI and a side panel, Copilot, designed specifically for almost all AI provi… ☆39 · Updated 7 months ago
- A starting take on a fast and fully local NLP file organizer that organizes files based on their content. ☆59 · Updated last month
- A demo of Claude computer use playing Minecraft ☆14 · Updated 2 months ago
- An open-source alternative to automations and AI workers ☆47 · Updated 3 months ago
- AI agent to automatically check grammar and spelling on documentation files ☆77 · Updated 3 months ago
- ☆52 · Updated 11 months ago
- A lightweight Python tool for semi-supervised segmentation of images. ☆26 · Updated 11 months ago
- yt-chat is a tool designed to help you summarize any YouTube video. ☆46 · Updated 7 months ago
- The fastest way to build apps in Python ☆73 · Updated 9 months ago
- simulai is a Notion-inspired open-source and free conversational survey builder, powered by AI. ☆98 · Updated 5 months ago
- A real-time, instant dictation desktop application built on Electron that uses Whisper and GROQ under the hood ☆48 · Updated 5 months ago
- Screen complete is a proof of concept universal screenshot-based text completion tool. ☆15 · Updated 3 months ago
- A Lightweight Library for LLM I/O ☆106 · Updated last week
- A Tiny Dall-E 3 UI for your homelab ☆40 · Updated 5 months ago
- Autoformatted file layout using sections (imports, constants, classes, functions). ☆57 · Updated 4 months ago
- A neural net to transform a video into audio in real time. ☆22 · Updated 2 years ago
- font-classify ☆70 · Updated 9 months ago
- WhisperClip simplifies your life by automatically transcribing audio recordings and saving the text directly to your clipboard. With just… ☆119 · Updated 7 months ago
- Turn your images into detailed and descriptive text prompts with AI ☆27 · Updated 7 months ago
- An interactive notebook for the terminal ☆29 · Updated 2 months ago