simonw / blip-caption
Generate captions for images with Salesforce BLIP
☆122Updated 9 months ago
Alternatives and similar repositories for blip-caption:
Users that are interested in blip-caption are comparing it to the libraries listed below
- Embedding models from Jina AI☆59Updated last year
- LLM plugin for models hosted on Replicate☆62Updated last year
- The repository provides code for training the SegmentAnything Model (SAM) for predicting frame polygons in comic books☆50Updated last year
- Simple script that reads an image and dumps the text it reads using a vision model and KobolodCPP☆56Updated last week
- The LLM plugins directory☆42Updated last year
- Use Datasette to explore LAION improved_aesthetics_6plus training data used by Stable DIffusion☆58Updated last year
- Some tough questions to test new models.☆27Updated last year
- Wikidata's QRank as a SQLite DB.☆28Updated last year
- Run Latent Consistency Models on your Mac☆198Updated last year
- Plugin for LLM adding support for Anthropic's Claude models☆37Updated 5 months ago
- Using Datasette and CLIP embeddings to find similar faucets.☆22Updated last year
- ☆27Updated 7 months ago
- https://verdad.app☆82Updated 4 months ago
- A simple package to use CLIP on apple silicon using the MLX libraries from Apple☆69Updated last year
- Hubcap is an autonomous AI agent in 25 lines of code: a small Autobot that you can't trust. *This is the Python fork/port* from https://g…☆14Updated last year
- llm plugin for Cerebras fast inference API☆25Updated last month
- LLM plugin for embeddings using sentence-transformers☆59Updated last week
- CLI for running files through AWS Textract☆54Updated last year
- Command-line tool for fetching JSON from paginated APIs☆67Updated last year
- LLM plugin for pulling content from Hacker News☆95Updated 3 weeks ago
- Bulk image tagging using OpenAI GPT-4 Vision☆70Updated 9 months ago
- What if an HNSW index was just a file, and you could serve it from a CDN, and search it directly in the browser?☆98Updated 3 weeks ago
- A Next.js app for fast image generation with Flux on Replicate☆107Updated 6 months ago
- A copy of ONNX models, datasets, and code all in one GitHub repository. Follow the README to learn more.☆105Updated last year
- Tools for running OCR against files stored in S3☆119Updated 2 years ago
- A web-app to explore topics using LLM (less typing and more clicks)☆66Updated last year
- ☆92Updated 2 years ago
- Make tool-calling schemas for existing tools☆14Updated last month
- Perform OCR upon entire videos to look for credentials or similar.☆40Updated 2 years ago
- CLI tool for running text through OpenAI Text to speech☆165Updated last year