ANTONIOPSD / CaptionIMGLinks
Simple program to manually caption your images (or any other file types) so you can use them for AI training
☆37Updated 2 years ago
Alternatives and similar repositories for CaptionIMG
Users that are interested in CaptionIMG are comparing it to the libraries listed below
Sorting:
- Cerule - A Tiny Mighty Vision Model☆66Updated 9 months ago
- ☆15Updated last year
- ☆63Updated 9 months ago
- ☆30Updated 8 months ago
- A streamlined implementation of Grounding DINO and SAM for advanced image segmentation. This lightweight solution simplifies the integrat…☆64Updated 8 months ago
- A tool to assist in the interpretation of learned features in sparse autoencoders (in particular the four SAE's trained by Joseph Bloom o…☆19Updated 8 months ago
- Recaption large (Web)Datasets with vllm and save the artifacts.☆52Updated 7 months ago
- Retrieve the source code for any model made available on replicate.com!☆34Updated last year
- ☆28Updated 10 months ago
- Gradio UI for a Cog API☆68Updated last year
- Testing and evaluating the capabilities of Vision-Language models (PaliGemma) in performing computer vision tasks such as object detectio…☆81Updated last year
- Official PyTorch implementation of TokenSet.☆121Updated 3 months ago
- ☆31Updated last year
- Unofficial implementation and experiments related to Set-of-Mark (SoM) 👁️☆86Updated last year
- (AAAI'25) Training-and-pormpt Free General Painterly Image Harmonization Using image-wise attention sharing☆58Updated 6 months ago
- ☆21Updated last year
- Image Generation API Server - Similar to https://text-generator.io but for images☆50Updated 2 weeks ago
- Lego for GRPO☆28Updated last month
- ☆11Updated last year
- look how they massacred my boy☆63Updated 8 months ago
- [WIP] AI Try-On plugin for Chrome☆27Updated last year
- an implementation of Self-Extend, to expand the context window via grouped attention☆119Updated last year
- The Benefits of a Concise Chain of Thought on Problem Solving in Large Language Models☆22Updated 7 months ago
- [WIP] Transformer to embed Danbooru labelsets☆13Updated last year
- Command-line script for inferencing from models such as WizardCoder☆26Updated last year
- ☆24Updated last year
- ClickDiffusion: Harnessing LLMs for Interactive Precise Image Editing☆69Updated last year
- ☆68Updated last year
- faster parallel inference of mochi-1 video generation model☆121Updated 4 months ago
- Lightweight package that tracks and summarizes code changes using LLMs (Large Language Models)☆34Updated 4 months ago