ANTONIOPSD / CaptionIMG
Simple program to manually caption your images (or any other file types) so you can use them for AI training
☆37 Updated 2 years ago
Alternatives and similar repositories for CaptionIMG
Users that are interested in CaptionIMG are comparing it to the libraries listed below
- Cerule - A Tiny Mighty Vision Model ☆67 Updated 11 months ago
- ☆63 Updated 11 months ago
- A tool to assist in the interpretation of learned features in sparse autoencoders (in particular the four SAE's trained by Joseph Bloom o… ☆19 Updated 10 months ago
- Backtracing: Retrieving the Cause of the Query, EACL 2024 Long Paper, Findings. ☆90 Updated last year
- A streamlined implementation of Grounding DINO and SAM for advanced image segmentation. This lightweight solution simplifies the integrat… ☆64 Updated 11 months ago
- Gradio UI for a Cog API ☆69 Updated last year
- ☆50 Updated last year
- (AAAI'25) Training-and-prompt Free General Painterly Image Harmonization Using image-wise attention sharing ☆59 Updated 8 months ago
- ☆15 Updated last year
- [NeurIPS XAIA & Springer] Code and notebooks to paper "A Fresh Look at Sanity Checks for Saliency Maps" ☆25 Updated last year
- Maya: An Instruction Finetuned Multilingual Multimodal Model using Aya ☆116 Updated 3 weeks ago
- ☆30 Updated 10 months ago
- Retrieve the source code for any model made available on replicate.com! ☆35 Updated last year
- Recaption large (Web)Datasets with vllm and save the artifacts. ☆52 Updated 9 months ago
- ☆69 Updated last year
- Simplex Random Feature attention, in PyTorch ☆74 Updated last year
- I learn about and explain quantization ☆26 Updated last year
- Unofficial implementation and experiments related to Set-of-Mark (SoM) 👁️ ☆88 Updated last year
- [NeurIPS VLM workshop 2024] In-Context Ensemble Learning from Pseudo Labels Improves Video-Language Models for Low-Level Workflow Underst… ☆23 Updated 5 months ago
- ☆26 Updated last year
- The Benefits of a Concise Chain of Thought on Problem Solving in Large Language Models ☆22 Updated 9 months ago
- Testing and evaluating the capabilities of Vision-Language models (PaliGemma) in performing computer vision tasks such as object detectio… ☆84 Updated last year
- Problem-Oriented Segmentation and Retrieval EMNLP 2024 Findings ☆34 Updated 9 months ago
- Image Generation API Server - Similar to https://text-generator.io but for images ☆53 Updated this week
- look how they massacred my boy ☆64 Updated 10 months ago
- This repository implements the idea of "caption upsampling" from DALL-E 3 with Zephyr-7B and gathers results with SDXL. ☆156 Updated last year
- BH hackathon ☆14 Updated last year
- inference code for mixtral-8x7b-32kseqlen ☆101 Updated last year
- Extract information, summarize, ask questions, and search videos using OpenAI's Vision API 🚀🎦 ☆63 Updated last year
- Neural network for creating distortion while keeping embeddings as close as possible ☆20 Updated last year