ANTONIOPSD / CaptionIMG
Simple program to manually caption your images (or any other file types) so you can use them for AI training
☆37Updated last year
Related projects ⓘ
Alternatives and complementary repositories for CaptionIMG
- Cerule - A Tiny Mighty Vision Model☆67Updated 2 months ago
- Training-and-pormpt Free General Painterly Image Harmonization Using image-wise attention sharing☆52Updated 5 months ago
- Unofficial implementation and experiments related to Set-of-Mark (SoM) 👁️☆77Updated last year
- ☆30Updated last month
- Problem-Oriented Segmentation and Retrieval EMNLP 2024 Findings☆28Updated last week
- Framework agnostic computer vision inference. Run 1000+ models by changing only one line of code. Supports models from transformers, timm…☆119Updated this week
- Gradio UI for a Cog API☆64Updated 7 months ago
- ☆62Updated last month
- Official repo for EMNLP 2023 paper "Explain-then-Translate: An Analysis on Improving Program Translation with Self-generated Explanations…☆27Updated 11 months ago
- Fast approximate inference on a single GPU with sparsity aware offloading☆38Updated 10 months ago
- ☆25Updated 3 months ago
- The Benefits of a Concise Chain of Thought on Problem Solving in Large Language Models☆20Updated 9 months ago
- faster parallel inference of mochi-1 video generation model☆73Updated last week
- Synthetic data generator for image, video and 3D models☆29Updated 3 months ago
- Simplex Random Feature attention, in PyTorch☆71Updated last year
- AnyModal is a Flexible Multimodal Language Model Framework for PyTorch☆49Updated this week
- A streamlined implementation of Grounding DINO and SAM for advanced image segmentation. This lightweight solution simplifies the integrat…☆63Updated last month
- ☆59Updated 5 months ago
- Extract information, summarize, ask questions, and search videos using OpenAI's Vision API 🚀🎦☆61Updated last year
- Backtracing: Retrieving the Cause of the Query, EACL 2024 Long Paper, Findings.☆87Updated 4 months ago
- ☆48Updated last year
- Fast Real-time Object Detection with High-Res Output https://x.com/_akhaliq/status/1840213012818329826☆52Updated last month
- Using modal.com to process FineWeb-edu data☆19Updated 2 months ago
- This repository implements the idea of "caption upsampling" from DALL-E 3 with Zephyr-7B and gathers results with SDXL.☆153Updated last year
- look how they massacred my boy☆58Updated last month
- BH hackathon☆14Updated 7 months ago
- Open-source Python toolkit focused on deep learning with ordinal methodologies☆31Updated 3 weeks ago
- alternative way to calculating self attention☆18Updated 5 months ago
- Routing on Random Forest (RoRF)☆84Updated last month
- Testing and evaluating the capabilities of Vision-Language models (PaliGemma) in performing computer vision tasks such as object detectio…☆77Updated 5 months ago