ANTONIOPSD / CaptionIMG
Simple program to manually caption your images (or any other file type) so you can use them for AI training
☆37 Updated 2 years ago
Alternatives and similar repositories for CaptionIMG:
Users who are interested in CaptionIMG are comparing it to the repositories listed below
- Cerule - A Tiny Mighty Vision Model ☆67 Updated 7 months ago
- ☆30 Updated 6 months ago
- (AAAI'25) Training-and-prompt Free General Painterly Image Harmonization Using image-wise attention sharing ☆55 Updated 4 months ago
- A streamlined implementation of Grounding DINO and SAM for advanced image segmentation. This lightweight solution simplifies the integrat… ☆63 Updated 6 months ago
- A tool to assist in the interpretation of learned features in sparse autoencoders (in particular the four SAEs trained by Joseph Bloom o… ☆19 Updated 6 months ago
- ☆25 Updated 10 months ago
- ☆22 Updated last year
- Lightweight package that tracks and summarizes code changes using LLMs (Large Language Models) ☆33 Updated last month
- Gradio UI for a Cog API ☆67 Updated last year
- ☆11 Updated 10 months ago
- Recaption large (Web)Datasets with vllm and save the artifacts. ☆50 Updated 5 months ago
- [WIP] Transformer to embed Danbooru labelsets ☆13 Updated last year
- Official PyTorch implementation of TokenSet. ☆114 Updated last month
- Video+code lecture on building nanoGPT from scratch ☆65 Updated 10 months ago
- ☆16 Updated last year
- ClickDiffusion: Harnessing LLMs for Interactive Precise Image Editing ☆67 Updated 11 months ago
- ☆63 Updated 7 months ago
- Tokun to can tokens ☆16 Updated last week
- Focused on fast experimentation and simplicity ☆71 Updated 4 months ago
- ☆28 Updated 8 months ago
- AnyModal is a Flexible Multimodal Language Model Framework for PyTorch ☆91 Updated 4 months ago
- faster parallel inference of mochi-1 video generation model ☆117 Updated 2 months ago
- ☆15 Updated last year
- This repository implements the idea of "caption upsampling" from DALL-E 3 with Zephyr-7B and gathers results with SDXL. ☆152 Updated last year
- Using modal.com to process FineWeb-edu data ☆20 Updated 3 weeks ago
- ☆50 Updated last year
- Neural network for creating distortion while keeping embeddings as close as possible ☆20 Updated last year
- BH hackathon ☆14 Updated last year
- Testing and evaluating the capabilities of Vision-Language models (PaliGemma) in performing computer vision tasks such as object detectio… ☆80 Updated 10 months ago
- look how they massacred my boy ☆63 Updated 6 months ago