ANTONIOPSD / CaptionIMG
Simple program to manually caption your images (or any other file types) so you can use them for AI training
☆37Updated last year
Alternatives and similar repositories for CaptionIMG:
Users that are interested in CaptionIMG are comparing it to the libraries listed below
- Cerule - A Tiny Mighty Vision Model☆67Updated 4 months ago
- A tool to assist in the interpretation of learned features in sparse autoencoders (in particular the four SAE's trained by Joseph Bloom o…☆17Updated 3 months ago
- ☆30Updated 3 months ago
- A streamlined implementation of Grounding DINO and SAM for advanced image segmentation. This lightweight solution simplifies the integrat…☆63Updated 3 months ago
- Unofficial implementation and experiments related to Set-of-Mark (SoM) 👁️☆80Updated last year
- Maya: An Instruction Finetuned Multilingual Multimodal Model using Aya☆100Updated last week
- Recaption large (Web)Datasets with vllm and save the artifacts.☆44Updated last month
- Official repo for EMNLP 2023 paper "Explain-then-Translate: An Analysis on Improving Program Translation with Self-generated Explanations…☆27Updated last year
- ☆24Updated 7 months ago
- ☆62Updated 3 months ago
- (AAAI'25) Training-and-pormpt Free General Painterly Image Harmonization Using image-wise attention sharing☆54Updated last month
- Easy to use, High Performant Knowledge Distillation for LLMs☆38Updated last week
- Full finetuning of large language models without large memory requirements☆93Updated last year
- [WIP] Transformer to embed Danbooru labelsets☆13Updated 9 months ago
- Gradio UI for a Cog API☆65Updated 9 months ago
- Use the Moondream 2 model to detect faces and their gaze directions in videos.☆39Updated this week
- The code repository for the CURLoRA research paper. Stable LLM continual fine-tuning and catastrophic forgetting mitigation.☆41Updated 4 months ago
- Video+code lecture on building nanoGPT from scratch☆65Updated 7 months ago
- ☆51Updated 2 months ago
- ☆15Updated last year
- Focused on fast experimentation and simplicity☆64Updated 3 weeks ago
- ☆48Updated last year
- ☆26Updated 10 months ago
- working implimention of deepseek MLA☆23Updated last week
- faster parallel inference of mochi-1 video generation model☆107Updated 3 weeks ago
- AnyModal is a Flexible Multimodal Language Model Framework for PyTorch☆81Updated 3 weeks ago
- This repository contains the code for the paper: SirLLM: Streaming Infinite Retentive LLM☆56Updated 7 months ago
- Image Generation API Server - Similar to https://text-generator.io but for images☆49Updated last month
- ☆27Updated 5 months ago
- Command-line script for inferencing from models such as WizardCoder☆26Updated last year