ANTONIOPSD / CaptionIMG
Simple program to manually caption your images (or any other file types) so you can use them for AI training
☆37 · Updated 2 years ago
Alternatives and similar repositories for CaptionIMG
Users that are interested in CaptionIMG are comparing it to the libraries listed below
- Cerule - A Tiny Mighty Vision Model ☆67 · Updated last year
- Unofficial implementation and experiments related to Set-of-Mark (SoM) 👁️ ☆87 · Updated last year
- ☆63 · Updated last year
- Testing and evaluating the capabilities of Vision-Language models (PaliGemma) in performing computer vision tasks such as object detectio… ☆84 · Updated last year
- Gradio UI for a Cog API ☆69 · Updated last year
- Backtracing: Retrieving the Cause of the Query, EACL 2024 Long Paper, Findings. ☆91 · Updated last year
- ☆15 · Updated last year
- A streamlined implementation of Grounding DINO and SAM for advanced image segmentation. This lightweight solution simplifies the integrat… ☆64 · Updated last year
- A tool to assist in the interpretation of learned features in sparse autoencoders (in particular the four SAE's trained by Joseph Bloom o… ☆19 · Updated last year
- BH hackathon ☆13 · Updated last year
- ☆70 · Updated last year
- Problem-Oriented Segmentation and Retrieval EMNLP 2024 Findings ☆34 · Updated 11 months ago
- ☆30 · Updated last year
- GRDN.AI app for garden optimization ☆70 · Updated last year
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna ☆55 · Updated 8 months ago
- ☆50 · Updated last year
- Maya: An Instruction Finetuned Multilingual Multimodal Model using Aya ☆116 · Updated 2 months ago
- Image Generation API Server - Similar to https://text-generator.io but for images ☆51 · Updated last month
- Using modal.com to process FineWeb-edu data ☆20 · Updated 6 months ago
- An alternative way to calculate self-attention ☆18 · Updated last year
- Retrieve the source code for any model made available on replicate.com! ☆34 · Updated last year
- look how they massacred my boy ☆63 · Updated last year
- run paligemma in real time ☆133 · Updated last year
- Video+code lecture on building nanoGPT from scratch ☆68 · Updated last year
- The Benefits of a Concise Chain of Thought on Problem Solving in Large Language Models ☆22 · Updated 10 months ago
- [NeurIPS XAIA & Springer] Code and notebooks to paper "A Fresh Look at Sanity Checks for Saliency Maps" ☆25 · Updated last year
- A new benchmark for measuring an LLM's capability to detect bugs in large codebases. ☆32 · Updated last year
- Arxflix turns your boring Arxiv research paper into a captivating video. ☆55 · Updated 3 weeks ago
- XmodelLM ☆39 · Updated 10 months ago
- an implementation of Self-Extend, to expand the context window via grouped attention ☆118 · Updated last year