CLIPxGPT Captioner is Image Captioning Model based on OpenAI's CLIP and GPT-2.
☆118Feb 17, 2025Updated last year
Alternatives and similar repositories for clip-gpt-captioning
Users that are interested in clip-gpt-captioning are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [ICCV 2023] Going Beyond Nouns With Vision & Language Models Using Synthetic Data☆13Sep 30, 2023Updated 2 years ago
- Retrieval-augmented Image Captioning☆13Feb 16, 2023Updated 3 years ago
- Simple image captioning model☆1,417Jun 9, 2024Updated last year
- Combining ViT and GPT-2 for image captioning. Trained on MS-COCO. The model was implemented mostly from scratch.☆48Oct 2, 2023Updated 2 years ago
- Image Caption Generator implemented using Tensorflow and Keras in a Python Jupyter Notebook. The goal is to describe the content of an im…☆33Feb 17, 2021Updated 5 years ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- This repository contains the code and datasets for our ICCV-W paper 'Enhancing CLIP with GPT-4: Harnessing Visual Descriptions as Prompts…☆30Feb 21, 2024Updated 2 years ago
- ☆20May 3, 2025Updated last year
- finetune script for SDXL adapted from waifu-diffusion trainer☆11Aug 21, 2023Updated 2 years ago
- GRIT: Faster and Better Image-captioning Transformer (ECCV 2022)☆199May 9, 2023Updated 2 years ago
- CapDec: SOTA Zero Shot Image Captioning Using CLIP and GPT2, EMNLP 2022 (findings)☆205Jan 28, 2024Updated 2 years ago
- ☆22Mar 30, 2021Updated 5 years ago
- LLM-based character segmentation agent for ComfyUI based on SAM 3 and the SAM 3 Agent notebook☆26Dec 22, 2025Updated 4 months ago
- ☆12Jul 20, 2024Updated last year
- ☆46Oct 5, 2025Updated 7 months ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Conditional Wasserstein Generative Adversarial Network for image-to-image translation.☆25May 24, 2020Updated 5 years ago
- ☆12Nov 6, 2024Updated last year
- Some BMElib (Serre, Bogaert & Christakos) in Python☆42Apr 14, 2026Updated 3 weeks ago
- Simple repository for training small reasoning models☆50Feb 17, 2026Updated 2 months ago
- COMIC: This is the code repo of our TMM2019 work titled "COMIC: Towards a Compact Image Captioning Model with Attention".☆15Jun 22, 2021Updated 4 years ago
- Official Code for GazeGNN: A Gaze-guided Graph Neural Network for Chest X-ray Classification [WACV 2024]☆21Aug 25, 2023Updated 2 years ago
- An automatic MLLM hallucination detection framework☆19Sep 26, 2023Updated 2 years ago
- Segmentation of Satellite Images☆10May 3, 2024Updated 2 years ago
- Implementation of 'End-to-End Transformer Based Model for Image Captioning' [AAAI 2022]☆70Jun 1, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Experimenting with convolutional neural networks (CNN) to detect buildings in satellite imagery☆12Feb 25, 2018Updated 8 years ago
- Codebase for the paper HawkI: HawkI: Homography & Mutual Information Guidance for 3D-free Single Image to Aerial View☆13Jun 5, 2024Updated last year
- Solving UCF-101 with fastai2☆28Apr 12, 2023Updated 3 years ago
- PyTorch code for "Fine-grained Image Captioning with CLIP Reward" (Findings of NAACL 2022)☆246Jun 10, 2025Updated 10 months ago
- This repository contains code for CVPR 2019 paper "Efficient Video Classification Using Fewer Frames"☆20Mar 10, 2021Updated 5 years ago
- Satellite Imagery preprocessing for object detection☆14Aug 21, 2018Updated 7 years ago
- Some papers about *diverse* image (a few videos) captioning☆26Apr 4, 2023Updated 3 years ago
- This project is to identify buildings in satellite using Unet and masking method☆11Apr 12, 2026Updated 3 weeks ago
- Transformer & CNN Image Captioning model in PyTorch.☆44Mar 7, 2023Updated 3 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- A modular software data logger for oceanography.☆12Mar 21, 2026Updated last month
- [IJCV2025] https://arxiv.org/abs/2304.04521☆15Jan 22, 2025Updated last year
- Building Extraction based on FCN-4s☆11Aug 4, 2017Updated 8 years ago
- 鼠标动作录制回放工具☆14Sep 12, 2020Updated 5 years ago
- Medical image captioning using OpenAI's CLIP☆97Mar 7, 2023Updated 3 years ago
- Extracting Buildings from Aerial Images☆11Dec 6, 2019Updated 6 years ago
- ROS package for the turtlebot ** Forked**☆11Mar 29, 2013Updated 13 years ago