CLIPxGPT Captioner is Image Captioning Model based on OpenAI's CLIP and GPT-2.
☆118Feb 17, 2025Updated last year
Alternatives and similar repositories for clip-gpt-captioning
Users that are interested in clip-gpt-captioning are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [ICCV 2023] Going Beyond Nouns With Vision & Language Models Using Synthetic Data☆13Sep 30, 2023Updated 2 years ago
- Simple image captioning model☆1,420Jun 9, 2024Updated 2 years ago
- Combining ViT and GPT-2 for image captioning. Trained on MS-COCO. The model was implemented mostly from scratch.☆49Oct 2, 2023Updated 2 years ago
- This repository contains the code and datasets for our ICCV-W paper 'Enhancing CLIP with GPT-4: Harnessing Visual Descriptions as Prompts…☆30Feb 21, 2024Updated 2 years ago
- 基于ClipCap的看图说话Image Caption模型☆324Apr 1, 2022Updated 4 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- ☆31May 26, 2025Updated last year
- ☆11May 5, 2024Updated 2 years ago
- finetune script for SDXL adapted from waifu-diffusion trainer☆11Aug 21, 2023Updated 2 years ago
- GRIT: Faster and Better Image-captioning Transformer (ECCV 2022)☆199May 9, 2023Updated 3 years ago
- ☆16Jul 17, 2025Updated 10 months ago
- CapDec: SOTA Zero Shot Image Captioning Using CLIP and GPT2, EMNLP 2022 (findings)☆206Jan 28, 2024Updated 2 years ago
- LLM-based character segmentation agent for ComfyUI based on SAM 3 and the SAM 3 Agent notebook☆26Dec 22, 2025Updated 5 months ago
- Data repository for the VALSE benchmark.☆38Feb 15, 2024Updated 2 years ago
- ☆12Jul 20, 2024Updated last year
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Create your own DALL-E application in Python with Streamlit.☆12Mar 9, 2023Updated 3 years ago
- ☆12Nov 6, 2024Updated last year
- COMIC: This is the code repo of our TMM2019 work titled "COMIC: Towards a Compact Image Captioning Model with Attention".☆15Jun 22, 2021Updated 4 years ago
- Official Code for GazeGNN: A Gaze-guided Graph Neural Network for Chest X-ray Classification [WACV 2024]☆21Aug 25, 2023Updated 2 years ago
- Simple repository for training small reasoning models☆52Feb 17, 2026Updated 3 months ago
- An automatic MLLM hallucination detection framework☆19Sep 26, 2023Updated 2 years ago
- Automatically estimator of camera distortion coefficient K1. Inspired by "A Hough Transform-based method for Radial Lens Distortion Corr…☆16Jul 4, 2019Updated 6 years ago
- This repository is for the paper "Is BERT Blind? Exploring the Effect of Vision-and-Language Pretraining on Visual Language Understanding…☆21Nov 2, 2023Updated 2 years ago
- Improving neural network representations using human similarity judgments☆13Nov 22, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Implementation of 'End-to-End Transformer Based Model for Image Captioning' [AAAI 2022]☆70Jun 1, 2024Updated 2 years ago
- Codebase for the paper HawkI: HawkI: Homography & Mutual Information Guidance for 3D-free Single Image to Aerial View☆13Jun 5, 2024Updated 2 years ago
- Solving UCF-101 with fastai2☆28Apr 12, 2023Updated 3 years ago
- PyTorch code for "Fine-grained Image Captioning with CLIP Reward" (Findings of NAACL 2022)☆246Jun 10, 2025Updated last year
- This repository contains code for CVPR 2019 paper "Efficient Video Classification Using Fewer Frames"☆20Mar 10, 2021Updated 5 years ago
- A library for general purpose Mathematical Morphology using the PyTorch framework. to enable GPU computation.☆28Feb 13, 2025Updated last year
- [ICCV 2025] The official pytorch implement of "LLaVA-SP: Enhancing Visual Representation with Visual Spatial Tokens for MLLMs".☆23Oct 28, 2025Updated 7 months ago
- Transformer & CNN Image Captioning model in PyTorch.☆45Mar 7, 2023Updated 3 years ago
- ICLR 2023 DeCap: Decoding CLIP Latents for Zero-shot Captioning☆143Mar 16, 2023Updated 3 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- [IJCV2025] https://arxiv.org/abs/2304.04521☆15Jan 22, 2025Updated last year
- Using a CNN-LSTM hybrid network to generate captions for images☆18Nov 19, 2019Updated 6 years ago
- Medical image captioning using OpenAI's CLIP☆98Mar 7, 2023Updated 3 years ago
- Some time series vectorization methods which could give better representation for classification / clustering or other analysis.☆11Jan 4, 2016Updated 10 years ago
- ☆16Mar 9, 2023Updated 3 years ago
- Source code for paper ''Wireless Point Cloud Transmission'' in SPAWC 2024.☆18May 29, 2025Updated last year
- Code and data for the COLING 2020 paper "Try to Substitute: An Unsupervised Chinese Word Sense Disambiguation Method Based on HowNet"☆14Dec 2, 2020Updated 5 years ago