CLIPxGPT Captioner is an image captioning model based on OpenAI's CLIP and GPT-2.
☆118Feb 17, 2025Updated last year
Alternatives and similar repositories for clip-gpt-captioning
Users that are interested in clip-gpt-captioning are comparing it to the libraries listed below.
- [ICCV 2023] Going Beyond Nouns With Vision & Language Models Using Synthetic Data☆13Sep 30, 2023Updated 2 years ago
- Retrieval-augmented Image Captioning☆13Feb 16, 2023Updated 3 years ago
- Simple image captioning model☆1,418Jun 9, 2024Updated last year
- This repository contains the code and datasets for our ICCV-W paper 'Enhancing CLIP with GPT-4: Harnessing Visual Descriptions as Prompts…☆30Feb 21, 2024Updated 2 years ago
- An opinionated NLP research template☆10Aug 29, 2024Updated last year
- ☆20May 3, 2025Updated last year
- GRIT: Faster and Better Image-captioning Transformer (ECCV 2022)☆199May 9, 2023Updated 3 years ago
- ☆16Jul 17, 2025Updated 10 months ago
- CapDec: SOTA Zero Shot Image Captioning Using CLIP and GPT2, EMNLP 2022 (findings)☆205Jan 28, 2024Updated 2 years ago
- ☆11Jan 24, 2022Updated 4 years ago
- LLM-based character segmentation agent for ComfyUI based on SAM 3 and the SAM 3 Agent notebook☆26Dec 22, 2025Updated 5 months ago
- Evaluating Visual Conversational Agents via Cooperative Human-AI Games☆23Nov 22, 2022Updated 3 years ago
- Data repository for the VALSE benchmark.☆38Feb 15, 2024Updated 2 years ago
- Create your own DALL-E application in Python with Streamlit.☆12Mar 9, 2023Updated 3 years ago
- Pytorch code for the paper "The color out of space: learning self-supervised representations for Earth Observation imagery"☆18Oct 26, 2021Updated 4 years ago
- An automatic MLLM hallucination detection framework☆19Sep 26, 2023Updated 2 years ago
- Automatic estimator of camera distortion coefficient K1. Inspired by "A Hough Transform-based method for Radial Lens Distortion Corr…☆16Jul 4, 2019Updated 6 years ago
- NICE challenge 2023 Track2 2nd result (total 4th) (CVPR 2023) sponsored by LG AI/Shutterstock/SNU☆11Jun 22, 2023Updated 2 years ago
- This repository is for the paper "Is BERT Blind? Exploring the Effect of Vision-and-Language Pretraining on Visual Language Understanding…☆21Nov 2, 2023Updated 2 years ago
- Implementation of 'End-to-End Transformer Based Model for Image Captioning' [AAAI 2022]☆70Jun 1, 2024Updated last year
- Code repository for "Improving Detection of Small Oriented Objects in Aerial Images", WACV 2023 MaCVi - Best Paper Award☆14May 6, 2023Updated 3 years ago
- This repository contains code for CVPR 2019 paper "Efficient Video Classification Using Fewer Frames"☆20Mar 10, 2021Updated 5 years ago
- PyTorch code for "Fine-grained Image Captioning with CLIP Reward" (Findings of NAACL 2022)☆246Jun 10, 2025Updated 11 months ago
- Some papers about *diverse* image (a few videos) captioning☆26Apr 4, 2023Updated 3 years ago
- A library for general-purpose Mathematical Morphology using the PyTorch framework to enable GPU computation.☆28Feb 13, 2025Updated last year
- Transformer & CNN Image Captioning model in PyTorch.☆44Mar 7, 2023Updated 3 years ago
- Official code for "Automated Scoring for Reading Comprehension via In-context BERT Tuning" (AIED 2022)☆13May 23, 2022Updated 4 years ago
- Source code for the paper: "Deep Segmentation of the Mandibular Canal: a New 3D Annotated Dataset of CBCT Volumes.", IEEE Access☆24Dec 5, 2022Updated 3 years ago
- ICLR 2023 DeCap: Decoding CLIP Latents for Zero-shot Captioning☆143Mar 16, 2023Updated 3 years ago
- [IJCV2025] https://arxiv.org/abs/2304.04521☆15Jan 22, 2025Updated last year
- Replace MLP with Kolmogorov-Arnold Network in CNN models and conduct experiments on CIFAR 10☆14May 8, 2024Updated 2 years ago
- Medical image captioning using OpenAI's CLIP☆98Mar 7, 2023Updated 3 years ago
- A Flask-based public opinion analysis system, including web crawler, visualization, data analysis, and sentiment analysis modules☆16Jul 30, 2023Updated 2 years ago
- ☆16Mar 9, 2023Updated 3 years ago
- Non-disruptive collagen characterization in clinical histopathology using cross-modality image synthesis☆10Apr 25, 2025Updated last year
- The torchosr module is a set of tools for Open Set Recognition in Python, compatible with PyTorch library.☆13Mar 11, 2025Updated last year
- ROS package for the turtlebot ** Forked**☆11Mar 29, 2013Updated 13 years ago
- A Framework for Symbolic MUsic Graph Explanations☆11Jul 30, 2025Updated 9 months ago
- ☆15Jul 25, 2025Updated 10 months ago