An easy to use, user-friendly and efficient code for extracting OpenAI CLIP (Global/Grid) features from image and text respectively.
☆139Jan 1, 2025Updated last year
Alternatives and similar repositories for OpenAI-CLIP-Feature
Users that are interested in OpenAI-CLIP-Feature are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆16Dec 7, 2022Updated 3 years ago
- Some papers about *diverse* image (a few videos) captioning☆26Apr 4, 2023Updated 2 years ago
- Implementation of 'End-to-End Transformer Based Model for Image Captioning' [AAAI 2022]☆69Jun 1, 2024Updated last year
- ☆13Jun 2, 2023Updated 2 years ago
- GRIT: Faster and Better Image-captioning Transformer (ECCV 2022)☆198May 9, 2023Updated 2 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- This is tensorflow 2.2 based SCAMET framework for remote sensing image captioning.☆13Aug 10, 2023Updated 2 years ago
- Repository for an end-to-end image captioning method PTSN(ACM MM22).☆60Dec 11, 2022Updated 3 years ago
- [ICLR 2022] code for "How Much Can CLIP Benefit Vision-and-Language Tasks?" https://arxiv.org/abs/2107.06383☆420Oct 28, 2022Updated 3 years ago
- Controllable mage captioning model with unsupervised modes☆21Apr 14, 2023Updated 2 years ago
- [ACL 2023] Code and data for our paper "Measuring Progress in Fine-grained Vision-and-Language Understanding"☆13Jun 11, 2023Updated 2 years ago
- Official PyTorch implementation of our CVPR 2022 paper: Beyond a Pre-Trained Object Detector: Cross-Modal Textual and Visual Context for …☆61Oct 21, 2022Updated 3 years ago
- Deliberate Attention Networks for Image Captioning (AAAI 2019)☆11Sep 30, 2019Updated 6 years ago
- This is the official code for "Bootstrapping Interactive Image-Text Alignment for Remote Sensing Image Captioning"☆35Dec 24, 2024Updated last year
- [CVPR23] A cascaded diffusion captioning model with a novel semantic-conditional diffusion process that upgrades conventional diffusion m…☆67Jun 11, 2024Updated last year
- NordVPN Special Discount Offer • AdSave on top-rated NordVPN 1 or 2-year plans with secure browsing, privacy protection, and support for for all major platforms.
- 中科大跨模态智能组-每周论文分享☆16Nov 20, 2022Updated 3 years ago
- ☆20May 3, 2025Updated 10 months ago
- CaMEL: Mean Teacher Learning for Image Captioning. ICPR 2022☆29Dec 1, 2022Updated 3 years ago
- (ACL'2023) MultiCapCLIP: Auto-Encoding Prompts for Zero-Shot Multilingual Visual Captioning☆36Aug 8, 2024Updated last year
- Plotting heatmaps with the self-attention of the [CLS] tokens in the last layer.☆50May 11, 2022Updated 3 years ago
- The first unofficial implementation of CLIP4Caption: CLIP for Video Caption (ACMMM 2021)☆16Jan 2, 2023Updated 3 years ago
- Video Feature Extraction Code for EMNLP 2020 paper "HERO: Hierarchical Encoder for Video+Language Omni-representation Pre-training"☆117Jun 9, 2021Updated 4 years ago
- Source code for paper "ATP: AMRize Than Parse! Enhancing AMR Parsing with PseudoAMRs" @NAACL-2022☆15Mar 31, 2023Updated 3 years ago
- Code for the paper "Controllable Video Captioning with an Exemplar Sentence"☆12Apr 14, 2021Updated 4 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- The implementation of a paper entitled "Action Knowledge for Video Captioning with Graph Neural Networks" (JKSUCIS 2023).☆14Mar 29, 2023Updated 3 years ago
- [ICCV 2023] With a Little Help from your own Past: Prototypical Memory Networks for Image Captioning.☆19Jun 7, 2024Updated last year
- Implementation for CVPR 2022 paper " Injecting Semantic Concepts into End-to-End Image Captionin".☆43May 28, 2022Updated 3 years ago
- 🚀 Implementaton of SO-SLAM [unofficial]☆32May 10, 2023Updated 2 years ago
- Video captioning baseline models on Video2Commonsense Dataset.☆56Apr 15, 2021Updated 4 years ago
- Simple image captioning model☆1,414Jun 9, 2024Updated last year
- Source code for EMNLP 2022 paper “PEVL: Position-enhanced Pre-training and Prompt Tuning for Vision-language Models”☆49Nov 10, 2022Updated 3 years ago
- Official Implementation of Attentive Mask CLIP (ICCV2023, https://arxiv.org/abs/2212.08653)☆36May 29, 2024Updated last year
- Implementation of Self-conditioned Embedding Diffusion for Text Generation☆39Jan 6, 2023Updated 3 years ago
- NordVPN Special Discount Offer • AdSave on top-rated NordVPN 1 or 2-year plans with secure browsing, privacy protection, and support for for all major platforms.
- Code for paper "Image Captioning with End-to-End Attribute Detection and Subsequent Attributes Prediction". IEEE Transactions on Image Pr…☆26Mar 24, 2021Updated 5 years ago
- Python implementation of extraction of several visual features representations from videos☆23Jul 19, 2021Updated 4 years ago
- IJCAI2020: Learning to Discretely Compose Reasoning Module Networks for Video Captioning☆79Nov 23, 2020Updated 5 years ago
- This repository focus on Image Captioning & Video Captioning & Seq-to-Seq Learning & NLP☆411Nov 14, 2022Updated 3 years ago
- Official Code for "Knowing what it is: Semantic-enhanced Dual Attention Transformer" (TMM2022)☆19Oct 15, 2022Updated 3 years ago
- Code for Unsupervised Image Captioning☆223Mar 24, 2023Updated 3 years ago
- ☆11Sep 15, 2023Updated 2 years ago