0x3bfc / Docker-ComposeLinks
Docker Compose Service Integration Files
☆15Updated 10 years ago
Alternatives and similar repositories for Docker-Compose
Users that are interested in Docker-Compose are comparing it to the libraries listed below
Sorting:
- Code for "bootstrap, review, decode: using out-of-domain textual data to improve image captioning"☆21Updated 9 years ago
- Using LLMs and pre-trained caption models for super-human performance on image captioning.☆42Updated 2 years ago
- Partially Non-Autoregressive Image Captioning☆10Updated 4 years ago
- Official PyTorch Implementation for CVPR'23 Paper, "The Dialog Must Go On: Improving Visual Dialog via Generative Self-Training"☆20Updated 2 years ago
- Official python implementation of R3-Transformer☆15Updated 5 years ago
- Official Implementation for CVPR 2023 paper "Divide and Conquer: Answering Questions with Object Factorization and Compositional Reasonin…☆10Updated last year
- ☆45Updated 2 years ago
- Code for the paper "Controllable Video Captioning with an Exemplar Sentence"☆12Updated 4 years ago
- ☆19Updated 3 years ago
- [EMNLP 2021] Code and data for our paper "Vision-and-Language or Vision-for-Language? On Cross-Modal Influence in Multimodal Transformers…☆20Updated 4 years ago
- Code for Beyond Generic: Enhancing Image Captioning with Real-World Knowledge using Vision-Language Pre-Training Model☆13Updated last year
- ☆13Updated 3 years ago
- [EMNLP 2024] IFCap: Image-like Retrieval and Frequency-based Entity Filtering for Zero-shot Captioning☆15Updated 8 months ago
- ☆13Updated 3 years ago
- Repo for ICCV 2021 paper: Beyond Question-Based Biases: Assessing Multimodal Shortcut Learning in Visual Question Answering☆28Updated last year
- Code and data for the project "Visually grounded continual learning of compositional semantics"☆22Updated 3 years ago
- ☆22Updated 3 weeks ago
- A Python toolkit for the OmniLabel benchmark providing code for evaluation and visualization☆23Updated last year
- The code and datasets of our ACM MM 2024 paper "Hallu-PI: Evaluating Hallucination in Multi-modal Large Language Models within Perturbed …☆11Updated last year
- ☆20Updated 9 months ago
- Official code for the paper "Contrast and Classify: Training Robust VQA Models" published at ICCV, 2021☆19Updated 4 years ago
- The Pytorch implementation for "Video-Text Pre-training with Learned Regions"☆42Updated 3 years ago
- The download link for the dataset LAD.☆41Updated 6 years ago
- Homework in the Algorithms: Design and Analysis, Part 1 course offered on Coursera☆38Updated 12 years ago
- Code for the CVPR 2020 oral paper: Weakly Supervised Visual Semantic Parsing☆33Updated 3 years ago
- Let there be clock in the beach - WACV 2022☆15Updated 4 years ago
- ☆15Updated 3 years ago
- Colorful Prompt Tuning for Pre-trained Vision-Language Models☆49Updated 3 years ago
- Pytorch version of DeCEMBERT: Learning from Noisy Instructional Videos via Dense Captions and Entropy Minimization (NAACL 2021)☆17Updated 3 years ago
- Perceptual Grouping in Contrastive Vision-Language Models (ICCV'23)☆37Updated 2 years ago