nocaps-org / image-feature-extractorsView external linksLinks
Feature extraction and visualization scripts for nocaps baselines.
☆18Jan 22, 2021Updated 5 years ago
Alternatives and similar repositories for image-feature-extractors
Users that are interested in image-feature-extractors are comparing it to the libraries listed below
Sorting:
- Novel Object Captioner - Captioning Images with diverse objects☆42Nov 26, 2017Updated 8 years ago
- This repository provides the dataset introduced by our WSSTG paper☆13Jul 21, 2019Updated 6 years ago
- a parody of the ever-increasing amount of papers that appear on arXiv☆37Jan 6, 2025Updated last year
- Dataset and models for paper "Game-Based Video-Context Dialogue (EMNLP 2018)"☆19Oct 25, 2018Updated 7 years ago
- Pytorch version of VidLanKD: Improving Language Understanding viaVideo-Distilled Knowledge Transfer (NeurIPS 2021))☆56Feb 6, 2023Updated 3 years ago
- Official code for the paper "Contrast and Classify: Training Robust VQA Models" published at ICCV, 2021☆19Jul 27, 2021Updated 4 years ago
- ☆22Oct 9, 2021Updated 4 years ago
- Code for the CVPR 2020 paper 'Action Modifiers: Learning from Adverbs in Instructional Videos'☆23May 17, 2021Updated 4 years ago
- ☆54Dec 13, 2019Updated 6 years ago
- ☆11Nov 13, 2024Updated last year
- Website for TextVQA dataset.☆28Apr 30, 2023Updated 2 years ago
- Code for ICML 2019 paper "Probabilistic Neural-symbolic Models for Interpretable Visual Question Answering" [long-oral]☆67Aug 3, 2023Updated 2 years ago
- VisualEchoes Dataset (ECCV 2020)☆35Aug 31, 2021Updated 4 years ago
- Multitask Multilingual Multimodal Pre-training☆73Nov 27, 2022Updated 3 years ago
- ☆33Nov 12, 2018Updated 7 years ago
- Unpaired Image Captioning☆36Mar 25, 2021Updated 4 years ago
- CVPR 2021 Official Pytorch Code for UC2: Universal Cross-lingual Cross-modal Vision-and-Language Pre-training☆34Nov 9, 2021Updated 4 years ago
- Code for paper, "TL;DW? Summarizing Instructional Videos with Task Relevance & Cross-Modal Saliency" ECCV 2022☆39Feb 17, 2023Updated 2 years ago
- Partially Non-Autoregressive Image Captioning☆10Sep 30, 2021Updated 4 years ago
- A Google Chrome Extension that replaces the official New Tab page with a beautiful to-do list.☆12Mar 7, 2018Updated 7 years ago
- Code for Beyond Generic: Enhancing Image Captioning with Real-World Knowledge using Vision-Language Pre-Training Model☆13Feb 15, 2024Updated last year
- Generalization in Metric Learning: Should the Embedding Layer be the Embedding Layer?☆11Jan 3, 2019Updated 7 years ago
- Disable YubiKey output on MacOS without a modifier key pressed☆10Aug 10, 2022Updated 3 years ago
- Performant physics-focussed quantum circuit library built in Rust☆16Sep 4, 2025Updated 5 months ago
- ECG analysis to classify anterior myocardial infarction cases.☆10May 17, 2017Updated 8 years ago
- Surface segregation using Deep Reinforcement Learning☆13Aug 30, 2021Updated 4 years ago
- This repository contains the source code for TCBee, a TCP flow analysis tool recording packet headers and kernel metrics at up to 1.4 Mpp…☆13Feb 1, 2026Updated last week
- ☆10Nov 17, 2022Updated 3 years ago
- MQTTtimer is based mqtt protocol sync timer☆12Feb 6, 2023Updated 3 years ago
- Show, Edit and Tell: A Framework for Editing Image Captions, CVPR 2020☆81Jul 17, 2020Updated 5 years ago
- ☆13Aug 29, 2022Updated 3 years ago
- The source code and the data for ACL 2022 paper "Show Me More Details: Discovering Hierarchies of Procedures from Semi-structured Web Dat…☆14Apr 21, 2023Updated 2 years ago
- Unofficial Apache Arrow crate that aims to standardize stable hashing of structured data☆13Jan 16, 2026Updated 3 weeks ago
- A puzzle game that uses Real-Time Ray Tracing (RTX) for gameplay and rendering. Implemented in Vulkan 1.2 using VK_KHR_ray_tracing, based…☆12Dec 22, 2021Updated 4 years ago
- Mission: Decentralize the Internet☆11Aug 24, 2023Updated 2 years ago
- Code for the paper "Refining Language Model with Compositional Explanation" (NeurIPS 2021)☆12Oct 25, 2021Updated 4 years ago
- ☆10May 6, 2021Updated 4 years ago
- This repository contains code used for our Multi Sentence Inference NAACL'22 paper.☆12Mar 6, 2023Updated 2 years ago
- Moved: https://codeberg.org/koutheir/lm-sensors☆13Mar 27, 2024Updated last year