A simplified pytorch version of densecap
☆43Dec 11, 2024Updated last year
Alternatives and similar repositories for densecap-pytorch
Users that are interested in densecap-pytorch are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Implementation of the Object Relation Transformer for Image Captioning☆180Sep 17, 2024Updated last year
- ☆12Apr 10, 2024Updated 2 years ago
- Official code repository for the EMNLP 2021 paper☆26Jan 30, 2022Updated 4 years ago
- Re-implement CVPR2017 paper: "dense captioning with joint inference and visual context" and minor changes in Tensorflow. (mAP 8.296 after…☆60Feb 21, 2019Updated 7 years ago
- A PyTorch implementation of the paper Multimodal Transformer with Multiview Visual Representation for Image Captioning☆25Sep 4, 2020Updated 5 years ago
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- A PyTorch reimplementation of bottom-up-attention models☆301Apr 7, 2022Updated 4 years ago
- [ICLR2023] Video Scene Graph Generation from Single-Frame Weak Supervision☆12Sep 17, 2023Updated 2 years ago
- GRiT: A Generative Region-to-text Transformer for Object Understanding (ECCV2024)☆341Jan 8, 2024Updated 2 years ago
- A weakly-supervised scene graph generation codebase. The implementation of our CVPR2021 paper ``Linguistic Structures as Weak Supervision…☆37Apr 25, 2021Updated 5 years ago
- [ECCV 2020] Official code for "Comprehensive Image Captioning via Scene Graph Decomposition"☆99Aug 20, 2024Updated last year
- A Pytorch implementation of the paper 'Bottom-Up and Top-Down Attention for Image Captioning and Visual Question Answering'☆10Jan 20, 2020Updated 6 years ago
- ☆10Jun 1, 2019Updated 7 years ago
- Code for ICCV2021: Discovering Human Interactions with Large-Vocabulary Objects via Query and Multi-Scale Detection☆28Oct 12, 2021Updated 4 years ago
- Improving Visual Relation Detection using Depth Maps (ICPR 2020)☆47Jul 24, 2022Updated 3 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- ☆21Oct 10, 2023Updated 2 years ago
- ☆218Feb 26, 2022Updated 4 years ago
- PyTorch implementation of Chinese image captioning on AI_challenger dataset☆13Sep 24, 2017Updated 8 years ago
- 实现风格与torchvision.models中的网络类似☆11May 30, 2021Updated 5 years ago
- Code of Dense Relational Captioning☆69Feb 23, 2023Updated 3 years ago
- An updated PyTorch implementation of hengyuan-hu's version for 'Bottom-Up and Top-Down Attention for Image Captioning and Visual Question…☆34Mar 13, 2026Updated 3 months ago
- Character-Preserving Coherent Story Visualization, ECCV 2020☆42Mar 12, 2021Updated 5 years ago
- CRNN with Self-Attention☆10Apr 8, 2018Updated 8 years ago
- Cross-Self KV Cache Pruning for Efficient Vision-Language Inference☆10Dec 15, 2024Updated last year
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ICCV 2021: A brand new hub for Scene Graph Generation methods based on MMdetection (2021). The pipeline of from detection, scene graph ge…☆65Oct 12, 2021Updated 4 years ago
- Image Chinese Description Generation Based on Multi-level Selective Visual Semantic Attributes☆16Nov 2, 2021Updated 4 years ago
- ☆35Oct 21, 2023Updated 2 years ago
- ☆79Oct 8, 2022Updated 3 years ago
- Commonsense Knowledge Base Reasoning☆10Sep 3, 2018Updated 7 years ago
- [ECCV 2022] Multimodal Transformer with Variable-length Memory for Vision-and-Language Navigation☆20Jul 18, 2022Updated 3 years ago
- PyTorch implementation of video captioning☆13Sep 24, 2017Updated 8 years ago
- The official implementation of the paper **LVChat: Facilitating Long Video Comprehension**☆14Apr 15, 2024Updated 2 years ago
- This GitHub provides the source code for the paper "Exploring Facial Expression and Action Units in Parkinson Disease"☆10Dec 21, 2022Updated 3 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Implementation of ConceptBert: Concept-Aware Representation for Visual Question Answering☆31Apr 30, 2024Updated 2 years ago
- Repository for ACL2020 paper "Refer360° A Referring Expression Recognition Dataset in 360°Images"☆14Jun 26, 2021Updated 4 years ago
- ☆28Jan 9, 2025Updated last year
- ☆15Mar 16, 2026Updated 3 months ago
- A GCN-based NER framework☆10Apr 19, 2019Updated 7 years ago
- Code & data for EMNLP 2020 paper "MOCHA: A Dataset for Training and Evaluating Reading Comprehension Metrics".☆16May 3, 2022Updated 4 years ago
- A python wrapper for the Visual Genome API☆370Sep 21, 2023Updated 2 years ago