A simplified pytorch version of densecap
☆43Dec 11, 2024Updated last year
Alternatives and similar repositories for densecap-pytorch
Users that are interested in densecap-pytorch are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Implementation of the Object Relation Transformer for Image Captioning☆180Sep 17, 2024Updated last year
- ☆11Apr 10, 2024Updated last year
- Re-implement CVPR2017 paper: "dense captioning with joint inference and visual context" and minor changes in Tensorflow. (mAP 8.296 after…☆60Feb 21, 2019Updated 7 years ago
- A PyTorch implementation of the paper Multimodal Transformer with Multiview Visual Representation for Image Captioning☆25Sep 4, 2020Updated 5 years ago
- Dense captioning with joint inference and visual context☆53Dec 25, 2018Updated 7 years ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- A PyTorch reimplementation of bottom-up-attention models☆301Apr 7, 2022Updated 3 years ago
- [ICLR2023] Video Scene Graph Generation from Single-Frame Weak Supervision☆12Sep 17, 2023Updated 2 years ago
- GRiT: A Generative Region-to-text Transformer for Object Understanding (ECCV2024)☆341Jan 8, 2024Updated 2 years ago
- [ECCV 2020] Official code for "Comprehensive Image Captioning via Scene Graph Decomposition"☆99Aug 20, 2024Updated last year
- A Pytorch implementation of the paper 'Bottom-Up and Top-Down Attention for Image Captioning and Visual Question Answering'☆10Jan 20, 2020Updated 6 years ago
- ☆10Jun 1, 2019Updated 6 years ago
- Code for ICCV2021: Discovering Human Interactions with Large-Vocabulary Objects via Query and Multi-Scale Detection☆28Oct 12, 2021Updated 4 years ago
- Improving Visual Relation Detection using Depth Maps (ICPR 2020)☆47Jul 24, 2022Updated 3 years ago
- ☆218Feb 26, 2022Updated 4 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Video Captioning on MSR-VTT and MSVD dataset using Deep Learning☆21Aug 14, 2020Updated 5 years ago
- ☆30May 7, 2021Updated 4 years ago
- Official repository of paper "LOVE-R1: Advancing Long Video Understanding with Adaptive Zoom-in Mechanism via Multi-Step Reasoning"☆23Nov 1, 2025Updated 4 months ago
- ☆23Aug 18, 2018Updated 7 years ago
- [CVPR 2024 CVinW] Multi-Agent VQA: Exploring Multi-Agent Foundation Models on Zero-Shot Visual Question Answering☆20Sep 21, 2024Updated last year
- Code of Dense Relational Captioning☆69Feb 23, 2023Updated 3 years ago
- An updated PyTorch implementation of hengyuan-hu's version for 'Bottom-Up and Top-Down Attention for Image Captioning and Visual Question…☆35Mar 13, 2026Updated 2 weeks ago
- Elastic Workplace Search Official Python Client☆10Aug 8, 2024Updated last year
- CRNN with Self-Attention☆10Apr 8, 2018Updated 7 years ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- code for paper `MemCap: Memorizing Style Knowledge for Image Captioning`☆11Mar 17, 2020Updated 6 years ago
- Cross-Self KV Cache Pruning for Efficient Vision-Language Inference☆10Dec 15, 2024Updated last year
- ICCV 2021: A brand new hub for Scene Graph Generation methods based on MMdetection (2021). The pipeline of from detection, scene graph ge…☆63Oct 12, 2021Updated 4 years ago
- Image Chinese Description Generation Based on Multi-level Selective Visual Semantic Attributes☆16Nov 2, 2021Updated 4 years ago
- ☆35Oct 21, 2023Updated 2 years ago
- Deep Learning for Video Retrieval by Natural Language☆11Oct 20, 2019Updated 6 years ago
- ☆79Oct 8, 2022Updated 3 years ago
- Repository of our paper 'Refer-it-in-RGBD' in CVPR 2021☆42May 24, 2024Updated last year
- [ECCV 2022] Multimodal Transformer with Variable-length Memory for Vision-and-Language Navigation☆19Jul 18, 2022Updated 3 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- PyTorch implementation of video captioning☆13Sep 24, 2017Updated 8 years ago
- The official implementation of the paper **LVChat: Facilitating Long Video Comprehension**☆14Apr 15, 2024Updated last year
- Repository for ACL2020 paper "Refer360° A Referring Expression Recognition Dataset in 360°Images"☆13Jun 26, 2021Updated 4 years ago
- ☆13Apr 11, 2022Updated 3 years ago
- A GCN-based NER framework☆10Apr 19, 2019Updated 6 years ago
- A python wrapper for the Visual Genome API☆367Sep 21, 2023Updated 2 years ago
- “Open terminals”, “load CSVs”, “start hacking”☆16May 2, 2017Updated 8 years ago