soloist97/densecap-pytorch

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/soloist97/densecap-pytorch)

soloist97 / densecap-pytorch

A simplified pytorch version of densecap

☆43

Alternatives and similar repositories for densecap-pytorch

Users that are interested in densecap-pytorch are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

yahoo / object_relation_transformer
View on GitHub
Implementation of the Object Relation Transformer for Image Captioning
☆180Sep 17, 2024Updated last year
saccharomycetes / visual_crop_zsvqa
View on GitHub
☆12Apr 10, 2024Updated 2 years ago
InnerPeace-Wu / densecap-tensorflow
View on GitHub
Re-implement CVPR2017 paper: "dense captioning with joint inference and visual context" and minor changes in Tensorflow. (mAP 8.296 after…
☆60Feb 21, 2019Updated 7 years ago
ayouboumani / image-captioning-with-attention
View on GitHub
A Pytorch implementation of the paper 'Bottom-Up and Top-Down Attention for Image Captioning and Visual Question Answering'
☆10Jan 20, 2020Updated 6 years ago
linjieyangsc / densecap
View on GitHub
Dense captioning with joint inference and visual context
☆52Dec 25, 2018Updated 7 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
MILVLG / bottom-up-attention.pytorch
View on GitHub
A PyTorch reimplementation of bottom-up-attention models
☆302Apr 7, 2022Updated 4 years ago
yekeren / WSSGG
View on GitHub
A weakly-supervised scene graph generation codebase. The implementation of our CVPR2021 paper ``Linguistic Structures as Weak Supervision…
☆37Apr 25, 2021Updated 5 years ago
zjucsq / PLA
View on GitHub
[ICLR2023] Video Scene Graph Generation from Single-Frame Weak Supervision
☆12Sep 17, 2023Updated 2 years ago
JialianW / GRiT
View on GitHub
GRiT: A Generative Region-to-text Transformer for Object Understanding (ECCV2024)
☆341Jan 8, 2024Updated 2 years ago
YiwuZhong / Sub-GC
View on GitHub
[ECCV 2020] Official code for "Comprehensive Image Captioning via Scene Graph Decomposition"
☆99Aug 20, 2024Updated last year
zhung2 / uvtranse
View on GitHub
☆10Jun 1, 2019Updated 7 years ago
Sina-Baharlou / Depth-VRD
View on GitHub
Improving Visual Relation Detection using Depth Maps (ICPR 2020)
☆47Jul 24, 2022Updated 4 years ago
archiki / RepARe
View on GitHub
☆21Oct 10, 2023Updated 2 years ago
yangxuntu / SGAE
View on GitHub
☆218Feb 26, 2022Updated 4 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
chenghuige / chinese_im2text.pytorch
View on GitHub
PyTorch implementation of Chinese image captioning on AI_challenger dataset
☆13Sep 24, 2017Updated 8 years ago
guanghuixu / AnchorCaptioner
View on GitHub
☆30May 7, 2021Updated 5 years ago
andyweizhao / Multitask_Image_Captioning
View on GitHub
☆23Aug 18, 2018Updated 7 years ago
Dong-JinKim / DenseRelationalCaptioning
View on GitHub
Code of Dense Relational Captioning
☆69Feb 23, 2023Updated 3 years ago
BierOne / bottom-up-attention-vqa
View on GitHub
An updated PyTorch implementation of hengyuan-hu's version for 'Bottom-Up and Top-Down Attention for Image Captioning and Visual Question…
☆34Mar 13, 2026Updated 4 months ago
wondervictor / AttendCRNN
View on GitHub
CRNN with Self-Attention
☆10Apr 8, 2018Updated 8 years ago
scwangdyd / large_vocabulary_hoi_detection
View on GitHub
Code for ICCV2021: Discovering Human Interactions with Large-Vocabulary Objects via Query and Multi-Scale Detection
☆28Oct 12, 2021Updated 4 years ago
JacobYuan7 / RLIP
View on GitHub
[NeurIPS 2022 Spotlight] RLIP: Relational Language-Image Pre-training and a series of other methods to solve HOI detection and Scene Grap…
☆78May 26, 2024Updated 2 years ago
entalent / MemCap
View on GitHub
code for paper `MemCap: Memorizing Style Knowledge for Image Captioning`
☆11Mar 17, 2020Updated 6 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
elastic / workplace-search-python
View on GitHub
Elastic Workplace Search Official Python Client
☆10Aug 8, 2024Updated last year
TerryPei / CSP
View on GitHub
Cross-Self KV Cache Pruning for Efficient Vision-Language Inference
☆10Dec 15, 2024Updated last year
li-xirong / video-retrieval
View on GitHub
Deep Learning for Video Retrieval by Natural Language
☆11Oct 20, 2019Updated 6 years ago
Vision-CAIR / LTVRR
View on GitHub
☆35Oct 21, 2023Updated 2 years ago
yangxuntu / lxmertcatt
View on GitHub
☆79Oct 8, 2022Updated 3 years ago
clin1223 / MTVM
View on GitHub
[ECCV 2022] Multimodal Transformer with Variable-length Memory for Vision-and-Language Navigation
☆19Jul 18, 2022Updated 4 years ago
HaolinLiu97 / Refer-it-in-RGBD
View on GitHub
Repository of our paper 'Refer-it-in-RGBD' in CVPR 2021
☆42May 24, 2024Updated 2 years ago
luisf-gomez / Explorer-FE-AU-in-PD
View on GitHub
This GitHub provides the source code for the paper "Exploring Facial Expression and Action Units in Parkinson Disease"
☆10Dec 21, 2022Updated 3 years ago
ThalesGroup / ConceptBERT
View on GitHub
Implementation of ConceptBert: Concept-Aware Representation for Visual Question Answering
☆31Apr 30, 2024Updated 2 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
volkancirik / refer360
View on GitHub
Repository for ACL2020 paper "Refer360° A Referring Expression Recognition Dataset in 360°Images"
☆15Jun 26, 2021Updated 5 years ago
QiuHeqian / CrossDet
View on GitHub
☆28Jan 9, 2025Updated last year
anthonywchen / MOCHA
View on GitHub
Code & data for EMNLP 2020 paper "MOCHA: A Dataset for Training and Evaluating Reading Comprehension Metrics".
☆16May 3, 2022Updated 4 years ago
zhijing-jin / GraphIE
View on GitHub
A GCN-based NER framework
☆10Apr 19, 2019Updated 7 years ago
csvconf / data-tables.csv
View on GitHub
“Open terminals”, “load CSVs”, “start hacking”
☆15May 2, 2017Updated 9 years ago
ranjaykrishna / visual_genome_python_driver
View on GitHub
A python wrapper for the Visual Genome API
☆371Sep 21, 2023Updated 2 years ago
forence / Awesome-Visual-Captioning
View on GitHub
This repository focus on Image Captioning & Video Captioning & Seq-to-Seq Learning & NLP
☆410Nov 14, 2022Updated 3 years ago