NovaMind-Z/PTSN

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/NovaMind-Z/PTSN)

NovaMind-Z / PTSN

Repository for an end-to-end image captioning method PTSN(ACM MM22).

☆60

Alternatives and similar repositories for PTSN

Users that are interested in PTSN are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

zchoi / PKOL
View on GitHub
[TIP 2022] Official code of paper “Video Question Answering with Prior Knowledge and Object-sensitive Learning”
☆46Jan 27, 2024Updated 2 years ago
zchoi / S2-Transformer
View on GitHub
[IJCAI 2022] Official Pytorch code for paper “S2 Transformer for Image Captioning”
☆86Aug 14, 2024Updated last year
xiaosu-zhu / McQuic
View on GitHub
Repository of CVPR'22 paper "Unified Multivariate Gaussian Mixture for Efficient Neural Image Compression"
☆119Aug 5, 2024Updated last year
ylhz / FlexAC
View on GitHub
Official implementation for the NeurIPS 2025 paper: "FlexAC: Towards Flexible Control of Associative Reasoning in Multimodal Large Langua…
☆20Apr 25, 2026Updated 3 months ago
232525 / PureT
View on GitHub
Implementation of 'End-to-End Transformer Based Model for Image Captioning' [AAAI 2022]
☆70Jun 1, 2024Updated 2 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
aimagelab / camel
View on GitHub
CaMEL: Mean Teacher Learning for Image Captioning. ICPR 2022
☆30Dec 1, 2022Updated 3 years ago
GT-RIPL / Xmodal-Ctx
View on GitHub
Official PyTorch implementation of our CVPR 2022 paper: Beyond a Pre-Trained Object Detector: Cross-Modal Textual and Visual Context for …
☆61Oct 21, 2022Updated 3 years ago
VL-Group / DPQ
View on GitHub
☆19Dec 16, 2020Updated 5 years ago
zchoi / VCRN
View on GitHub
☆11Jul 11, 2023Updated 3 years ago
quangvnai / grit
View on GitHub
GRIT: Faster and Better Image-captioning Transformer (ECCV 2022)
☆199May 9, 2023Updated 3 years ago
liujianzhi / EchoReel
View on GitHub
An innovative method designed to augment the capabilities of existing video diffusion models
☆22May 10, 2024Updated 2 years ago
nobody-1617 / DETA
View on GitHub
☆17Apr 5, 2023Updated 3 years ago
bladewaltz1 / ModeCap
View on GitHub
Controllable mage captioning model with unsupervised modes
☆21Apr 14, 2023Updated 3 years ago
kaipengfang / SimHum
View on GitHub
Official github repository for Sim-and-Human Co-training for Data-Efficient and Generalizable Robotic Manipulation.
☆33Mar 13, 2026Updated 4 months ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
feizc / PNAIC
View on GitHub
Partially Non-Autoregressive Image Captioning
☆10Sep 30, 2021Updated 4 years ago
aimagelab / PMA-Net
View on GitHub
[ICCV 2023] With a Little Help from your own Past: Prototypical Memory Networks for Image Captioning.
☆19Jun 7, 2024Updated 2 years ago
kaipengfang / ProS
View on GitHub
☆19Jul 22, 2024Updated 2 years ago
zchoi / SPT
View on GitHub
[TCSVT23] Official code for "SPT: Spatial Pyramid Transformer for Image Captioning".
☆10Aug 14, 2024Updated last year
ruffiann / MagicVFX
View on GitHub
MagicVFX: Visual Effects Synthesis in Just Minutes
☆18Dec 16, 2024Updated last year
SjokerLily / awesome-image-captioning
View on GitHub
A paper list of image captioning.
☆21Apr 23, 2022Updated 4 years ago
JDAI-CV / image-captioning
View on GitHub
Implementation of 'X-Linear Attention Networks for Image Captioning' [CVPR 2020]
☆273Jul 27, 2021Updated 4 years ago
VL-Group / Natural-Color-Fool
View on GitHub
This repository is the official implementation of [Natural Color Fool: Towards Boosting Black-box Unrestricted Attacks (NeurIPS'22)](http…
☆26Feb 13, 2023Updated 3 years ago
zhangxuying1004 / RSTNet
View on GitHub
Official Code for 'RSTNet: Captioning with Adaptive Attention on Visual and Non-Visual Words' (CVPR 2021)
☆123Dec 17, 2022Updated 3 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
xu-shitong / diffusion-image-captioning
View on GitHub
implementation of paper https://arxiv.org/abs/2210.04559
☆56Nov 26, 2025Updated 7 months ago
luo3300612 / image-captioning-DLCT
View on GitHub
Official pytorch implementation of paper "Dual-Level Collaborative Transformer for Image Captioning" (AAAI 2021).
☆203Jun 8, 2022Updated 4 years ago
xiaosu-zhu / Aurora-Weather
View on GitHub
Aurora Weather
☆24Dec 8, 2016Updated 9 years ago
husthuaan / AoANet
View on GitHub
Code for paper "Attention on Attention for Image Captioning". ICCV 2019
☆339May 2, 2021Updated 5 years ago
CUMTGG / CIIC
View on GitHub
☆18Sep 13, 2023Updated 2 years ago
Gitsamshi / WeakVRD-Captioning
View on GitHub
Implementation of paper "Improving Image Captioning with Better Use of Caption"
☆33Sep 15, 2020Updated 5 years ago
facebookresearch / grid-feats-vqa
View on GitHub
Grid features pre-training code for visual question answering
☆269Sep 17, 2021Updated 4 years ago
dhg-wei / DeCap
View on GitHub
ICLR 2023 DeCap: Decoding CLIP Latents for Zero-shot Captioning
☆144Mar 16, 2023Updated 3 years ago
buxiangzhiren / DDCap
View on GitHub
☆85Dec 4, 2022Updated 3 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
ylhz / tf_to_pytorch_model
View on GitHub
Convert tensorflow model to pytorch model via [MMdnn](https://github.com/microsoft/MMdnn) for adversarial attacks.
☆95Dec 1, 2022Updated 3 years ago
qilong-zhang / Targeted_Patch-wise-plusplus_iterative_attack
View on GitHub
The extension of "Patch-wise Attack for Fooling Deep Neural Network (ECCV2020)", and we aim to boost the success rates of targeted attack…
☆28Mar 14, 2022Updated 4 years ago
Junjue-Wang / CapFormer
View on GitHub
[IGARSS 2022] CapFormer: Pure transformer for remote sensing image caption
☆21Oct 6, 2022Updated 3 years ago
rucinfo-Tiffany / LDA_TopicModeling
View on GitHub
Latent dirichlet allocation using Sklearn
☆18Aug 6, 2018Updated 7 years ago
husthuaan / AAT
View on GitHub
Code for paper "Adaptively Aligned Image Captioning via Adaptive Attention Time". NeurIPS 2019
☆50Dec 18, 2019Updated 6 years ago
ayouboumani / image-captioning-with-attention
View on GitHub
A Pytorch implementation of the paper 'Bottom-Up and Top-Down Attention for Image Captioning and Visual Question Answering'
☆10Jan 20, 2020Updated 6 years ago
ylhz / Adversarial_Attacks_and_Defense_NeurIPS2022
View on GitHub
A list of papers in NeurIPS 2022 related to adversarial attack and defense / AI security.
☆77Dec 5, 2022Updated 3 years ago