Official code for the paper "Self-Distillation for Few-Shot Image Captioning"
☆18Mar 15, 2021Updated 5 years ago
Alternatives and similar repositories for SD-FSIC
Users that are interested in SD-FSIC are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Official code for the paper "Predicting Human Scanpaths in Visual Question Answering"☆26Mar 24, 2021Updated 5 years ago
- A PyTorch implementation of the paper Multimodal Transformer with Multiview Visual Representation for Image Captioning☆25Sep 4, 2020Updated 5 years ago
- This is tensorflow 2.2 based SCAMET framework for remote sensing image captioning.☆13Aug 10, 2023Updated 2 years ago
- A Diagnostic Dataset for Compositional Language and Elementary Visual Reasoning☆26Jan 20, 2022Updated 4 years ago
- A pytorch implementation of Attention Is All You Need (Transformer) for image captioning.☆12Nov 15, 2021Updated 4 years ago
- A Pytorch implementation of the paper 'Bottom-Up and Top-Down Attention for Image Captioning and Visual Question Answering'☆10Jan 20, 2020Updated 6 years ago
- ☆36Apr 14, 2021Updated 4 years ago
- The offical Pytorch code for "Continual Attentive Fusion for Incremental Learning in Semantic Segmentation"☆16Apr 8, 2022Updated 3 years ago
- ☆16Dec 7, 2022Updated 3 years ago
- Large-Scale Bidirectional Training for Zero-Shot Image Captioning☆21Feb 14, 2023Updated 3 years ago
- Entity-Aware and Motion-Aware Transformers for Language-driven Action Localization(IJCAI-22)☆12Oct 11, 2022Updated 3 years ago
- Paper implementation☆14Dec 17, 2020Updated 5 years ago
- a py3 lib for NLP & image-caption metrics : BLEU METEOR CIDEr ROUGE SPICE WMD☆14Sep 13, 2022Updated 3 years ago
- ☆15May 30, 2025Updated 9 months ago
- DomainPlus: Cross-Transform Domain Learning towards High Dynamic Range Imaging☆12Oct 11, 2022Updated 3 years ago
- ☆15Dec 13, 2022Updated 3 years ago
- Code for "bootstrap, review, decode: using out-of-domain textual data to improve image captioning"☆21Dec 26, 2016Updated 9 years ago
- Image Chinese Description Generation Based on Multi-level Selective Visual Semantic Attributes☆16Nov 2, 2021Updated 4 years ago
- [ACM MM 2022] MM_Pyramid: Multimodal Pyramid Attentional Network for Audio-Visual Event Localization and Video Parsing☆16Aug 26, 2022Updated 3 years ago
- ☆18Nov 23, 2022Updated 3 years ago
- ☆14Feb 18, 2023Updated 3 years ago
- ☆14Nov 13, 2023Updated 2 years ago
- The official Keras implementation of ACL 2020 paper "Pre-train and Plug-in: Flexible Conditional Text Generation with Variational Auto-En…☆48Nov 4, 2022Updated 3 years ago
- Poet: Product-oriented Video Captioner for E-commerce☆12Sep 21, 2020Updated 5 years ago
- Official repository of paper "LOVE-R1: Advancing Long Video Understanding with Adaptive Zoom-in Mechanism via Multi-Step Reasoning"☆22Nov 1, 2025Updated 4 months ago
- An official implementation for MS-DETR in ACL'23☆17Jun 3, 2023Updated 2 years ago
- Show, Edit and Tell: A Framework for Editing Image Captions, CVPR 2020☆82Jul 17, 2020Updated 5 years ago
- Pytorch Implementation of ECCV'22 paper: Video Activity Localisation with Uncertainties in Temporal Boundary☆17Jul 17, 2022Updated 3 years ago
- [ICCV 2025] Official Implementation of "Shot-by-Shot: Film-Grammar-Aware Training-Free Audio Description Generation". Junyu Xie, Tengda H…☆21Jul 26, 2025Updated 7 months ago
- SVGD implementation☆12Jul 23, 2018Updated 7 years ago
- Semi-supervised Semantic Segmentation on the ImageNet-S dataset☆21Mar 20, 2023Updated 3 years ago
- Official code for the paper "Image generation with shortest path diffusion" accepted at ICML 2023.☆24Jul 10, 2023Updated 2 years ago
- Code for paper "Attention on Attention for Image Captioning". ICCV 2019☆339May 2, 2021Updated 4 years ago
- PyTorch implementation of image captioning with adaptive attention mechanism.☆18Mar 23, 2019Updated 7 years ago
- [ACM MM 2022] This is the official implementation of "Temporal Sentiment Localization: Listen and Look in Untrimmed Videos"☆18Feb 14, 2025Updated last year
- 一个基于 sm.ms 图床 v2 API 的图片管理工具,除了普通的图片上传外还有在命令行下进行历史提交图片的管理,个人账户信息查看等功能。☆24Oct 14, 2019Updated 6 years ago
- The pytorch implementation on “Fine-Grained Image Captioning with Global-Local Discriminative Objective”☆21Oct 17, 2019Updated 6 years ago
- ☆17Aug 14, 2024Updated last year
- Implementation code of the work "Exploiting Multiple Sequence Lengths in Fast End to End Training for Image Captioning"☆96Dec 25, 2024Updated last year