WuJie1010 / Fine-Grained-Image-CaptioningView external linksLinks
The pytorch implementation on “Fine-Grained Image Captioning with Global-Local Discriminative Objective”
☆21Oct 17, 2019Updated 6 years ago
Alternatives and similar repositories for Fine-Grained-Image-Captioning
Users that are interested in Fine-Grained-Image-Captioning are comparing it to the libraries listed below
Sorting:
- novel deep learning research works with PaddlePaddle☆10May 31, 2020Updated 5 years ago
- Tree-Structured Policy based Progressive Reinforcement Learning for Temporally Language Grounding in Video (AAAI2020)☆47Jan 22, 2020Updated 6 years ago
- ☆10May 10, 2019Updated 6 years ago
- Deliberate Attention Networks for Image Captioning (AAAI 2019)☆11Sep 30, 2019Updated 6 years ago
- A PyTorch implementation of the paper Multimodal Transformer with Multiview Visual Representation for Image Captioning☆25Sep 4, 2020Updated 5 years ago
- A Pytorch implemention for some state-of-the-art models for" Temporally Language Grounding in Untrimmed Videos"☆95Sep 21, 2019Updated 6 years ago
- A curated list of “Temporally Language Grounding” and related area☆110Nov 28, 2019Updated 6 years ago
- code for paper `MemCap: Memorizing Style Knowledge for Image Captioning`☆11Mar 17, 2020Updated 5 years ago
- Code for paper "Image Captioning with End-to-End Attribute Detection and Subsequent Attributes Prediction". IEEE Transactions on Image Pr…☆26Mar 24, 2021Updated 4 years ago
- Source code for "Recurrent Fusion Network for Image Captioning".☆23Nov 24, 2018Updated 7 years ago
- Code for paper "Adaptively Aligned Image Captioning via Adaptive Attention Time". NeurIPS 2019☆51Dec 18, 2019Updated 6 years ago
- implement of Multi-Attention Multi-Class Constraint for Fine-grained Image Recognition with keras☆14Aug 3, 2020Updated 5 years ago
- Simple Tensorflow implementation of "SRM : A Style-based Recalibration Module for Convolutional Neural Networks"☆18May 30, 2019Updated 6 years ago
- Code for "simNet: Stepwise Image-Topic Merging Network for Generating Detailed and Comprehensive Image Captions" (EMNLP 2018)☆36Sep 5, 2018Updated 7 years ago
- The implementation of Text-guided Attention Model for Image Captioning☆21Nov 9, 2017Updated 8 years ago
- video captioning☆24Mar 14, 2019Updated 6 years ago
- Implementation of "Watch, Listen, and Describe: Globally and Locally Aligned Cross-Modal Attentions for Video Captioning" (https://arxiv.…☆26Nov 3, 2018Updated 7 years ago
- This project provides the code and datasets for 'CapSal: Leveraging Captioning to Boost Semantics for Salient Object Detection', CVPR 201…☆50Jan 28, 2020Updated 6 years ago
- Code for Discriminability objective for training descriptive captions(CVPR 2018)☆109Nov 21, 2019Updated 6 years ago
- MTLE method, winner of the Large Scale Movie Description Challenge (LSMDC) 2017 - Video Description Task.☆24Jul 12, 2019Updated 6 years ago
- ☆55May 14, 2020Updated 5 years ago
- ☆28Oct 14, 2024Updated last year
- ☆37Jan 5, 2018Updated 8 years ago
- Reinforcement Learning Based Text Style Transfer without Parallel Training Corpus☆27May 27, 2019Updated 6 years ago
- The Theano code for the CVPR 2017 paper "Semantic Compositional Networks for Visual Captioning"☆69Mar 26, 2018Updated 7 years ago
- Bottom-up features extractor implemented in PyTorch.☆72Dec 5, 2019Updated 6 years ago
- DisTime: Distribution-based Time Representation for Video Large Language Models.☆18Jul 10, 2025Updated 7 months ago
- CVPR 2018 - Regularizing RNNs for Caption Generation by Reconstructing The Past with The Present☆98Dec 19, 2018Updated 7 years ago
- Unpaired Image Captioning☆36Mar 25, 2021Updated 4 years ago
- Deployed a facial emotion recognition using neural network model which predicts the emotion from faces in images, videos and live feed fr…☆11May 2, 2021Updated 4 years ago
- This GitHub repository contains converted models in ONNX, TensorRT, and PyTorch formats, along with inference scripts and demos. These mo…☆14Aug 28, 2023Updated 2 years ago
- ☆11Feb 18, 2022Updated 3 years ago
- ☆11Dec 6, 2024Updated last year
- [CVPR 2025] DiscoVLA: Discrepancy Reduction in Vision, Language, and Alignment for Parameter-Efficient Video-Text Retrieval☆22Jun 23, 2025Updated 7 months ago
- Adversarial Inference for Multi-Sentence Video Descriptions (CVPR 2019)☆34Jul 17, 2019Updated 6 years ago
- Official Pytorch implementation for AAAI2021 paper (RSPNet: Relative Speed Perception for Unsupervised Video Representation Learning)☆37Nov 5, 2021Updated 4 years ago
- Show, Edit and Tell: A Framework for Editing Image Captions, CVPR 2020☆81Jul 17, 2020Updated 5 years ago
- Implementation of the paper All Labels Are Not Created Equal: Enhancing Semi-supervision via Label Grouping and Co-training☆45Feb 1, 2022Updated 4 years ago
- ☆42Apr 7, 2024Updated last year