Implementation of CVPR2017 paper "A Hierarchical Approach for Generating Descriptive Image Paragraphs" in Tensorflow (in progress...)
☆13 · Jan 27, 2018 · Updated 8 years ago
Alternatives and similar repositories for im2p-tensorflow
Users that are interested in im2p-tensorflow are comparing it to the libraries listed below.
- Tensorflow implement of paper: A Hierarchical Approach for Generating Descriptive Image Paragraphs ☆49 · Jul 31, 2018 · Updated 7 years ago
- Re-implement CVPR2017 paper: "dense captioning with joint inference and visual context" and minor changes in Tensorflow. (mAP 8.296 after… ☆60 · Feb 21, 2019 · Updated 7 years ago
- Tensorflow implementation of paper: A Hierarchical Approach for Generating Descriptive Image Paragraphs ☆15 · Apr 27, 2018 · Updated 7 years ago
- A Hierarchical Approach for Generating Descriptive Image Paragraphs ☆10 · Mar 27, 2020 · Updated 5 years ago
- SelfCriticalSequenceTrainingforImageCaptioning ☆21 · May 27, 2017 · Updated 8 years ago
- Dense captioning with joint inference and visual context ☆53 · Dec 25, 2018 · Updated 7 years ago
- Implementation of the CPTR model by https://arxiv.org/pdf/2101.10804.pdf ☆10 · Mar 27, 2022 · Updated 3 years ago
- [EMNLP 2018] Training for Diversity in Image Paragraph Captioning ☆91 · Sep 12, 2019 · Updated 6 years ago
- ☆19 · May 19, 2024 · Updated last year
- PyTorch code for the paper: "Perceive, Transform, and Act: Multi-Modal Attention Networks for Vision-and-Language Navigation" ☆19 · Aug 5, 2021 · Updated 4 years ago
- Towards Diverse and Natural Image Descriptions via a Conditional GAN ☆75 · Dec 2, 2017 · Updated 8 years ago
- The implementation of the model in paper "Skeleton Key: Image Captioning by Skeleton-Attribute Decomposition" ☆26 · Aug 8, 2017 · Updated 8 years ago
- <Asynchronous Programming in Rust> Chinese translation ☆13 · Dec 30, 2020 · Updated 5 years ago
- A project for telling stories according to images in some particular style ☆16 · Dec 16, 2018 · Updated 7 years ago
- [NeurIPS D&B'24] Enhancing vision-language models for medical imaging: bridging the 3D gap with innovative slice selection ☆21 · Nov 25, 2024 · Updated last year
- Android Studio project: Baidu Maps development implementing basic positioning, a lab exercise for a mobile development course ☆16 · Nov 30, 2018 · Updated 7 years ago
- Rethinking the Form of Latent States in Image Captioning ☆20 · Aug 31, 2018 · Updated 7 years ago
- Code release for Context-Aware Visual Policy Network for Sequence-Level Image Captioning (MM 2018) and Context-Aware Visual Policy Networ… ☆46 · Jul 27, 2019 · Updated 6 years ago
- Code for the paper "Detecting visual relations using analogies", ICCV19 ☆21 · Jan 23, 2020 · Updated 6 years ago
- Progressive Transformer-Based Generation of Radiology Reports ☆25 · Jan 5, 2025 · Updated last year
- This is the official repo for "MAN: Moment Alignment Network for Natural Language Moment Retrieval via Iterative Graph Adjustment" ☆17 · May 27, 2019 · Updated 6 years ago
- ☆12 · Nov 29, 2017 · Updated 8 years ago
- Implementation of StarGAN in Tensorflow ☆16 · Apr 15, 2018 · Updated 7 years ago
- Jointly Measuring Diversity and Quality in Text Generation Models ☆26 · May 16, 2020 · Updated 5 years ago
- Official pytorch implementation of "Explore-And-Match: Bridging Proposal-Based and Proposal-Free With Transformer for Sentence Grounding … ☆42 · Aug 5, 2022 · Updated 3 years ago
- ☆15 · Dec 11, 2023 · Updated 2 years ago
- Codes for reproducing the adversarial attacks on image captioning systems in “Attacking Visual Language Grounding with Adversarial Examp… ☆39 · Feb 18, 2022 · Updated 4 years ago
- labs and exercises for EECE.6540 Heterogeneous Computing at UMass Lowell ☆13 · Jun 13, 2023 · Updated 2 years ago
- Tutorial about doing CMake Right ☆15 · Mar 12, 2020 · Updated 6 years ago
- Source code for "Recurrent Fusion Network for Image Captioning". ☆23 · Nov 24, 2018 · Updated 7 years ago
- Adds SPICE metric to coco-caption evaluation server codes ☆50 · Feb 2, 2023 · Updated 3 years ago
- Code for paper: Image Inpainting with Learnable Bidirectional Attention Maps (ICCV 2019) ☆18 · Nov 5, 2019 · Updated 6 years ago
- ☆129 · Dec 5, 2018 · Updated 7 years ago
- This is Video Swin Transformer to recognize the video with Machine Vision ☆19 · Sep 4, 2021 · Updated 4 years ago
- Code for the paper: Semantic Conditioned Dynamic Modulation for Temporal Sentence Grounding in Videos ☆71 · Sep 7, 2021 · Updated 4 years ago
- ☆14 · Jan 21, 2018 · Updated 8 years ago
- 🎨 Colorful and pretty themes for HackMD. Most of them are ported from Typora Themes and Obsidian Themes. ☆28 · Mar 10, 2022 · Updated 4 years ago
- AAAI2020-The official implementation of "Learning Cross-modal Context Graph for Visual Grounding" ☆58 · Oct 25, 2021 · Updated 4 years ago
- Bottom-up features extractor implemented in PyTorch. ☆72 · Dec 5, 2019 · Updated 6 years ago