PyTorch code for Learning to Caption Images through a Lifetime by Asking Questions (ICCV 2019)
★16 · Sep 17, 2019 · Updated 6 years ago
Alternatives and similar repositories for Caption-Lifetime-by-Asking-Questions
Users that are interested in Caption-Lifetime-by-Asking-Questions are comparing it to the libraries listed below.
- ★27 · May 4, 2020 · Updated 6 years ago
- PyTorch Implementation for EMNLP'21 Findings "Reasoning Visual Dialog with Sparse Graph Learning and Knowledge Transfer" ★13 · Feb 1, 2023 · Updated 3 years ago
- Convert data to their natural (human-readable) format ★30 · Nov 4, 2021 · Updated 4 years ago
- ✨ Official PyTorch Implementation for EMNLP'19 Paper, "Dual Attention Networks for Visual Reference Resolution in Visual Dialog" ★44 · Mar 19, 2023 · Updated 3 years ago
- Baseline model for nocaps benchmark, ICCV 2019 paper "nocaps: novel object captioning at scale". ★77 · Oct 3, 2023 · Updated 2 years ago
- Dataset and code corresponding to Associating Natural Language Comment and Source Code Entities (AAAI 2020) ★20 · Oct 24, 2020 · Updated 5 years ago
- Models for the Collaborative Drawing (CoDraw) task ★13 · Jan 15, 2019 · Updated 7 years ago
- Library for preprocessing java source code into Augmented ASTs, as per the paper Open Vocabulary Learning on Source Code with a Graph-Str… ★21 · Oct 22, 2018 · Updated 7 years ago
- visual dialog model in pytorch ★110 · May 16, 2018 · Updated 8 years ago
- https://nv-tlabs.github.io/semanticGAN/ ★13 · Oct 23, 2023 · Updated 2 years ago
- PyTorch implementation of the Reinforced Mnemonic Reader + Answer Verifier model (https://arxiv.org/abs/1808.05759) ★10 · Nov 23, 2018 · Updated 7 years ago
- Rethinking the Form of Latent States in Image Captioning ★20 · Aug 31, 2018 · Updated 7 years ago
- Code for the paper BiST: Bi-directional Spatio-Temporal Reasoning for Video-Grounded Dialogues (EMNLP20) ★11 · Jun 16, 2025 · Updated 11 months ago
- maskrcnn implementation using chainer ★14 · Jun 12, 2018 · Updated 7 years ago
- Code for "A Simple Baseline for Audio-Visual Scene-Aware Dialog" ★27 · May 26, 2020 · Updated 6 years ago
- Implementation of FPN (Feature Pyramid Networks) using Chainer ★14 · Feb 8, 2019 · Updated 7 years ago
- PyTorch code for Learning Cooperative Visual Dialog Agents using Deep Reinforcement Learning ★169 · Oct 10, 2018 · Updated 7 years ago
- PororoQA, https://arxiv.org/abs/1707.00836 ★27 · Sep 16, 2022 · Updated 3 years ago
- 🦾 PyTorch Implementation for the ICRA'24 Paper, "PROGrasp: Pragmatic Human-Robot Communication for Object Grasping" ★15 · May 5, 2025 · Updated last year
- A PyTorch implementation of Dual Attention Network ★30 · Mar 27, 2022 · Updated 4 years ago
- MTLE method, winner of the Large Scale Movie Description Challenge (LSMDC) 2017 - Video Description Task. ★24 · Jul 12, 2019 · Updated 6 years ago
- MAVERICS (Manually-vAlidated Vq^2a Examples fRom Image-Caption datasetS) is a suite of test-only benchmarks for visual question answering… ★13 · Feb 18, 2023 · Updated 3 years ago
- Use transformer for captioning ★156 · May 2, 2019 · Updated 7 years ago
- Placeholder for code of BSP. ★11 · Aug 13, 2021 · Updated 4 years ago
- ★10 · Dec 28, 2018 · Updated 7 years ago
- ★18 · Jun 10, 2024 · Updated last year
- code for running trained model from Visual Reasoning by Progressive Module Networks (ICLR19) ★15 · Jan 30, 2019 · Updated 7 years ago
- ★30 · Oct 2, 2018 · Updated 7 years ago
- Tensorflow implement of paper: Optimization of image description metrics using policy gradient methods ★29 · Jul 31, 2018 · Updated 7 years ago
- Training and testing code from our CVPR 2023 paper "Are Deep Neural Networks SMARTer than Second Graders?" ★11 · Aug 10, 2023 · Updated 2 years ago
- This is a repository contains the implementation of our NeurIPS'24 paper "Temporal Sentence Grounding with Relevance Feedback in Videos" ★13 · Aug 22, 2025 · Updated 9 months ago
- Code for the paper Non-Autoregressive Dialog State Tracking (ICLR20) ★44 · Feb 25, 2020 · Updated 6 years ago
- ★11 · Feb 18, 2022 · Updated 4 years ago
- Generating Easy-to-Understand Referring Expressions for Target Identifications ★18 · Aug 30, 2019 · Updated 6 years ago
- awesome video-based self-supervised learning methods in recently years ★10 · Nov 26, 2020 · Updated 5 years ago
- Develop ultimate AI Pokémon trainer ★20 · Jun 24, 2025 · Updated 11 months ago
- Are Binary Annotations Sufficient? Video Moment Retrieval via Hierarchical Uncertainty-based Active Learning ★15 · Dec 12, 2023 · Updated 2 years ago
- ★12 · Mar 8, 2021 · Updated 5 years ago
- ★10 · Mar 30, 2022 · Updated 4 years ago