pratik18v / sst
My implementation (PyTorch) for the paper SST: Single-Stream Temporal Action Proposals (http://vision.stanford.edu/pdf/buch2017cvpr.pdf).
☆10Updated 2 years ago
Alternatives and similar repositories for sst
Users who are interested in sst are comparing it to the repositories listed below
- Project Uncovering Temporal Context for Video Question and Answering☆14Updated 9 years ago
- Implementation for the CVPR2019 paper "Graphical Contrastive Losses for Scene Graph Parsing"☆12Updated 5 years ago
- Memory-augmented Attention Modelling for Videos☆9Updated 8 years ago
- Code base for zero-shot action localization through spatial-aware object embeddings☆24Updated 7 years ago
- Temporal Context Network for Activity Localization in Videos☆31Updated 7 years ago
- ☆17Updated 7 years ago
- This is a reimplementation of 3D CNN (http://vlg.cs.dartmouth.edu/c3d/). It is compatible with Caffe 2016. The Caffe is forked from Caf…☆9Updated 8 years ago
- This repository is intended to host tools and demos for ActivityNet☆21Updated 8 years ago
- ☆11Updated 8 years ago
- Localize objects in images using referring expressions☆37Updated 8 years ago
- Mapping Images to Scene Graphs with Permutation-Invariant Structured Prediction☆12Updated 6 years ago
- Diagnostic tools and additional visualizations from "What Actions are Needed for Understanding Human Actions in Videos?" ICCV 2017☆88Updated 7 years ago
- 4th place solution to Google Cloud & YouTube-8M Video Understanding Challenge☆26Updated 8 years ago
- ☆29Updated 8 years ago
- A PyTorch Implementation for our ECCV 2018 paper "Joint Person Segmentation and Identification in Synchronized First- and Third-person Vi…☆12Updated 5 years ago
- Implementation of the Budgeted Super Networks☆25Updated 6 years ago
- https://arxiv.org/abs/1707.00836☆21Updated 7 years ago
- Video captioning using LSTM and CNN. This is the Visual Learning project done by Rui Zhang, Yujia Huang and Yu Zhang☆19Updated 9 years ago
- Referring expression comprehension on ReferIt(RefClef)☆10Updated 8 years ago
- Multi-Target Embodied Question Answering☆11Updated 6 years ago
- Visualize videos, groundtruth annotations, and predictions☆18Updated 2 years ago
- image caption with semantic attention☆11Updated 8 years ago
- Contains code for the EMNLP paper "Learning Linguistic Attributes for Zero-Shot Verb Classification"☆26Updated 7 years ago
- ☆59Updated 7 years ago
- Co-attending Regions and Detections for VQA.☆40Updated 7 years ago
- Author's implementation of the paper "Deep Relative Attributes" (ACCV 2016)☆43Updated 7 years ago
- Finalist entry for the M2CAI Workflow Challenge 2016☆9Updated 8 years ago
- Ask, Attend and Answer: Exploring Question-Guided Spatial Attention for Visual Question Answering☆25Updated 4 years ago
- Charades Object Detection Dataset (ICCV 2017)☆31Updated 7 years ago
- ☆18Updated 8 years ago