soCzech / MultiTaskObjectStatesLinks

Code for the paper "Multi-Task Learning of Object States and State-Modifying Actions from Web Videos" published in TPAMI

☆11

Alternatives and similar repositories for MultiTaskObjectStates

Users that are interested in MultiTaskObjectStates are comparing it to the libraries listed below

Sorting:

soCzech / ChangeIt
ChangeIt dataset with more than 2600 hours of video with state-changing actions published at CVPR 2022
☆11Updated 3 years ago
epic-kitchens / epic-kitchens-100-hand-object-bboxes
A repo for processing the raw hand object detections to produce releasable pickles + library for using these
☆39Updated last year
Buzz-Beater / LEMMA
Code for ECCV 2020 paper - LEMMA: A Multi-view Dataset for LEarning Multi-agent Multi-task Activities
☆30Updated 4 years ago
jalayrac / object-states-action
Code for the paper Joint Discovery of Object States and Manipulation Actions, ICCV 2017
☆14Updated 7 years ago
facebookresearch / ego-topo
Code accompanying EGO-TOPO: Environment Affordances from Egocentric Video (CVPR 2020)
☆31Updated 3 years ago
soCzech / LookForTheChange
Code for Look for the Change paper published at CVPR 2022
☆36Updated 3 years ago
Tushar-N / interaction-hotspots
Learning interaction hotspots from egocentric video
☆52Updated 2 years ago
antoine77340 / RareAct
RareAct: A video dataset of unusual interactions
☆33Updated 5 years ago
epic-kitchens / C1-Action-Recognition-TSN-TRN-TSM
EPIC-Kitchens-100 Action Recognition baselines: TSN, TRN, TSM
☆32Updated 3 years ago
fpv-iplab / stillfast
Code for the paper: F. Ragusa, G. M. Farinella, A. Furnari. StillFast: An End-to-End Approach for Short-Term Object Interaction Anticipat…
☆13Updated 2 years ago
kuanfang / opra
Online Product Reviews for Affordances
☆23Updated 6 years ago
2020aptx4869lm / Forecasting-Human-Object-Interaction-in-FPV
☆25Updated 6 years ago
DmZhukov / CrossTask
☆93Updated 3 years ago
princeton-vl / Rel3D
Official code for NeurRIPS 2020 paper "Rel3D: A Minimally Contrastive Benchmark for Grounding Spatial Relations in 3D"
☆31Updated 11 months ago
NVlabs / RelViT
[ICLR 2022] RelViT: Concept-guided Vision Transformer for Visual Relational Reasoning
☆63Updated 3 years ago
ykztawas / Weakly-Supervised-Affordance-Detection
☆28Updated 6 years ago
brown-palm / AntGPT
Official code implemtation of paper AntGPT: Can Large Language Models Help Long-term Action Anticipation from Videos?
☆26Updated last year
EGO4D / hands-and-objects
☆80Updated 3 years ago
roeiherz / AG2Video
Code for "Compositional Video Synthesis with Action Graphs", Bar & Herzig et al., ICML 2021
☆32Updated 3 years ago
DirtyHarryLYL / SymNet
As a part of the HAKE project (HAKE-Object), code for SymNet (CVPR'20 and TPAMI'21).
☆53Updated 2 years ago
Buzz-Beater / EgoTaskQA
Code for NeurIPS 2022 Datasets and Benchmarks paper - EgoTaskQA: Understanding Human Tasks in Egocentric Videos.
☆36Updated 2 years ago
epic-kitchens / epic-kitchens-100-annotations
Annotations for the public release of the EPIC-KITCHENS-100 dataset
☆158Updated 3 years ago
zfchenUnique / compositional_physics_learner
☆40Updated 3 years ago
snaredataset / snare
SNARE Dataset with MATCH and LaGOR models
☆24Updated last year
neuroailab / PSGNets
Neural Networks that convert input movies into Physical Scene Graphs (PSGs)
☆63Updated 4 years ago
epic-kitchens / epic-kitchens-100-narrator
Video narrator written in Python/GTK using vlc-lib
☆25Updated 3 years ago
DirtyHarryLYL / DJ-RN
As a part of HAKE project (HAKE-3D). Code for our CVPR2020 paper "Detailed 2D-3D Joint Representation for Human-Object Interaction".
☆103Updated 2 years ago
HaozhiQi / RPIN
Learning Long-term Visual Dynamics with Region Proposal Interaction Networks (ICLR 2021)
☆113Updated 3 years ago
DuaneNielsen / keypoints
Unsupervised learning of Object Landmarks through Conditional Image Generation
☆48Updated 5 years ago
gsig / actor-observer
ActorObserverNet code in PyTorch from "Actor and Observer: Joint Modeling of First and Third-Person Videos", CVPR 2018
☆82Updated 6 years ago