nguyennm1024 / OSCaR
🔥🔥🔥 Object State Description & Change Detection
⭐10 · Updated last year
Alternatives and similar repositories for OSCaR
Users interested in OSCaR are comparing it to the repositories listed below.
- Code and data release for the paper "Learning Object State Changes in Videos: An Open-World Perspective" (CVPR 2024) ⭐33 · Updated 9 months ago
- Code and data release for the paper "Learning Fine-grained View-Invariant Representations from Unpaired Ego-Exo Videos via Temporal Align… ⭐17 · Updated last year
- Visual Representation Learning with Stochastic Frame Prediction (ICML 2024) ⭐21 · Updated 7 months ago
- Official code implementation of the paper "AntGPT: Can Large Language Models Help Long-term Action Anticipation from Videos?" ⭐21 · Updated 9 months ago
- IMProv: Inpainting-based Multimodal Prompting for Computer Vision Tasks ⭐57 · Updated 9 months ago
- Code for the "Look for the Change" paper published at CVPR 2022 ⭐36 · Updated 2 years ago
- ChangeIt dataset with more than 2,600 hours of video with state-changing actions, published at CVPR 2022 ⭐11 · Updated 3 years ago
- Code implementation for our ECCV 2022 paper titled "My View is the Best View: Procedure Learning from Egocentric Videos" ⭐28 · Updated last year
- Code implementation for the paper titled "HOI-Ref: Hand-Object Interaction Referral in Egocentric Vision" ⭐27 · Updated last year
- Code for the paper "Multi-Task Learning of Object States and State-Modifying Actions from Web Videos" published in TPAMI ⭐11 · Updated last year
- Affordance Grounding from Demonstration Video to Target Image (CVPR 2023) ⭐44 · Updated 11 months ago
- HT-Step is a large-scale article grounding dataset of temporal step annotations on how-to videos ⭐19 · Updated last year
- [ICLR 2024 Poster] SCHEMA: State CHangEs MAtter for Procedure Planning in Instructional Videos ⭐18 · Updated 7 months ago
- Official implementation of the CVPR'24 paper "Adaptive Slot Attention: Object Discovery with Dynamic Slot Number" ⭐50 · Updated 5 months ago
- [ECCV 2024, Oral, Best Paper Finalist] Official implementation of the paper "LEGO: Learning EGOcentric Action Frame Generation… ⭐37 · Updated 4 months ago
- [AAAI 2023] Symbolic Replay: Scene Graph as Prompt for Continual Learning on VQA Task (Oral) ⭐39 · Updated last year
- Ego4D Goal-Step: Toward Hierarchical Understanding of Procedural Activities (NeurIPS 2023) ⭐44 · Updated last year
- Official implementation of "Improving Object-centric Learning With Query Optimization" ⭐50 · Updated 2 years ago
- Official code for Slot-Transformer for Videos (STEVE) ⭐49 · Updated 2 years ago
- Code for the VOST dataset ⭐26 · Updated last year
- [NeurIPS 2023] Self-supervised Object-Centric Learning for Videos ⭐27 · Updated 6 months ago
- [CVPR 2024] Data and benchmark code for the EgoExoLearn dataset ⭐61 · Updated 9 months ago
- A repo for processing raw hand-object detections into releasable pickles, plus a library for using them ⭐37 · Updated 8 months ago
- ⭐41 · Updated last year
- The Adverbs in Recipes (AIR) dataset and code for the CVPR 2023 paper "Learning Action Changes by Me… ⭐13 · Updated 2 years ago
- Code for the NeurIPS 2022 Datasets and Benchmarks paper "EgoTaskQA: Understanding Human Tasks in Egocentric Videos" ⭐33 · Updated 2 years ago
- Self-supervised algorithm for learning representations from egocentric video data. Code is tested on EPIC-Kitchens-100 and Ego4D in PyTo… ⭐12 · Updated 2 years ago
- Official code for Neural Systematic Binder ⭐33 · Updated 2 years ago
- ⭐19 · Updated last year
- ⭐71 · Updated 6 months ago