eladb3 / ORViT
"Object-Region Video Transformers”, Herzig et al., CVPR 2022
☆42Updated 2 years ago
Related projects: ⓘ
- [CVPR2024 Highlight] Official repository of the paper "The devil is in the fine-grained details: Evaluating open-vocabulary object detect…☆39Updated last month
- ☆17Updated 5 months ago
- [NeurIPS 2022 Spotlight] RLIP: Relational Language-Image Pre-training and a series of other methods to solve HOI detection and Scene Grap…☆71Updated 3 months ago
- BEAR: a new BEnchmark on video Action Recognition☆40Updated 4 months ago
- ☆68Updated 11 months ago
- ☆47Updated 2 years ago
- Implementation of paper 'Helping Hands: An Object-Aware Ego-Centric Video Recognition Model'☆30Updated 10 months ago
- Official PyTorch implementation of the paper "Revisiting Temporal Modeling for CLIP-based Image-to-Video Knowledge Transferring"☆91Updated 7 months ago
- [CVPR 2022 Oral] TubeDETR: Spatio-Temporal Video Grounding with Transformers☆166Updated 11 months ago
- Code for ECCV2022 Paper "Mining Cross-Person Cues for Body-Part Interactiveness Learning in HOI Detection"☆33Updated last year
- Temporal Action Localization Visualization Tool (TALVT) is a Javascript based simple visualization tool to visualize the outcomes of the …☆27Updated 3 years ago
- A pytorch Implementation of Open Vocabulary Object Detection with Pseudo Bounding-Box Labels☆55Updated last year
- ☆25Updated last year
- Utilities for the human-object interaction detection dataset HICO-DET☆50Updated 9 months ago
- Official repository for "Self-Supervised Video Transformer" (CVPR'22)☆100Updated 2 months ago
- MAtch, eXpand and Improve: Unsupervised Finetuning for Zero-Shot Action Recognition with Language Knowledge (ICCV 2023)☆25Updated last year
- CVPR 2023 Accepted Paper HOICLIP: Efficient Knowledge Transfer for HOI Detection with Vision-Language Models☆52Updated 6 months ago
- [ECCV 2022] Official Pytorch Implementation of paper : " Proposal-Free Temporal Action Detection with Global Segmentation Mask Learning "…☆18Updated last year
- ☆51Updated 2 years ago
- ☆29Updated 11 months ago
- ☆80Updated 2 years ago
- ☆67Updated last year
- Code for CVPR22 paper: Exploring Structure-aware Transformer over Interaction Proposals for Human-Object Interaction Detection.☆44Updated last month
- ICCV2023: Disentangling Spatial and Temporal Learning for Efficient Image-to-Video Transfer Learning☆35Updated 11 months ago
- ☆50Updated last year
- [CVPR2023] Masked Video Distillation: Rethinking Masked Feature Modeling for Self-supervised Video Representation Learning (https://arxiv…☆101Updated last year
- Official Code for VideoLT: Large-scale Long-tailed Video Recognition (ICCV 2021)☆33Updated 2 years ago
- ☆165Updated 2 years ago
- ☆45Updated last year
- Series of work (ECCV2020, CVPR2021, CVPR2021, ECCV2022) about Compositional Learning for Human-Object Interaction Exploration☆77Updated last year