khoiucd / escape-tgt
Official Implementation for "ESCAPE: Encoding Super-keypoints for Category-Agnostic Pose Estimation", CVPR 2024.
☆9 Updated 10 months ago
Alternatives and similar repositories for escape-tgt
Users who are interested in escape-tgt are comparing it to the repositories listed below.
- ☆16 Updated last year
- [ECCV 2024] Code for Betrayed by Attention: A Simple yet Effective Approach for Self-supervised Video Object Segmentation ☆34 Updated 2 months ago
- [ICCV 2023] PyTorch implementation of RandBox ☆53 Updated last year
- Code for Semantics Meets Temporal Correspondence: Self-supervised Object-centric Learning in Videos ☆9 Updated 8 months ago
- Proportional Amplitude Spectrum Training Augmentation for Synthetic to Real Domain Generalization ☆21 Updated last year
- [NeurIPS'23] DropPos: Pre-Training Vision Transformers by Reconstructing Dropped Positions ☆60 Updated last year
- The official repository for ICLR2024 paper "FROSTER: Frozen CLIP is a Strong Teacher for Open-Vocabulary Action Recognition" ☆79 Updated 3 months ago
- ☆47 Updated 2 years ago
- Official implementation of CVPR 23 paper "M3Video: Masked Motion Modeling for Self-Supervised Video Representation Learning" ☆51 Updated last year
- [CVPR 2023] Official repository for paper "Stare at What You See: Masked Image Modeling without Reconstruction" ☆69 Updated last year
- Official implementation of the CVPR'24 paper [Adaptive Slot Attention: Object Discovery with Dynamic Slot Number] ☆42 Updated 3 months ago
- [CVPR 2024] The official implementation of paper "Sculpting Holistic 3D Representation in Contrastive Language-Image-3D Pre-training" ☆35 Updated last year
- [ICLR 2024] Test-Time Adaptation with CLIP Reward for Zero-Shot Generalization in Vision-Language Models. ☆78 Updated 9 months ago
- The official code for Relational Context Learning for Human-Object Interaction Detection, CVPR2023. ☆48 Updated 2 years ago
- This repo contains the official implementation of ICLR 2024 paper "Is ImageNet worth 1 video? Learning strong image encoders from 1 long … ☆88 Updated 11 months ago
- Official code repo of PIN: Positional Insert Unlocks Object Localisation Abilities in VLMs ☆26 Updated 3 months ago
- Official Implementation of the CrossMAE paper: Rethinking Patch Dependence for Masked Autoencoders ☆108 Updated last month
- ☆61 Updated last year
- [NeurIPS 2023] Self-supervised Object-Centric Learning for Videos ☆27 Updated 5 months ago
- [NeurIPS 2024] Official code for paper "EZ-HOI: VLM Adaptation via Guided Prompt Learning for Zero-Shot HOI Detection" ☆32 Updated last month
- (ICCV 2023) MasQCLIP for Open-Vocabulary Universal Image Segmentation ☆37 Updated last year
- CAPE using text-graphs ☆22 Updated last month
- [ECCV 2024] PyTorch implementation of CropMAE, introduced in "Efficient Image Pre-Training with Siamese Cropped Masked Autoencoders" ☆58 Updated 2 months ago
- OVAD: Open-vocabulary Attribute Detection code ☆29 Updated last year
- Official implementation of the paper "Boosting Human-Object Interaction Detection with Text-to-Image Diffusion Model" ☆60 Updated last year
- [ECCV2024] ClearCLIP: Decomposing CLIP Representations for Dense Vision-Language Inference ☆82 Updated last month
- [NeurIPS 2023] OV-PARTS: Towards Open-Vocabulary Part Segmentation ☆83 Updated 10 months ago
- IFSeg: Image-free Semantic Segmentation via Vision-Language Model (CVPR 2023) ☆90 Updated last year
- The official github repo for "Test-Time Training with Masked Autoencoders" ☆82 Updated last year
- ICCV2023: Disentangling Spatial and Temporal Learning for Efficient Image-to-Video Transfer Learning ☆41 Updated last year