flyinglynx / CapeFormer
Official Implementation for "Matching Is Not Enough: A Two-Stage Framework for Category-Agnostic Pose Estimation", CVPR 2023.
☆50Updated last year
Alternatives and similar repositories for CapeFormer:
Users that are interested in CapeFormer are comparing it to the libraries listed below
- ☆66Updated 8 months ago
- [CVPR 2023] Segmenting objects in videos without human annotations 🤯: Official implementation for Bootstrapping Objectness from Videos b…☆37Updated last year
- The official repo for ECCV'22 paper: Pose for Everything: Towards Category-Agnostic Pose Estimation☆208Updated 10 months ago
- [CVPR2022 Oral] VISOLO: Grid-Based Space-Time Aggregation for Efficient Online Video Instance Segmentation☆29Updated 2 years ago
- Offical code of LOGO-CAP (CVPR' 22). https://arxiv.org/abs/2109.03622☆33Updated 2 years ago
- ☆64Updated last year
- ☆23Updated last year
- CAPE using text-graphs☆19Updated last month
- ☆19Updated last year
- CVPR2022: Large-scale Video Panoptic Segmentation in the Wild: A Benchmark☆140Updated last year
- Official PyTorch implementation of Self-Supervised Any-Point Tracking by Contrastive Random Walks, ECCV 2024.☆50Updated 4 months ago
- Official PyTorch implementation for TCSVT 23 "Detect Any Shadow: Segment Anything for Video Shadow Detection"☆57Updated 4 months ago
- Single-stage Multiple Hand Reconstruction☆30Updated last year
- [NeurIPS 2023] FreeMask: Synthetic Images with Dense Annotations Make Stronger Segmentation Models☆130Updated last year
- [CVPR-2022 (oral)]-Video K-Net: A Simple, Strong, and Unified Baseline for Video Segmentation☆151Updated last year
- ☆111Updated 8 months ago
- ☆104Updated last year
- ☆88Updated 2 months ago
- [NeurIPS 2022] Official implementation of the paper "Rethinking Resolution in the Context of Efficient Video Recognition".☆31Updated 2 years ago
- [NeurIPS2023] Implementation of the paper: Explore In-Context Learning for 3D Point Cloud Understanding☆67Updated 4 months ago
- ☆19Updated 6 months ago
- CVPR 2021 VSPW: A Large-scale Dataset for Video Scene Parsing in the Wild☆30Updated 2 years ago
- [Preprint 2022] “Can We Solve 3D Vision Tasks Starting from A 2D Vision Transformer?” by Yi Wang, Zhiwen Fan, Tianlong Chen, Hehe Fan, Zh…☆61Updated 2 years ago
- ☆58Updated last year
- The project is an official implementation of our paper "POTTER: Pooling Attention Transformer for Efficient Human Mesh Recovery".☆44Updated last year
- [IEEE TMM] GEM: Boost Simple Network for Glass Surface Segmentation via Vision Foundation Models☆13Updated 2 months ago
- The official project of ACM MM 2022 paper "Less is More: Consistent Video Depth Estimation with Masked Frames Modeling".☆36Updated last year
- [ECCV 2022] 🎵PolyphonicFormer: Unified Query Learning for Depth-aware Video Panoptic Segmentation☆57Updated 2 years ago
- MFT: Long-Term Tracking of Every Pixel -- code for the WACV 2024 paper☆56Updated 4 months ago
- [ICLR 2025] Track-On: Transformer-based Online Point Tracking with Memory☆38Updated last month