shajiayu1 / DiffusionPose
☆13Updated 2 years ago
Alternatives and similar repositories for DiffusionPose:
Users that are interested in DiffusionPose are comparing it to the libraries listed below
- [CVPR2023] This is an official implementation of paper "DETRs with Hybrid Matching".☆14Updated 2 years ago
- Codes for ICML 2023 Learning Dynamic Query Combinations for Transformer-based Object Detection and Segmentation☆37Updated last year
- ICCV DeeperAction Challenge - Kinetics-TPS Challenge on Part-level Action Parsing and Action Recognition.☆15Updated 3 years ago
- ☆32Updated 2 years ago
- ☆25Updated 5 months ago
- Unifying Visual Perception by Dispersible Points Learning (ECCV 2022)☆51Updated 2 years ago
- Benchmarking Attention Mechanism in Vision Transformers.☆17Updated 2 years ago
- [ICCV 2021] Official PyTorch Code for "Online Knowledge Distillation for Efficient Pose Estimation"☆43Updated last year
- ☆19Updated last year
- CV701 Assignment on Pose Estimation☆17Updated 4 months ago
- ☆17Updated 2 years ago
- A Siamese self-supervised pretraining approach for the Transformer architecture in DETR☆36Updated last year
- An object detection codebase based on MegEngine.☆28Updated 2 years ago
- Official Pytorch implementation for Distilling Image Classifiers in Object detection (NeurIPS2021)☆31Updated 3 years ago
- Unofficial implement of "Pix2seq: A Language Modeling Framework for Object Detection" on mmdetection☆31Updated 2 years ago
- Teach-DETR: Better Training DETR with Teachers☆31Updated last year
- ☆11Updated last year
- LongShortNet for Streaming Perception task.☆13Updated last year
- [ICME 2023] FlowText: Synthesizing Realistic Scene Text Video with Optical Flow Estimation☆11Updated last year
- ☆37Updated 2 years ago
- ☆25Updated 3 years ago
- Large-batch Optimization for Dense Visual Predictions (NeurIPS 2022)☆56Updated 2 years ago
- ☆13Updated 3 years ago
- 一个mmcv 的logger hook, 可以用来把模型结果推送到微信上☆20Updated 2 years ago
- Scaling Multi-modal Instruction Fine-tuning with Tens of Thousands Vision Task Types☆13Updated last month
- Code For Our Work: DVIS-DAQ: Improving Video Segmentation via Dynamic Anchor Queries [ECCV-2024]☆14Updated 8 months ago
- [ECCV 2024] This is the official implementation of "Stitched ViTs are Flexible Vision Backbones".☆27Updated last year
- DreamDance: Personalized Text-to-video Generation by Combining Text-to-Image Synthesis and Motion Transfer☆14Updated 2 years ago
- ☆59Updated last year
- Code for "The Box Size Confidence Bias Harms Your Object Detector" (https://arxiv.org/abs/2112.01901)☆27Updated 2 years ago