bo-miao / RefHuman
[NeurIPS 2024] Referring Human Pose and Mask Estimation In the Wild
☆43Updated 4 months ago
Alternatives and similar repositories for RefHuman:
Users that are interested in RefHuman are comparing it to the libraries listed below
- CoS: Chain-of-Shot Prompting for Long Video Understanding☆47Updated 2 months ago
- [TCSVT 2024] Official PyTorch implementation of the paper "MLP: Motion Label Prior for Temporal Sentence Localization in Untrimmed 3D Hum…☆24Updated 9 months ago
- [ICCV 2023] Spectrum-guided Multi-granularity Referring Video Object Segmentation.☆109Updated last month
- ☆90Updated last year
- An open-source library with a powerful Contrastive Language-and-Motion (CLaM) pre-training evaluator☆97Updated last month
- ☆80Updated 6 months ago
- Official Implementation of AttentionShift: Iteratively Estimated Part-based Attention Map for Pointly Supervised Instance Segmentation☆157Updated 6 months ago
- Official implementation of "Generating images with 3D annotations using diffusion models".☆47Updated 8 months ago
- ICCV 2023: Weakly-supervised 3D Pose Transfer with Keypoints☆58Updated last week
- Free-T2M: Frequency enhanced text-to-motion diffusion model with consistency loss☆65Updated 3 months ago
- ☆43Updated 2 weeks ago
- [CVPR'24 Highlight] SHiNe: Semantic Hierarchy Nexus for Open-vocabulary Object Detection☆92Updated 9 months ago
- Official Code of "GeReA: Question-Aware Prompt Captions for Knowledge-based Visual Question Answering"☆110Updated 7 months ago
- A Unified Driving World Model for Future Generation and Perception☆102Updated last month
- [ECCV 2022] GEB+: A Benchmark for Generic Event Boundary Captioning, Grounding and Retrieval☆49Updated 2 months ago
- [MM'24 Oral] Prior Knowledge Integration via LLM Encoding and Pseudo Event Regulation for Video Moment Retrieval☆124Updated 8 months ago
- UniInst☆99Updated last year
- [ICRA 2025]AVD2: Accident Video Diffusion for Accident Video Description☆79Updated 2 months ago
- [ICLR'24] Democratizing Fine-grained Visual Recognition with Large Language Models☆176Updated 9 months ago
- [ICLR 2025] Ctrl-U: Robust Conditional Image Generation via Uncertainty-aware Reward Modeling☆78Updated 2 months ago
- (NeurIPS 2024) Official PyTorch implementation of LOVA3☆83Updated last month
- Language-to-4D Modeling Towards 6-DoF Tracking and Shape Reconstruction in 3D Point Cloud Stream [CVPR2024]☆66Updated last year
- Rethinking Video-Text Understanding Retrieval from Counterfactually Augmented Data☆39Updated 9 months ago
- [CVPR 2024] Interactive continual learning: Fast and slow thinking☆99Updated 10 months ago
- ☆36Updated 10 months ago
- [IJCV 2024] RAD: A Dataset and Benchmark for Real-Life Anomaly Detection with Robotic Observations☆36Updated 6 months ago
- Official implementation of ECCV2022 paper End-to-End Weakly Supervised Object Detection with Sparse Proposal Evolution☆101Updated last year
- Official code for "A Closer Look at Audio-Visual Segmentation"☆94Updated 2 months ago
- Domain Prompt Learning with Quaternion Networks (CVPR2024 Highlight)☆77Updated 4 months ago
- [ECCV2022] Learning Quality-aware Dynamic Memory for Video Object Segmentation☆122Updated 2 years ago