bo-miao / RefHuman
[NeurIPS 2024] Referring Human Pose and Mask Estimation In the Wild
☆43Updated 4 months ago
Alternatives and similar repositories for RefHuman
Users that are interested in RefHuman are comparing it to the libraries listed below
- [TCSVT 2024] Official PyTorch implementation of the paper "MLP: Motion Label Prior for Temporal Sentence Localization in Untrimmed 3D Hum…☆24Updated 10 months ago
- CoS: Chain-of-Shot Prompting for Long Video Understanding☆48Updated 3 months ago
- An open-source library with a powerful Contrastive Language-and-Motion (CLaM) pre-training evaluator☆97Updated last month
- ☆89Updated last year
- ☆80Updated 7 months ago
- ICCV 2023: Weakly-supervised 3D Pose Transfer with Keypoints☆58Updated last month
- Official implementation of "Generating images with 3D annotations using diffusion models".☆47Updated 9 months ago
- [ICCV 2023] Spectrum-guided Multi-granularity Referring Video Object Segmentation.☆109Updated last month
- Free-T2M: Frequency enhanced text-to-motion diffusion model with consistency loss☆66Updated 3 months ago
- Official Implementation of AttentionShift: Iteratively Estimated Part-based Attention Map for Pointly Supervised Instance Segmentation☆158Updated 7 months ago
- UniInst☆99Updated last year
- [ICLR 2025] Ctrl-U: Robust Conditional Image Generation via Uncertainty-aware Reward Modeling☆78Updated 3 months ago
- [IJCV 2024] RAD: A Dataset and Benchmark for Real-Life Anomaly Detection with Robotic Observations☆36Updated 7 months ago
- Official Code of "GeReA: Question-Aware Prompt Captions for Knowledge-based Visual Question Answering"☆111Updated 7 months ago
- Language-to-4D Modeling Towards 6-DoF Tracking and Shape Reconstruction in 3D Point Cloud Stream [CVPR2024]☆66Updated last year
- The official implementation of MotionLab☆118Updated 2 months ago
- [CVPR 2024] Interactive continual learning: Fast and slow thinking☆99Updated 10 months ago
- ☆44Updated last month
- [MM'24 Oral] Prior Knowledge Integration via LLM Encoding and Pseudo Event Regulation for Video Moment Retrieval☆126Updated 9 months ago
- Domain Prompt Learning with Quaternion Networks (CVPR2024 Highlight)☆79Updated 5 months ago
- ☆36Updated 11 months ago
- [ICRA 2025] AVD2: Accident Video Diffusion for Accident Video Description☆81Updated 2 weeks ago
- Official implementation of "Enhancing Close-up Novel View Synthesis via Pseudo-labeling" [AAAI 2025]☆13Updated last month
- Image and video Tokenizer/VAE selection guide, text and face reconstruction evaluation.☆16Updated last week
- (NeurIPS 2024) Official PyTorch implementation of LOVA3☆85Updated 2 months ago
- ✨✨Latest advancements in VLA models (Vision Language Action)☆73Updated last month
- Official Implementation for "Mask-based modeling for Neural Radiance Fields" (ICLR 2024)☆37Updated 11 months ago
- Rethinking Video-Text Understanding Retrieval from Counterfactually Augmented Data☆39Updated 10 months ago
- [CVPR'24 Highlight] SHiNe: Semantic Hierarchy Nexus for Open-vocabulary Object Detection☆92Updated 10 months ago
- Domain-Controlled Prompt Learning (AAAI2024)☆88Updated 6 months ago