rese1f / UniAPLinks
[AAAI 2024] UniAP: Towards Universal Animal Perception in Vision via Few-shot Learning
☆12Updated last year
Alternatives and similar repositories for UniAP
Users that are interested in UniAP are comparing it to the libraries listed below
Sorting:
- [ICCV 2023] Global Adaptation meets Local Generalization: Unsupervised Domain Adaptation for 3D Human Pose Estimation☆23Updated last year
- Official code for MotionBench (CVPR 2025)☆45Updated 3 months ago
- ☆48Updated 2 months ago
- [ECCV2024, Oral, Best Paper Finalist] This is the official implementation of the paper "LEGO: Learning EGOcentric Action Frame Generation…☆37Updated 4 months ago
- For Ego4D VQ3D Task☆20Updated last year
- [CVPR2024] Official implementation of the paper: Skeleton-in-Context: Unified Skeleton Sequence Modeling with In-Context Learning☆39Updated last year
- [ECCV 2024 Oral] ActionVOS: Actions as Prompts for Video Object Segmentation☆33Updated 6 months ago
- Official repository of DoraemonGPT: Toward Understanding Dynamic Scenes with Large Language Models☆85Updated 9 months ago
- [ICLR 2025] CREMA: Generalizable and Efficient Video-Language Reasoning via Multimodal Modular Fusion☆46Updated 5 months ago
- Official implementation of the paper "Boosting Human-Object Interaction Detection with Text-to-Image Diffusion Model"☆62Updated last year
- [ECCV2024] The official implementation of "Listen to Look into the Future: Audio-Visual Egocentric Gaze Anticipation".☆12Updated 4 months ago
- Accepted by CVPR 2024☆34Updated last year
- [NeurIPS2024 D&B Spotlight] GAIA: Rethinking Action Quality Assessment for AI-Generated Videos☆28Updated 2 months ago
- [ECCV 2024] STEVE in Minecraft is for See and Think: Embodied Agent in Virtual Environment☆39Updated last year
- Accepted by CVPR 2023☆42Updated last year
- IMProv: Inpainting-based Multimodal Prompting for Computer Vision Tasks☆57Updated 9 months ago
- Code and Dataset for the CVPRW Paper "Where did I leave my keys? — Episodic-Memory-Based Question Answering on Egocentric Videos"☆27Updated last year
- [CVPR 2024] Data and benchmark code for the EgoExoLearn dataset☆61Updated 9 months ago
- Code implementation for paper titled "HOI-Ref: Hand-Object Interaction Referral in Egocentric Vision"☆27Updated last year
- Data release for Step Differences in Instructional Video (CVPR24)☆14Updated last year
- [BMVC2022, IJCV2023, Best Student Paper, Spotlight] Official codes for the paper "In the Eye of Transformer: Global-Local Correlation for…☆27Updated 4 months ago
- [ICLR 2024] Seer: Language Instructed Video Prediction with Latent Diffusion Models☆33Updated last year
- FleVRS: Towards Flexible Visual Relationship Segmentation, NeurIPS 2024☆21Updated 6 months ago
- ☆37Updated 3 months ago
- ☆31Updated last year
- [ICML 2024] A Touch, Vision, and Language Dataset for Multimodal Alignment☆78Updated 3 weeks ago
- This is the offical repository of LLAVIDAL☆15Updated 3 months ago
- Code release for "EgoVLPv2: Egocentric Video-Language Pre-training with Fusion in the Backbone" [ICCV, 2023]☆99Updated 11 months ago
- Code for our paper: Learning Camera Movement Control from Real-World Drone Videos☆29Updated 2 months ago
- Code release for the paper "Egocentric Video Task Translation" (CVPR 2023 Highlight)☆32Updated 2 years ago