Affordance Grounding from Demonstration Video to Target Image (CVPR 2023)
☆46 · Jul 26, 2024 · Updated last year
Alternatives and similar repositories for afformer
Users that are interested in afformer are comparing it to the libraries listed below.
- [ECCV 2022] AssistQ: Affordance-centric Question-driven Task Completion for Egocentric Assistant ☆23 · Jan 30, 2026 · Updated 4 months ago
- [Findings of EMNLP 2022] AssistSR: Task-oriented Video Segment Retrieval for Personal AI Assistant ☆23 · Sep 11, 2023 · Updated 2 years ago
- Official PyTorch Implementation of Learning Affordance Grounding from Exocentric Images, CVPR 2022 ☆75 · Nov 1, 2024 · Updated last year
- Code for MANO-GCN: "Capturing Implicit Spatial Cues for Monocular 3D Hand Reconstruction" (ICME 2021 Oral) ☆13 · Jun 24, 2021 · Updated 4 years ago
- [ICCV 2023] Understanding 3D Object Interaction from a Single Image ☆47 · Feb 29, 2024 · Updated 2 years ago
- This repository contains the Adverbs in Recipes (AIR) dataset and the code published at the CVPR 23 paper: "Learning Action Changes by Me… ☆13 · May 25, 2023 · Updated 3 years ago
- Learning interaction hotspots from egocentric video ☆52 · Dec 12, 2022 · Updated 3 years ago
- Team Doggeee's solution to the Ego4D LTA challenge @ CVPRW'23 ☆14 · Nov 4, 2023 · Updated 2 years ago
- DropIT: Dropping Intermediate Tensors for Memory-Efficient DNN Training (ICLR 2023) ☆32 · Apr 8, 2023 · Updated 3 years ago
- The implementation and supplementary material for our RA-L work "An Affordance Keypoint Detection Network for Robot Manipulation". ☆32 · Jun 15, 2021 · Updated 5 years ago
- ☆11 · Apr 23, 2025 · Updated last year
- Video + CLIP Baseline for Ego4D Long Term Action Anticipation Challenge (CVPR 2022) ☆15 · Jul 4, 2022 · Updated 3 years ago
- ☆26 · May 19, 2022 · Updated 4 years ago
- ☆21 · Dec 23, 2025 · Updated 5 months ago
- Edit and Generate Anything in 3D world! ☆13 · Apr 15, 2023 · Updated 3 years ago
- ☆75 · May 10, 2024 · Updated 2 years ago
- ☆47 · Aug 8, 2024 · Updated last year
- Code for Ditto in the House: Building Articulation Models of Indoor Scenes through Interactive Perception ☆17 · Aug 25, 2023 · Updated 2 years ago
- [ICRA 2024] Language-Conditioned Affordance-Pose Detection in 3D Point Clouds ☆53 · Jan 10, 2025 · Updated last year
- [CVPR 2022] Joint hand motion and interaction hotspots prediction from egocentric videos ☆71 · Jan 29, 2024 · Updated 2 years ago
- ☆31 · Mar 24, 2022 · Updated 4 years ago
- LOCATE: Localize and Transfer Object Parts for Weakly Supervised Affordance Grounding (CVPR 2023) ☆48 · Apr 28, 2023 · Updated 3 years ago
- Generating Structured Pseudo Labels for Noise-resistant Zero-shot Video Sentence Localization ☆16 · Jul 20, 2023 · Updated 2 years ago
- Code for the paper: F. Ragusa, G. M. Farinella, A. Furnari. StillFast: An End-to-End Approach for Short-Term Object Interaction Anticipat… ☆13 · Apr 11, 2023 · Updated 3 years ago
- The PyTorch implementation of Grounding 3D Object Affordance from 2D Interactions in Images. ☆138 · Nov 17, 2023 · Updated 2 years ago
- [ICCV 2023] Label-Efficient Online Continual Object Detection in Streaming Video ☆23 · Jan 8, 2024 · Updated 2 years ago
- ☆12 · Mar 12, 2023 · Updated 3 years ago
- Codes for our ACM MM 2019 paper: "Exploiting Temporal Relationships in Video Moment Localization with Natural Language" ☆16 · Oct 22, 2022 · Updated 3 years ago
- ☆140 · Mar 16, 2023 · Updated 3 years ago
- Code for Static and Dynamic Concepts for Self-supervised Video Representation Learning. ☆11 · Jul 28, 2022 · Updated 3 years ago
- Pre-Trained Visual Representations for Control ☆21 · May 26, 2022 · Updated 4 years ago
- [NeurIPS 2022] Egocentric Video-Language Pretraining ☆260 · May 9, 2024 · Updated 2 years ago
- The PyTorch implementation for "Video-Text Pre-training with Learned Regions" ☆43 · Jul 15, 2022 · Updated 3 years ago
- ☆15 · Jun 14, 2025 · Updated last year
- [CVPR25 Highlight] Official implementation of Fun3DU, a method for functional understanding and segmentation in 3D scenes ☆49 · Sep 30, 2025 · Updated 8 months ago
- ☆17 · Jun 15, 2022 · Updated 4 years ago
- Official repository of "TACO: Benchmarking Generalizable Bimanual Tool-ACtion-Object Understanding". ☆66 · Dec 1, 2025 · Updated 6 months ago
- ☆83 · Aug 1, 2023 · Updated 2 years ago
- [ICCV 2021] Generic Event Boundary Detection: A Benchmark for Event Segmentation ☆77 · Dec 28, 2021 · Updated 4 years ago