Affordance Grounding from Demonstration Video to Target Image (CVPR 2023)
☆46Jul 26, 2024Updated last year
Alternatives and similar repositories for afformer
Users that are interested in afformer are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [ECCV 2022] AssistQ: Affordance-centric Question-driven Task Completion for Egocentric Assistant☆23Jan 30, 2026Updated 3 months ago
- [Findings of EMNLP 2022] AssistSR: Task-oriented Video Segment Retrieval for Personal AI Assistant☆23Sep 11, 2023Updated 2 years ago
- Official PyTorch Implementation of Learning Affordance Grounding from Exocentric Images, CVPR 2022☆75Nov 1, 2024Updated last year
- Code for MANO-GCN —— "Capturing Implicit Spatial Cues for Monocular 3D Hand Reconstruction" (ICME2021 Oral)☆13Jun 24, 2021Updated 4 years ago
- [ICCV 2023] Understanding 3D Object Interaction from a Single Image☆47Feb 29, 2024Updated 2 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- This repository contains the Adverbs in Recipes (AIR) dataset and the code published at the CVPR 23 paper: "Learning Action Changes by Me…☆13May 25, 2023Updated 2 years ago
- [AAAI2023] Symbolic Replay: Scene Graph as Prompt for Continual Learning on VQA Task (Oral)☆42Mar 23, 2024Updated 2 years ago
- Learning interaction hotspots from egocentric video☆52Dec 12, 2022Updated 3 years ago
- team Doggeee's solution to Ego4D LTA challenge@CVPRW23'☆14Nov 4, 2023Updated 2 years ago
- The implementation and supplementary material for our RA-L work "An Affordance Keypoint Detection Network for Robot Manipulation".☆32Jun 15, 2021Updated 4 years ago
- ☆11Apr 23, 2025Updated last year
- Video + CLIP Baseline for Ego4D Long Term Action Anticipation Challenge (CVPR 2022)☆15Jul 4, 2022Updated 3 years ago
- ☆26May 19, 2022Updated 3 years ago
- ☆21Dec 23, 2025Updated 4 months ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Edit and Generate Anything in 3D world!☆14Apr 15, 2023Updated 3 years ago
- Online Product Reviews for Affordances☆24Dec 12, 2018Updated 7 years ago
- ☆75May 10, 2024Updated last year
- ☆47Aug 8, 2024Updated last year
- Code for Ditto in the House: Building Articulation Models of Indoor Scenes through Interactive Perception☆17Aug 25, 2023Updated 2 years ago
- [ICRA 2024] Language-Conditioned Affordance-Pose Detection in 3D Point Clouds☆52Jan 10, 2025Updated last year
- [CVPR 2022] Joint hand motion and interaction hotspots prediction from egocentric videos☆72Jan 29, 2024Updated 2 years ago
- ☆31Mar 24, 2022Updated 4 years ago
- LOCATE: Localize and Transfer Object Parts for Weakly Supervised Affordance Grounding (CVPR 2023)☆48Apr 28, 2023Updated 3 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Generating Structured Pseudo Labels for Noise-resistant Zero-shot Video Sentence Localization☆16Jul 20, 2023Updated 2 years ago
- Code for the paper: F. Ragusa, G. M. Farinella, A. Furnari. StillFast: An End-to-End Approach for Short-Term Object Interaction Anticipat…☆13Apr 11, 2023Updated 3 years ago
- [ICCV 2023] Label-Efficient Online Continual Object Detection in Streaming Video☆23Jan 8, 2024Updated 2 years ago
- ☆12Mar 12, 2023Updated 3 years ago
- Codes for our ACM MM 2019 paper: "Exploiting Temporal Relationships in Video Moment Localization with Natural Language"☆16Oct 22, 2022Updated 3 years ago
- ☆140Mar 16, 2023Updated 3 years ago
- Code for Static and Dynamic Concepts for Self-supervised Video Representation Learning.☆11Jul 28, 2022Updated 3 years ago
- Pre-Trained Visual Representations for Control☆21May 26, 2022Updated 3 years ago
- [NeurIPS 2022] Egocentric Video-Language Pretraining☆260May 9, 2024Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- The Pytorch implementation for "Video-Text Pre-training with Learned Regions"☆43Jul 15, 2022Updated 3 years ago
- ☆15Jun 14, 2025Updated 10 months ago
- [CVPR25 Highlight] Official implementation of Fun3DU, a method for functional understanding and segmentation in 3D scenes☆49Sep 30, 2025Updated 7 months ago
- ☆17Jun 15, 2022Updated 3 years ago
- Official repository of "TACO: Benchmarking Generalizable Bimanual Tool-ACtion-Object Understanding".☆65Dec 1, 2025Updated 5 months ago
- [ICCV2021] Generic Event Boundary Detection: A Benchmark for Event Segmentation☆76Dec 28, 2021Updated 4 years ago
- [CVPR-2025] GREAT: Geometry-Intention Collaborative Inference for Open-Vocabulary 3D Object Affordance Grounding☆43Aug 15, 2025Updated 8 months ago