Reagan1311 / OOALLinks

One-Shot Open Affordance Learning with Foundation Models (CVPR 2024)

☆45

Alternatives and similar repositories for OOAL

Users that are interested in OOAL are comparing it to the libraries listed below

Sorting:

Reagan1311 / LOCATE
LOCATE: Localize and Transfer Object Parts for Weakly Supervised Affordance Grounding (CVPR 2023)
☆45Updated 2 years ago
Hoyyyaard / 3DFlowAction
☆41Updated 5 months ago
Fsoft-AIC / Open-Vocabulary-Affordance-Detection-in-3D-Point-Clouds
[IROS 2023] Open-Vocabulary Affordance Detection in 3d Point Clouds
☆81Updated last year
Kami-code / HandsOnVLM-release
HandsOnVLM: Vision-Language Models for Hand-Object Interaction Prediction
☆42Updated 2 months ago
ZzZZCHS / RoboGround
Code & data for "RoboGround: Robotic Manipulation with Grounded Vision-Language Priors" (CVPR 2025)
☆32Updated 6 months ago
lhc1224 / Cross-View-AG
Official PyTorch Implementation of Learning Affordance Grounding from Exocentric Images, CVPR 2022
☆70Updated last year
changhaonan / A3VLM
[CoRL2024] Official repo of `A3VLM: Actionable Articulation-Aware Vision Language Model`
☆121Updated last year
stevenlsw / hoi-forecast
[CVPR 2022] Joint hand motion and interaction hotspots prediction from egocentric videos
☆71Updated last year
Tengbo-Yu / AnyBimanual
[ICCV2025] AnyBimanual: Transfering Unimanual Policy for General Bimanual Manipulation
☆93Updated 5 months ago
michaelyuancb / general_flow
Repository for "General Flow as Foundation Affordance for Scalable Robot Learning"
☆67Updated 11 months ago
cfeng16 / UniTouch
[CVPR 2024] Binding Touch to Everything: Learning Unified Multimodal Tactile Representations
☆72Updated 2 weeks ago
yl3800 / LASO
☆40Updated last year
Dantong88 / LLARVA
☆60Updated 11 months ago
TeleeMa / GLOVER
This is the official code repo for GLOVER and GLOVER++.
☆40Updated 4 months ago
wudongming97 / AffordanceNet
[ICCV 2025] RAGNet: Large-scale Reasoning-based Affordance Segmentation Benchmark towards General Grasping
☆30Updated 2 weeks ago
HaoyiZhu / PointCloudMatters
[NeurIPS 2024 D&B] Point Cloud Matters: Rethinking the Impact of Different Observation Spaces on Robot Learning
☆89Updated last year
homangab / Track-2-Act
code for the paper Predicting Point Tracks from Internet Videos enables Diverse Zero-Shot Manipulation
☆99Updated last year
robomonkey-vla / RoboMonkey
☆23Updated last month
Max-Fu / otter
[ICML 2025] OTTER: A Vision-Language-Action Model with Text-Aware Visual Feature Extraction
☆111Updated 7 months ago
TEA-Lab / Robo-ABC
[ECCV 2024] 🎉 Official repository of "Robo-ABC: Affordance Generalization Beyond Categories via Semantic Correspondence for Robot Manipu…
☆93Updated last year
vlc-robot / robot_sugar
Official implementation of "SUGAR: Pre-training 3D Visual Representations for Robotics" (CVPR'24).
☆44Updated 5 months ago
cvlab-columbia / dreamitate
Dreamitate: Real-World Visuomotor Policy Learning via Video Generation (CoRL 2024)
☆57Updated 6 months ago
Koorye / Inspire
Official implemetation of the paper "InSpire: Vision-Language-Action Models with Intrinsic Spatial Reasoning"
☆46Updated last week
liyi14 / HAMSTER_beta
☆51Updated 7 months ago
Selen-Suyue / DensePolicy
[ICCV 2025] Dense Policy: Bidirectional Autoregressive Learning of Actions #DSP
☆71Updated last month
TianxingChen / G3Flow
[CVPR 25] G3Flow: Generative 3D Semantic Flow for Pose-aware and Generalizable Object Manipulation
☆90Updated 6 months ago
vlc-robot / polarnet
[CoRL2023] Official PyTorch implementation of PolarNet: 3D Point Clouds for Language-Guided Robotic Manipulation
☆41Updated last year
MarionLepert / phantom
☆55Updated 3 months ago
jmwang0117 / Video4Robot
List of papers on video-centric robot learning
☆22Updated last year
jiaming-zhou / X-ICM
official repo for AGNOSTOS, a cross-task manipulation benchmark, and X-ICM method, a cross-task in-context manipulation (VLA) method
☆51Updated last week