changhaonan/A3VLM

[CoRL2024] Official repo of `A3VLM: Actionable Articulation-Aware Vision Language Model`

☆122

Alternatives and similar repositories for A3VLM

Users who are interested in A3VLM are comparing it to the libraries listed below.

SiyuanHuang95 / ManipVQA
[IROS24 Oral] ManipVQA: Injecting Robotic Affordance and Physically Grounded Information into Multi-Modal Large Language Models
☆102 · Aug 22, 2024 · Updated last year
TianxingChen / VideoTracking-For-AxisEst
[arXiv 2024] Articulated Object Manipulation using Online Axis Estimation with SAM2-Based Tracking
☆18 · Apr 4, 2025 · Updated last year
clorislili / ManipLLM
The official codebase for ManipLLM: Embodied Multimodal Large Language Model for Object-Centric Robotic Manipulation (CVPR 2024)
☆150 · Jul 9, 2024 · Updated 2 years ago
dyson-ai / hdp
[CVPR 2024] Hierarchical Diffusion Policy for Multi-Task Robotic Manipulation
☆238 · Apr 9, 2024 · Updated 2 years ago
OpenGVLab / Instruct2Act
Instruct2Act: Mapping Multi-modality Instructions to Robotic Actions with Large Language Model
☆374 · Jun 23, 2024 · Updated 2 years ago
UT-Austin-RPL / HouseDitto
Code for Ditto in the House: Building Articulation Models of Indoor Scenes through Interactive Perception
☆17 · Aug 25, 2023 · Updated 2 years ago
Fsoft-AIC / Open-Vocabulary-Affordance-Detection-in-3D-Point-Clouds
[IROS 2023] Open-Vocabulary Affordance Detection in 3D Point Clouds
☆89 · Sep 4, 2024 · Updated last year
qiaojunyu / GAMMA-ICRA2024
☆20 · Dec 18, 2024 · Updated last year
zhouxian / act3d-chained-diffuser
A unified architecture for multimodal multi-task robotic policy learning.
☆185 · Feb 2, 2024 · Updated 2 years ago
nickgkan / 3d_diffuser_actor
Code for the paper "3D Diffuser Actor: Policy Diffusion with 3D Scene Representations"
☆392 · Aug 17, 2024 · Updated last year
NVlabs / RVT
Official Code for RVT-2 and RVT
☆409 · Feb 14, 2025 · Updated last year
yxKryptonite / RAM_code
Official implementation of RAM: Retrieval-Based Affordance Transfer for Generalizable Zero-Shot Robotic Manipulation
☆101 · Dec 30, 2024 · Updated last year
UMass-Embodied-AGI / 3D-VLA
[ICML 2024] 3D-VLA: A 3D Vision-Language-Action Generative World Model
☆629 · Oct 29, 2024 · Updated last year
LostXine / LLaRA
[ICLR'25] LLaRA: Supercharging Robot Learning Data for Vision-Language Policy
☆229 · Mar 29, 2025 · Updated last year
Dantong88 / LLARVA
☆64 · Dec 14, 2024 · Updated last year
Jianghanxiao / RoboEXP
[CoRL 2024] RoboEXP: Action-Conditioned Scene Graph via Interactive Exploration for Robotic Manipulation
☆132 · Oct 26, 2025 · Updated 9 months ago
Robot-VLAs / RoboVLMs
☆475 · Apr 14, 2026 · Updated 3 months ago
f3rm / f3rm
F3RM: Feature Fields for Robotic Manipulation. Official repo for the paper "Distilled Feature Fields Enable Few-Shot Language-Guided Mani…
☆221 · Apr 26, 2024 · Updated 2 years ago
real-stanford / reflect
[CoRL 2023] REFLECT: Summarizing Robot Experiences for Failure Explanation and Correction
☆107 · Mar 12, 2024 · Updated 2 years ago
PKU-EPIC / GAPartNet
[CVPR 2023 Highlight] GAPartNet: Cross-Category Domain-Generalizable Object Perception and Manipulation via Generalizable and Actionable …
☆163 · Oct 29, 2024 · Updated last year
embodied-generalist / embodied-generalist
[ICML 2024] LEO: An Embodied Generalist Agent in 3D World
☆487 · Apr 20, 2025 · Updated last year
gpapagiannis / miles-imitation
Template Code for the Paper: MILES: Making Imitation Learning Easy with Self-Supervision
☆19 · Nov 14, 2024 · Updated last year
OpenDriveLab / MPI
[RSS 2024] Learning Manipulation by Predicting Interaction
☆119 · Jul 2, 2025 · Updated last year
TEA-Lab / Robo-ABC
[ECCV 2024] 🎉 Official repository of "Robo-ABC: Affordance Generalization Beyond Categories via Semantic Correspondence for Robot Manipu…
☆101 · Nov 26, 2024 · Updated last year
huangwl18 / VoxPoser
VoxPoser: Composable 3D Value Maps for Robotic Manipulation with Language Models
☆826 · Feb 20, 2025 · Updated last year
bytedance / GR-MG
Official implementation of GR-MG
☆90 · Jan 12, 2025 · Updated last year
HaoyiZhu / PointCloudMatters
[NeurIPS 2024 D&B] Point Cloud Matters: Rethinking the Impact of Different Observation Spaces on Robot Learning
☆92 · Oct 14, 2024 · Updated last year
siddk / voltron-robotics
Voltron: Language-Driven Representation Learning for Robotics
☆236 · Jul 9, 2023 · Updated 3 years ago
mkt1412 / GraspGPT_public
Code implementation of GraspGPT and FoundationGrasp
☆151 · Dec 17, 2025 · Updated 7 months ago
Nicolinho / RoboVLM
☆47 · May 13, 2024 · Updated 2 years ago
r-pad / flowbot3d
FlowBot3D: Learning 3D Articulation Flow to Manipulate Articulated Objects
☆34 · Sep 18, 2023 · Updated 2 years ago
google-research / language-table
Suite of human-collected datasets and a multi-task continuous control benchmark for open vocabulary visuolinguomotor learning.
☆363 · Jul 2, 2026 · Updated 3 weeks ago
kyegomez / RT-X
PyTorch implementation of the models RT-1-X and RT-2-X from the paper: "Open X-Embodiment: Robotic Learning Datasets and RT-X Models"
☆244 · Updated this week
Large-Trajectory-Model / ATM
Official codebase for "Any-point Trajectory Modeling for Policy Learning"
☆278 · Jun 19, 2025 · Updated last year
moka-manipulation / moka
MOKA: Open-World Robotic Manipulation through Mark-based Visual Prompting (RSS 2024)
☆101 · Jul 16, 2024 · Updated 2 years ago
facebookresearch / r3m
Pre-training Reusable Representations for Robotic Manipulation Using Diverse Human Video Data
☆378 · Mar 21, 2023 · Updated 3 years ago
ayushjain1144 / ebmplanner
Code for the RSS 2023 paper "Energy-based Models are Zero-Shot Planners for Compositional Scene Rearrangement"
☆21 · Jul 4, 2023 · Updated 3 years ago
shikharbahl / vrb
☆137 · Apr 25, 2023 · Updated 3 years ago
Zheng-Liming / RoboCAS-v0
☆31 · Jun 24, 2024 · Updated 2 years ago