gtziafas / OWG
OWG: Towards Open-World Grasping with Large Vision-Language Models
☆17Updated 3 months ago
Alternatives and similar repositories for OWG:
Users that are interested in OWG are comparing it to the libraries listed below
- Emma-X: An Embodied Multimodal Action Model with Grounded Chain of Thought and Look-ahead Spatial Reasoning☆42Updated 3 weeks ago
- Code release for SceneReplica paper.☆20Updated 3 months ago
- Human Demo Videos to Robot Action Plans☆43Updated 3 months ago
- [IROS24 Oral]ManipVQA: Injecting Robotic Affordance and Physically Grounded Information into Multi-Modal Large Language Models☆84Updated 6 months ago
- Repo for Bring Your Own Vision-Language-Action (VLA) model, arxiv 2024☆27Updated 3 weeks ago
- Manipulate-Anything: Automating Real-World Robots using Vision-Language Models [CoRL 2024]☆15Updated 2 months ago
- ☆41Updated 3 months ago
- Code for "Unleashing Large-Scale Video Generative Pre-training for Visual Robot Manipulation"☆43Updated 10 months ago
- ☆23Updated 7 months ago
- Sim-Grasp offers a simulation framework to generate synthetic data and train models for robotic two finger grasping in cluttered environm…☆25Updated 9 months ago
- ☆60Updated this week
- [ICRA 2024] Dream2Real: Zero-Shot 3D Object Rearrangement with Vision-Language Models☆66Updated last year
- ☆29Updated last year
- IROS 2023 "VL-Grasp: a 6-Dof Interactive Grasp Policy for Language-Oriented Objects in Cluttered Indoor Scenes"☆34Updated 10 months ago
- Official code of paper "DeeR-VLA: Dynamic Inference of Multimodal Large Language Models for Efficient Robot Execution"☆66Updated last week
- Given an RGBD image and a text prompt, ForceSight produces visual-force goals for a robot, enabling mobile manipulation in unseen environ…☆17Updated last year
- [CoRL2024] Official repo of `A3VLM: Actionable Articulation-Aware Vision Language Model`☆104Updated 4 months ago
- Official Code Repo for GENIMA☆65Updated 4 months ago
- [ICRA2023] Grounding Language with Visual Affordances over Unstructured Data☆39Updated last year
- ☆65Updated 4 months ago
- ☆35Updated 2 months ago
- Implementation of Language-Conditioned Path Planning (Amber Xie, Youngwoon Lee, Pieter Abbeel, Stephen James)☆22Updated last year
- Official implementation of RAM: Retrieval-Based Affordance Transfer for Generalizable Zero-Shot Robotic Manipulation☆75Updated last month
- This is the official repo for [CoRL 2024] Contrastive Imitation Learning for Language-guided Multi-Task Robotic Manipulation☆21Updated 3 months ago
- Code release for paper "Autonomous Improvement of Instruction Following Skills via Foundation Models" | CoRL 2024☆62Updated last month
- This repo contains the official implementation of CoRL2023 paper "Language-guided Robot Grasping: CLIP-based Referring Grasp Synthesis in…☆12Updated 9 months ago
- [ICLR 25] Code for "Efficient Diffusion Transformer Policies with Mixture of Expert Denoisers for Multitask Learning"☆34Updated this week
- ☆64Updated this week