UMass-Foundation-Model / 3D-VLA
[ICML 2024] 3D-VLA: A 3D Vision-Language-Action Generative World Model
☆349Updated 3 weeks ago
Related projects ⓘ
Alternatives and complementary repositories for 3D-VLA
- [ICML 2024] Official code repository for 3D embodied generalist agent LEO☆365Updated last month
- Code for "Unleashing Large-Scale Video Generative Pre-training for Visual Robot Manipulation"☆184Updated 6 months ago
- Official codebase for "Any-point Trajectory Modeling for Policy Learning"☆178Updated 3 months ago
- Evaluating and reproducing real-world robot manipulation policies (e.g., RT-1, RT-1-X, Octo) in simulation under common setups (e.g., Goo…☆326Updated 3 weeks ago
- Code for the paper "3D Diffuser Actor: Policy Diffusion with 3D Scene Representations"☆231Updated 3 months ago
- Code for RoboFlamingo☆311Updated 6 months ago
- [RSS 2024] 3D Diffusion Policy: Generalizable Visuomotor Policy Learning via Simple 3D Representations☆513Updated this week
- Code for MultiPLY: A Multisensory Object-Centric Embodied Large Language Model in 3D World☆122Updated 3 weeks ago
- Official repository of Learning to Act from Actionless Videos through Dense Correspondences.☆173Updated 6 months ago
- Official Code for RVT-2 and RVT☆289Updated 4 months ago
- [ECCV 2024] ManiGaussian: Dynamic Gaussian Splatting for Multi-task Robotic Manipulation☆176Updated 2 weeks ago
- The official repo for "SpatialBot: Precise Spatial Understanding with Vision Language Models.☆163Updated last month
- A curated list of 3D Vision papers relating to Robotics domain in the era of large models i.e. LLMs/VLMs, inspired by awesome-computer-vi…☆553Updated 2 weeks ago
- Instruct2Act: Mapping Multi-modality Instructions to Robotic Actions with Large Language Model☆334Updated 4 months ago
- Theia: Distilling Diverse Vision Foundation Models for Robot Learning☆179Updated last month
- This repository compiles a list of papers related to the application of video technology in the field of robotics! Star⭐ the repo and fol…☆123Updated 3 months ago
- RDT-1B: a Diffusion Foundation Model for Bimanual Manipulation☆472Updated last week
- Embodied Chain of Thought: A robotic policy that reason to solve the task.☆93Updated 2 months ago
- Generating Robotic Simulation Tasks via Large Language Models☆295Updated 7 months ago
- A generative world for general-purpose robotics & embodied AI learning.☆372Updated 8 months ago
- RoboCasa: Large-Scale Simulation of Everyday Tasks for Generalist Robots☆568Updated this week
- F3RM: Feature Fields for Robotic Manipulation. Official repo for the paper "Distilled Feature Fields Enable Few-Shot Language-Guided Mani…☆187Updated 6 months ago
- Official implementation for paper "EquiBot: SIM(3)-Equivariant Diffusion Policy for Generalizable and Data Efficient Learning".☆120Updated 4 months ago
- ☆187Updated 2 months ago
- A universal summary of current robotics simulators☆305Updated this week
- "MimicPlay: Long-Horizon Imitation Learning by Watching Human Play" code repository☆235Updated 6 months ago
- Code for subgoal synthesis via image editing☆113Updated last year
- [CoRL2024] Official repo of `A3VLM: Actionable Articulation-Aware Vision Language Model`☆89Updated last month
- [ECCV 2024] ShapeLLM: Universal 3D Object Understanding for Embodied Interaction☆144Updated last month
- A generative and self-guided robotic agent that endlessly propose and master new skills.☆597Updated 5 months ago