ZiliangMiao / Multimodal_Large_Language_Model_Research
☆35Updated 2 years ago
Alternatives and similar repositories for Multimodal_Large_Language_Model_Research
Users who are interested in Multimodal_Large_Language_Model_Research are comparing it to the repositories listed below
- ☆83Updated last year
- Code for LGX (Language Guided Exploration). We use LLMs to perform embodied robot navigation in a zero-shot manner.☆62Updated last year
- Mobile manipulation in Habitat☆88Updated 5 months ago
- GAMMA: Graspability-Aware Mobile MAnipulation Policy Learning based on Online Grasping Pose Fusion☆75Updated 7 months ago
- Code repository for DynaCon: Dynamic Robot Planner with Contextual Awareness via LLMs. This package is for ROS Noetic.☆22Updated last year
- 2023 Mobile Robot Grasping and Navigation Challenge☆21Updated 2 years ago
- ☆64Updated this week
- Official implementation of paper on Nature Machine Intelligence: "Preserving and Combining Knowledge in Robotic Lifelong Reinforcement Le…☆80Updated 2 months ago
- ☆39Updated last year
- This is the official implementation of the paper "ConRFT: A Reinforced Fine-tuning Method for VLA Models via Consistency Policy".☆115Updated 3 weeks ago
- Code for Prompt a Robot to Walk with Large Language Models https://arxiv.org/abs/2309.09969☆102Updated last year
- Code for "Unleashing Large-Scale Video Generative Pre-training for Visual Robot Manipulation"☆44Updated last year
- Enhancing LLM/VLM capability for robot task and motion planning with extra algorithm based tools.☆69Updated 8 months ago
- [IROS24 Oral] ManipVQA: Injecting Robotic Affordance and Physically Grounded Information into Multi-Modal Large Language Models☆93Updated 9 months ago
- This is the repo of "Learning to Manipulate Anywhere: A Visual Generalizable Framework For Reinforcement Learning"☆60Updated 5 months ago
- [NeurIPS 2024] PIVOT-R: Primitive-Driven Waypoint-Aware World Model for Robotic Manipulation☆38Updated 7 months ago
- ManiBox: Enhancing Spatial Grasping Generalization via Scalable Simulation Data Generation☆45Updated last month
- ☆39Updated last year
- ☆63Updated 3 months ago
- Official implementation of Matcha-agent, https://arxiv.org/abs/2303.08268☆26Updated 9 months ago
- Vision-Language Navigation Benchmark in Isaac Lab☆179Updated this week
- Official Code for "From Cognition to Precognition: A Future-Aware Framework for Social Navigation" (ICRA 2025)☆37Updated last month
- ☆78Updated this week
- A collection of papers, codes and talks of visual imitation learning/imitation learning from video for robotics.☆69Updated 2 years ago
- ☆52Updated last month
- Paper and summaries about state-of-the-art robot Target-driven Navigation task☆47Updated 3 years ago
- LLM3: Large Language Model-based Task and Motion Planning with Motion Failure Reasoning☆83Updated last year
- Codebase for paper: RoCo: Dialectic Multi-Robot Collaboration with Large Language Models☆210Updated last year
- Official code for: Observe Then Act: Asynchronous Active Vision-Action Model for Robotic Manipulation (OTA)☆18Updated 5 months ago
- ☆101Updated 6 months ago