H-Freax / Awesome-Video-Robotic-Papers
This repository compiles a list of papers related to the application of video technology in the field of robotics! Star⭐ the repo and follow me if you like what you see🤩.
☆114Updated last month
Related projects: ⓘ
- Code for MultiPLY: A Multisensory Object-Centric Embodied Large Language Model in 3D World☆115Updated 6 months ago
- Theia: Distilling Diverse Vision Foundation Models for Robot Learning☆141Updated 3 weeks ago
- Code for subgoal synthesis via image editing☆99Updated 10 months ago
- [ICML 2024] 3D-VLA: A 3D Vision-Language-Action Generative World Model☆314Updated 2 months ago
- Code for the paper "3D Diffuser Actor: Policy Diffusion with 3D Scene Representations"☆196Updated last month
- LLaRA: Large Language and Robotics Assistant☆137Updated 2 weeks ago
- Embodied Chain of Thought: A robotic policy that reason to solve the task.☆53Updated 3 weeks ago
- Official repository of Learning to Act from Actionless Videos through Dense Correspondences.☆159Updated 4 months ago
- Code for "Unleashing Large-Scale Video Generative Pre-training for Visual Robot Manipulation"☆95Updated 4 months ago
- The official codebase for ManipLLM: Embodied Multimodal Large Language Model for Object-Centric Robotic Manipulation(cvpr 2024)☆69Updated 2 months ago
- Official codebase for "Any-point Trajectory Modeling for Policy Learning"☆145Updated last month
- [ICCV 2023] Official code repository for ARNOLD benchmark☆134Updated 5 months ago
- Evaluating and reproducing real-world robot manipulation policies (e.g., RT-1, RT-1-X, Octo) in simulation under common setups (e.g., Goo…☆251Updated last week
- The repo of paper `RoboMamba: Multimodal State Space Model for Efficient Robot Reasoning and Manipulation`☆43Updated 3 months ago
- Pytorch implementation of the models RT-1-X and RT-2-X from the paper: "Open X-Embodiment: Robotic Learning Datasets and RT-X Models"☆156Updated this week
- ☆132Updated 3 weeks ago
- [RSS 2024] 3D Diffusion Policy: Generalizable Visuomotor Policy Learning via Simple 3D Representations☆375Updated last week
- Instruct2Act: Mapping Multi-modality Instructions to Robotic Actions with Large Language Model☆323Updated 2 months ago
- [CoRL2024] Official repo of `A3VLM: Actionable Articulation-Aware Vision Language Model`☆63Updated this week
- [arXiv 2023] Embodied Task Planning with Large Language Models☆148Updated last year
- The official repo for "SpatialBot: Precise Spatial Understanding with Vision Language Models.☆124Updated this week
- ☆63Updated last month
- A list of awesome and popular robot learning environments☆87Updated last month
- A Survey of Embodied Learning for Object-Centric Robotic Manipulation☆87Updated 2 weeks ago
- F3RM: Feature Fields for Robotic Manipulation. Official repo for the paper "Distilled Feature Fields Enable Few-Shot Language-Guided Mani…☆180Updated 4 months ago
- [ICML 2024] Official code repository for 3D embodied generalist agent LEO☆330Updated last month
- "MimicPlay: Long-Horizon Imitation Learning by Watching Human Play" code repository☆216Updated 4 months ago
- GenSim: Generating Robotic Simulation Tasks via Large Language Models☆278Updated 5 months ago
- Distributed Robot Interaction Dataset.☆108Updated last month
- DROID Policy Learning and Evaluation☆133Updated 3 months ago