This is the completion of google's rt-1 project code and can run directly.
☆37Aug 16, 2024Updated last year
Alternatives and similar repositories for RT-1
Users that are interested in RT-1 are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆35Aug 27, 2025Updated 7 months ago
- A PyTorch re-implementation of the RT-1 (Robotics Transformer)☆50Oct 18, 2023Updated 2 years ago
- Code for "Unleashing Large-Scale Video Generative Pre-training for Visual Robot Manipulation"☆45Apr 19, 2024Updated last year
- ☆1,695Jan 31, 2024Updated 2 years ago
- ☆64Feb 20, 2025Updated last year
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- VR Hand Tracking with Meta Quest 3s☆29Aug 7, 2025Updated 8 months ago
- Suite of human-collected datasets and a multi-task continuous control benchmark for open vocabulary visuolinguomotor learning.☆355Mar 30, 2026Updated last week
- A PyTorch re-implementation of the RT-1 (Robotics Transformer) with training and testing pipeline☆59Mar 21, 2024Updated 2 years ago
- Pytorch implementation of the models RT-1-X and RT-2-X from the paper: "Open X-Embodiment: Robotic Learning Datasets and RT-X Models"☆239Apr 2, 2026Updated last week
- simplify cartographer interface, support using ros or not(cartographer 算法的极度简化接口实现,能自行修改选择是否使用ros)☆17Nov 8, 2021Updated 4 years ago
- [arXiv 2025] GMR: General Motion Retargeting. Retarget human motions into diverse humanoid robots in real time on CPU. Retargeter for TWI…☆69Oct 10, 2025Updated 5 months ago
- ☆17Sep 25, 2024Updated last year
- [NeurIPS 2023] The official implementation of SOC: Semantic-Assisted Object Cluster for Referring Video Object Segmentation☆33Mar 16, 2024Updated 2 years ago
- [RA-L25/ICRA26] HybridTrack: A Hybrid Approach for Robust Multi-Object Tracking☆40Dec 17, 2025Updated 3 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- SWN-GCN: Graph-based Rotation Equivariant Network (BMVC 2021)☆10Oct 18, 2021Updated 4 years ago
- ☆11Nov 11, 2022Updated 3 years ago
- [CVPR 2025] Official implementation of SSP: High Temporal Consistency through Semantic Similarity Propagation in Semi-Supervised Video Se…☆15Jun 26, 2025Updated 9 months ago
- 2023年哈工大软件工程834考研资料☆27Mar 30, 2023Updated 3 years ago
- This repo hosts the code for the Fast Trainable Projection (FTP) project.☆12Nov 16, 2023Updated 2 years ago
- ☆24Jan 3, 2025Updated last year
- ☆19Aug 21, 2024Updated last year
- ☆15Oct 3, 2025Updated 6 months ago
- team Doggeee's solution to Ego4D LTA challenge@CVPRW23'☆14Nov 4, 2023Updated 2 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- This repository contains the Adverbs in Recipes (AIR) dataset and the code published at the CVPR 23 paper: "Learning Action Changes by Me…☆13May 25, 2023Updated 2 years ago
- [ICRA2023] Grounding Language with Visual Affordances over Unstructured Data☆47Oct 29, 2023Updated 2 years ago
- ☆47Aug 8, 2024Updated last year
- 2D detection on KITTI dataset. see configs/kitti☆16Jul 7, 2021Updated 4 years ago
- ☆33Sep 25, 2024Updated last year
- Video + CLIP Baseline for Ego4D Long Term Action Anticipation Challenge (CVPR 2022)☆15Jul 4, 2022Updated 3 years ago
- Multi-Modal Tree of thoughts for DALLE-3 like auto self improvement☆17Nov 11, 2024Updated last year
- ☆31Jun 24, 2024Updated last year
- Source Code for View Consistent Purification for Accurate Cross-View Localization, ICCV 2023☆19Nov 24, 2024Updated last year
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Official implementation of "Interpreting and Controlling Vision Foundation Models via Text Explanations"☆14May 29, 2024Updated last year
- [CoRL 2023] This repository contains data generation and training code for Scaling Up & Distilling Down☆407Aug 12, 2024Updated last year
- Implementation of "PaLM-E: An Embodied Multimodal Language Model"☆334Jan 29, 2024Updated 2 years ago
- ☆13Jun 28, 2021Updated 4 years ago
- A Gaze Tracker using a Hourglass Convolutional Neural Network☆10Dec 4, 2018Updated 7 years ago
- A unified architecture for multimodal multi-task robotic policy learning.☆178Feb 2, 2024Updated 2 years ago
- ☆55Apr 1, 2022Updated 4 years ago