☆64Dec 14, 2024Updated last year
Alternatives and similar repositories for LLARVA
Users that are interested in LLARVA are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [ICRA 2025] RACER: Rich Language-Guided Failure Recovery Policies for Imitation Learning☆44Oct 10, 2024Updated last year
- Official release for SplArt: Articulation Estimation and Part-level Reconstruction with 3D Gaussian Splatting.☆31Jun 5, 2025Updated last year
- [CVPR 2025] RoboGround: Robotic Manipulation with Grounded Vision-Language Priors☆47May 25, 2025Updated last year
- [RSS 2024] Code for "Multimodal Diffusion Transformer: Learning Versatile Behavior from Multimodal Goals" for CALVIN experiments with pre…☆167Oct 16, 2024Updated last year
- [ICLR'25] LLaRA: Supercharging Robot Learning Data for Vision-Language Policy☆229Mar 29, 2025Updated last year
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- [CoRL 25] Code for FLOWER VLA for finetuning FLOWER on CALVIN and all LIBERO environments☆90Sep 22, 2025Updated 8 months ago
- Embodied Chain of Thought: A robotic policy that reason to solve the task.☆398Apr 5, 2025Updated last year
- Evaluating and reproducing real-world robot manipulation policies (e.g., RT-1, RT-1-X, Octo) in simulation under common setups (e.g., Goo…☆1,086Dec 20, 2025Updated 5 months ago
- ☆33Sep 25, 2024Updated last year
- [ICRA 2025] A Parameter-Efficient Tuning Framework for Language-guided Object Grounding and Robot Grasping☆12Feb 7, 2025Updated last year
- [CoRL2024] Official repo of `A3VLM: Actionable Articulation-Aware Vision Language Model`☆122Oct 7, 2024Updated last year
- Official Implementation of ARM4R ICML 2025☆54Sep 18, 2025Updated 8 months ago
- Code for "Unleashing Large-Scale Video Generative Pre-training for Visual Robot Manipulation"☆312Apr 22, 2024Updated 2 years ago
- [AAAI26 oral] CronusVLA: Towards Efficient and Robust Manipulation via Multi-Frame Vision-Language-Action Modeling☆105Jan 11, 2026Updated 5 months ago
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- A Vision-Language Model for Spatial Affordance Prediction in Robotics☆223Jul 17, 2025Updated 10 months ago
- ☆31Jun 24, 2024Updated last year
- ☆37Dec 13, 2023Updated 2 years ago
- Code for the paper "Trust the PRoC3S: Solving Long-Horizon Robotics Problems with LLMs and Constraint Satisfaction" presented at CoRL 202…☆32Nov 18, 2024Updated last year
- This project aims to reproduce Pi0, a general-purpose robot foundation model developed by Physical Intelligence.☆19Feb 23, 2025Updated last year
- Autoregressive Policy for Robot Learning (RA-L 2025)☆151Mar 25, 2025Updated last year
- ☆59Jul 4, 2025Updated 11 months ago
- GRAPE: Guided-Reinforced Vision-Language-Action Preference Optimization☆161Apr 6, 2025Updated last year
- Adapter-X: A Novel General Parameter-Efficient Fine-Tuning Framework for Vision☆11Jul 22, 2024Updated last year
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- (Incomplete version) This is an implementation of affordancellm.☆19Oct 17, 2024Updated last year
- ☆473Apr 14, 2026Updated 2 months ago
- MOKA: Open-World Robotic Manipulation through Mark-based Visual Prompting (RSS 2024)☆101Jul 16, 2024Updated last year
- Code for the paper "3D Diffuser Actor: Policy Diffusion with 3D Scene Representations"☆391Aug 17, 2024Updated last year
- [NeurIPS 2024] PIVOT-R: Primitive-Driven Waypoint-Aware World Model for Robotic Manipulation☆49Oct 31, 2024Updated last year
- ☆69Feb 17, 2025Updated last year
- ☆77Oct 18, 2024Updated last year
- Official implementation of RAM: Retrieval-Based Affordance Transfer for Generalizable Zero-Shot Robotic Manipulation☆101Dec 30, 2024Updated last year
- ☆95Aug 29, 2025Updated 9 months ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- Heterogeneous Pre-trained Transformer (HPT) as Scalable Policy Learner.☆538Dec 6, 2024Updated last year
- Specialized encoders for robot manipulation. Sparsh-Skin An encoder tailored for magnetic tactile sensors to understand interactions from…☆39Aug 20, 2025Updated 9 months ago
- [IROS24 Oral]ManipVQA: Injecting Robotic Affordance and Physically Grounded Information into Multi-Modal Large Language Models☆102Aug 22, 2024Updated last year
- ☆283Aug 26, 2024Updated last year
- Official implementation of "Towards Generalizable Vision-Language Robotic Manipulation: A Benchmark and LLM-guided 3D Policy."☆130May 26, 2026Updated 2 weeks ago
- [ICML 2024] 3D-VLA: A 3D Vision-Language-Action Generative World Model☆626Oct 29, 2024Updated last year
- ☆22Apr 17, 2026Updated last month