☆202Aug 1, 2025Updated 9 months ago
Alternatives and similar repositories for villa-x
Users that are interested in villa-x are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ICCV2025☆168Dec 10, 2025Updated 5 months ago
- [ICCV2025 Oral] Latent Motion Token as the Bridging Language for Learning Robot Manipulation from Videos☆175Oct 1, 2025Updated 7 months ago
- [RSS 2025] Gripper Keypose and Object Pointflow as Interfaces for Bimanual Robotic Manipulation☆79Jul 22, 2025Updated 10 months ago
- Official code for "QueST: Self-Supervised Skill Abstractions for Continuous Control" [NeurIPS 2024]☆113Nov 21, 2024Updated last year
- ☆63Mar 3, 2026Updated 2 months ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- [NeurIPS 2025] DreamVLA: A Vision-Language-Action Model Dreamed with Comprehensive World Knowledge☆340Jan 6, 2026Updated 4 months ago
- [ICLR 2025 Oral] Seer: Predictive Inverse Dynamics Models are Scalable Learners for Robotic Manipulation☆301Jul 8, 2025Updated 10 months ago
- [ICCV 2025] RAGNet: Large-scale Reasoning-based Affordance Segmentation Benchmark towards General Grasping☆47Nov 21, 2025Updated 6 months ago
- Code for "ACG: Action Coherence Guidance for Flow-based Vision-Language-Action Models" (ICRA 2026)☆80Mar 11, 2026Updated 2 months ago
- Official PyTorch Implementation of Unified Video Action Model (RSS 2025)☆380Jul 23, 2025Updated 10 months ago
- [ICML'25] The PyTorch implementation of paper: "AdaWorld: Learning Adaptable World Models with Latent Actions".☆241Jun 17, 2025Updated 11 months ago
- [ICML 2025] OTTER: A Vision-Language-Action Model with Text-Aware Visual Feature Extraction☆116Apr 14, 2025Updated last year
- [NeurIPS 2025] Source codes for the paper "MindJourney: Test-Time Scaling with World Models for Spatial Reasoning"☆146Nov 4, 2025Updated 6 months ago
- [RSS 2025] Learning to Act Anywhere with Task-centric Latent Actions☆1,080Nov 19, 2025Updated 6 months ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- The official implementation of Mantis: A Versatile Vision-Language-Action Model with Disentangled Visual Foresight☆90Jan 16, 2026Updated 4 months ago
- Code for the ICLR 2024 spotlight paper: "Learning to Act without Actions" (introducing Latent Action Policies)☆141Jul 31, 2024Updated last year
- Handeye calibration for FR3 & Realsense with Ros2. Using Ros2 Humble, easy_handeye2, ros2_aruco.☆21Jun 4, 2025Updated 11 months ago
- A supervised learning trained reward head for ACT☆143Apr 21, 2026Updated last month
- [CoRL 2024 Outstanding Paper Award Finalist] Equivariant Diffusion Policy☆132Feb 13, 2025Updated last year
- The repository provides code for EgoMAN model and dataset creation scripts.☆31Dec 31, 2025Updated 4 months ago
- AWS World implementation for Workflow DevKit - Run durable workflows on AWS Lambda with DynamoDB, SQS, and S3☆41Oct 28, 2025Updated 7 months ago
- ☆38Dec 18, 2025Updated 5 months ago
- Official code of RDT 2☆773Feb 7, 2026Updated 3 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Being-H is BeingBeyond's family of human-centric embodied foundation models.☆964May 17, 2026Updated last week
- Evaluating and reproducing real-world robot manipulation policies (e.g., RT-1, RT-1-X, Octo) in simulation under common setups (e.g., Goo…☆1,076Dec 20, 2025Updated 5 months ago
- This is the official code repo for GLOVER and GLOVER++.☆55Aug 6, 2025Updated 9 months ago
- Accept by RSS 2026☆143May 1, 2026Updated 3 weeks ago
- Code for Point Policy: Unifying Observations and Actions with Key Points for Robot Manipulation☆90Jul 21, 2025Updated 10 months ago
- [ICLR 2025] LAPA: Latent Action Pretraining from Videos☆526Jan 22, 2025Updated last year
- A unified robotic manipulation learning framework☆22Sep 4, 2025Updated 8 months ago
- Repository to train and evaluate RoboAgent☆368Apr 2, 2024Updated 2 years ago
- HandsOnVLM: Vision-Language Models for Hand-Object Interaction Prediction☆41Sep 15, 2025Updated 8 months ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Official Algorithm Codebase for the Paper "BEHAVIOR Robot Suite: Streamlining Real-World Whole-Body Manipulation for Everyday Household A…☆170Aug 24, 2025Updated 9 months ago
- EgoDex: Learning Dexterous Manipulation from Large-Scale Egocentric Video☆266Aug 20, 2025Updated 9 months ago
- library to finetune VLAs☆57Feb 7, 2026Updated 3 months ago
- ☆15Nov 18, 2025Updated 6 months ago
- [ICML 2024] The offical Implementation of "DecisionNCE: Embodied Multimodal Representations via Implicit Preference Learning"☆82May 26, 2025Updated last year
- Prototyping mujoco simulation environments.☆11Feb 20, 2025Updated last year
- [CoRL 2024] ScissorBot: Learning Generalizable Scissor Skill for Paper Cutting via Simulation, Imitation, and Sim2Real☆15Dec 25, 2024Updated last year