Vision-Language-Action Optimization with Trajectory Ensemble Voting
☆25Feb 18, 2026Updated last week
Alternatives and similar repositories for vote
Users that are interested in vote are comparing it to the libraries listed below
Sorting:
- Initial commit☆12Aug 14, 2023Updated 2 years ago
- [ECCV 2024] Language-Driven 6-DoF Grasp Detection Using Negative Prompt Guidance☆40Sep 7, 2024Updated last year
- ☆14Feb 13, 2025Updated last year
- [ICRA 2024] SG-Bot: Object Rearrangement via Coarse-to-Fine Robotic Imagination on Scene Graphs☆20Jun 1, 2024Updated last year
- Official implementation of Points2Plans: From Point Clouds to Long-Horizon Plans with Composable Relational Dynamics☆41Mar 11, 2025Updated 11 months ago
- ☆19Feb 6, 2025Updated last year
- [NeurIPS 2024] Key-Grid: Unsupervised 3D Keypoints Detection using Grid Heatmap Features☆24Mar 20, 2025Updated 11 months ago
- Code for the paper Robot Data Curation with Mutual Information Estimators☆29Apr 22, 2025Updated 10 months ago
- Repository to extract obj files in world frame from a URDF description☆21Jan 7, 2023Updated 3 years ago
- Official code for "One-Shot Manipulation Strategy Learning by Making Contact Analogies".☆26Feb 7, 2025Updated last year
- CVPR2025 | TASTE-Rob: Advancing Video Generation of Task-Oriented Hand-Object Interaction for Generalizable Robotic Manipulation☆34Jan 29, 2026Updated last month
- This is a framework for evaluating reasoning in foundational Video Models.☆57Updated this week
- Code release for "RoboPrompt"☆27Sep 30, 2025Updated 5 months ago
- ☆68Jan 8, 2025Updated last year
- [CoRL 2024] OrbitGrasp: SE(3)-Equivariant Grasp Learning☆28Dec 9, 2024Updated last year
- PRIN/SPRIN: On Extracting Point-wise Rotation Invariant Features☆30Mar 15, 2022Updated 3 years ago
- Code for Equivariant Transporter Network☆23Apr 17, 2023Updated 2 years ago
- Fast-in-Slow: A Dual-System Foundation Model Unifying Fast Manipulation within Slow Reasoning☆143Aug 1, 2025Updated 7 months ago
- [CVPR 2022] Neural Shape Mating: Self-Supervised Object Assembly with Adversarial Shape Priors☆31Jun 20, 2022Updated 3 years ago
- ☆75Jan 8, 2025Updated last year
- Official Code for SGRv2 and SGR.☆33May 20, 2025Updated 9 months ago
- Code for paper "Diff-Control: A stateful Diffusion-based Policy for Imitation Learning" (Liu et al., IROS 2024)☆74May 28, 2025Updated 9 months ago
- Official PyTorch Implementation of Learning Affordance Grounding from Exocentric Images, CVPR 2022☆72Nov 1, 2024Updated last year
- Augment robotics demonstration datasets with different robots and viewpoints☆40Feb 27, 2025Updated last year
- Efficiently apply modification functions to RLDS/TFDS datasets.☆41Jun 5, 2024Updated last year
- ☆56Aug 7, 2025Updated 6 months ago
- [NeurIPS 2025] VIKI‑R: Coordinating Embodied Multi-Agent Cooperation via Reinforcement Learning☆74Dec 14, 2025Updated 2 months ago
- Repo for Bring Your Own Vision-Language-Action (VLA) model, arxiv 2024☆36Jan 22, 2025Updated last year
- [IROS 2023] Open-Vocabulary Affordance Detection in 3d Point Clouds☆82Sep 4, 2024Updated last year
- [NeurIPS 2024] Official implementation of "NeuralClothSim: Neural Deformation Fields Meet the Thin Shell Theory"☆41Oct 29, 2024Updated last year
- ☆19Sep 29, 2025Updated 5 months ago
- [CVPR 2024] G3DR: Generative 3D Reconstruction in ImageNet☆38Jun 27, 2024Updated last year
- [CVPR 2023] Segmenting objects in videos without human annotations 🤯: Official implementation for Bootstrapping Objectness from Videos b…☆38Nov 23, 2023Updated 2 years ago
- ImOV3D: Learning Open Vocabulary Point Clouds 3D Object Detection from Only 2D Images (NeurIPS2024)☆88Feb 20, 2026Updated last week
- LOCATE: Localize and Transfer Object Parts for Weakly Supervised Affordance Grounding (CVPR 2023)☆46Apr 28, 2023Updated 2 years ago
- [ICRA 2024] Language-Conditioned Affordance-Pose Detection in 3D Point Clouds☆49Jan 10, 2025Updated last year
- [AAAI26 oral] CronusVLA: Towards Efficient and Robust Manipulation via Multi-Frame Vision-Language-Action Modeling☆88Jan 11, 2026Updated last month
- MOKA: Open-World Robotic Manipulation through Mark-based Visual Prompting (RSS 2024)☆94Jul 16, 2024Updated last year
- Our repo containes a Efficient RGB-D features extractor to category-level and instance-level 6D pose estimation.☆14Oct 29, 2025Updated 4 months ago