☆51Oct 9, 2024Updated last year
Alternatives and similar repositories for SAM-E
Users that are interested in SAM-E are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Official Code for SGRv2 and SGR.☆33May 20, 2025Updated 10 months ago
- [arXiv 2024] Articulated Object Manipulation using Online Axis Estimation with SAM2-Based Tracking☆18Apr 4, 2025Updated 11 months ago
- ACM MM 2022 - PPMN: Pixel-Phrase Matching Network for One-Stage Panoptic Narrative Grounding☆11Aug 12, 2022Updated 3 years ago
- Official Code Repo for GENIMA☆77Oct 29, 2025Updated 5 months ago
- [RSS 2024] Learning Manipulation by Predicting Interaction☆120Jul 2, 2025Updated 8 months ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- [CoRL 2025] Pretraining code for FLOWER VLA on OXE☆37Sep 22, 2025Updated 6 months ago
- A unified architecture for multimodal multi-task robotic policy learning.☆178Feb 2, 2024Updated 2 years ago
- ☆19Jul 7, 2024Updated last year
- Official Code for RVT-2 and RVT☆401Feb 14, 2025Updated last year
- [ACL 2025 Findings] Text2World: Benchmarking Large Language Models for Symbolic World Model Generation☆28Feb 25, 2025Updated last year
- Official Code Repository for the POLICEd-RL Paper: https://www.roboticsproceedings.org/rss20/p104.html☆13Mar 4, 2025Updated last year
- ☆32Mar 10, 2024Updated 2 years ago
- [CVPR 25] G3Flow: Generative 3D Semantic Flow for Pose-aware and Generalizable Object Manipulation☆93Jun 6, 2025Updated 9 months ago
- Implementation of PatchAIL in the ICLR 2023 paper <Visual Imitation with Patch Rewards>☆14Feb 15, 2023Updated 3 years ago
- NordVPN Threat Protection Pro™ • AdTake your cybersecurity to the next level. Block phishing, malware, trackers, and ads. Lightweight app that works with all browsers.
- [CVPR 2025]Lift3D Foundation Policy: Lifting 2D Large-Scale Pretrained Models for Robust 3D Robotic Manipulation☆181Jun 20, 2025Updated 9 months ago
- F3RM: Feature Fields for Robotic Manipulation. Official repo for the paper "Distilled Feature Fields Enable Few-Shot Language-Guided Mani…☆219Apr 26, 2024Updated last year
- A Benchmark for Evaluating Generalization for Robotic Manipulation☆146Mar 3, 2025Updated last year
- Learning Active Force-torque based Policy for Sub-mm Localization of Unseen Holes☆21Jul 25, 2023Updated 2 years ago
- [CVPR 2024] Hierarchical Diffusion Policy for Multi-Task Robotic Manipulation☆230Apr 9, 2024Updated last year
- [CoRL2023] Official PyTorch implementation of PolarNet: 3D Point Clouds for Language-Guided Robotic Manipulation☆42Jun 4, 2024Updated last year
- ☆12Jul 8, 2024Updated last year
- Official implementation of "Towards Generalizable Vision-Language Robotic Manipulation: A Benchmark and LLM-guided 3D Policy."☆125Oct 23, 2025Updated 5 months ago
- Code for the paper "3D FlowMatch Actor: Unified 3D Policy for Single- and Dual-Arm Manipulation"☆33Aug 18, 2025Updated 7 months ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- A Vision-Language Model for Spatial Affordance Prediction in Robotics☆215Jul 17, 2025Updated 8 months ago
- QuadWBG: Generalizable Quadrupedal Whole-Body Grasping☆24Nov 8, 2024Updated last year
- Code for PerAct², a language-conditioned imitation learning agent designed for bimanual robotic manipulation using the RLBench environmen…☆119Feb 23, 2025Updated last year
- Code for CoRL 2022 paper: https://arxiv.org/abs/2211.09006 (ToolFlowNet, for simulation envs)☆11Mar 16, 2023Updated 3 years ago
- InterPreT: Interactive Predicate Learning from Language Feedback for Generalizable Task Planning (RSS 2024)☆30Jun 18, 2024Updated last year
- Official Implementation of "Open-Vocabulary Audio-Visual Semantic Segmentation" [ACM MM 2024 Oral].☆35Nov 2, 2024Updated last year
- [TNNLS] Toward Explainable and Fine-Grained 3D Grounding through Referring Textual Phrases☆16Jul 10, 2025Updated 8 months ago
- ☆90Sep 23, 2025Updated 6 months ago
- [NeurIPS 2024] GenRL: Multimodal-foundation world models enable grounding language and video prompts into embodied domains, by turning th…☆86Apr 4, 2025Updated 11 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- ☆19Sep 2, 2025Updated 6 months ago
- [IROS 2024] LEEPS : Learning End-to-End Legged Perceptive Parkour Skills on Challenging Terrains☆78Jul 2, 2024Updated last year
- Code for the paper "Trust the PRoC3S: Solving Long-Horizon Robotics Problems with LLMs and Constraint Satisfaction" presented at CoRL 202…☆31Nov 18, 2024Updated last year
- Learning Real-World Action-Video Dynamics with Heterogeneous Masked Autoregression☆43Feb 17, 2025Updated last year
- HACMan++ code release. RSS 2024.☆22Dec 23, 2024Updated last year
- Code for the paper "3D Diffuser Actor: Policy Diffusion with 3D Scene Representations"☆385Aug 17, 2024Updated last year
- Coarse-to-fine Q-Network☆59Aug 6, 2024Updated last year