bdaiinstitute / detic-samLinks
Detic + SAM for open-vocabulary object detection and segmentation.
☆19Updated 2 months ago
Alternatives and similar repositories for detic-sam
Users that are interested in detic-sam are comparing it to the libraries listed below
Sorting:
- Code for RRL (https://sites.google.com/view/abstractions4rl)☆27Updated 4 years ago
- Temporally Correlated Episodic Reinforcement Learning, ICLR 24☆12Updated last year
- Subtask-Aware Visual Reward Learning from Segmented Demonstrations (ICLR 2025 accepted)☆18Updated 9 months ago
- Code for "DittoGym: Learning to Control Soft Shape-Shifting Robots" by Suning Huang, Boyuan Chen, Huazhe Xu, and Vincent Sitzmann.☆30Updated 8 months ago
- Codebase for HiP☆90Updated 2 years ago
- Galactic Scaling End-to-End Reinforcement Learning for Rearrangement at 100k Steps-Per-Second☆86Updated 2 years ago
- Reshaping Robot Trajectories Using Natural Language Commands: A Study of Multi-Modal Data Alignment Using Transformers☆60Updated 3 years ago
- [ICRA 2024] Dream2Real: Zero-Shot 3D Object Rearrangement with Vision-Language Models☆68Updated last year
- ☆36Updated 3 years ago
- [EMNLP 2023 (Findings)] This repository contains data processing, evaluation, and fine-tuning code for NEWTON: Are Large Language Models …☆40Updated last year
- Annotated Tutorial for PerAct☆19Updated 2 years ago
- ☆18Updated 2 years ago
- Evaluating pre-trained navigation agents under corruptions☆31Updated 4 years ago
- Reimplementation of facebook's DinoV2 in JAX. Inference (with pretrained weights) only; training is unsupported.☆12Updated last year
- ☆35Updated 7 months ago
- Official implementation of "Cross-Domain Transfer via Semantic Skill Imitation", Pertsch et al., CoRL 2022☆14Updated 3 years ago
- MidasTouch: Monte-Carlo inference over distributions across sliding touch☆48Updated 2 years ago
- ☆12Updated 3 years ago
- Language/Clicking grounded SAM + VOS for real-time video object tracking☆20Updated last year
- Slot-TTA shows that test-time adaptation using slot-centric models can improve image segmentation on out-of-distribution examples.☆26Updated 2 years ago
- [CoRL22] Frame Mining - a Free Lunch for Learning Robotic Manipulation from 3D Point Clouds☆29Updated 3 years ago
- Code for the RSS 2023 paper "Energy-based Models are Zero-Shot Planners for Compositional Scene Rearrangement"☆21Updated 2 years ago
- Pytorch implementation of Stable Vector Fields on Lie Groups through Diffeomorphism☆22Updated last year
- ☆23Updated 4 years ago
- This code corresponds to transformer training and evaluation code used as part of the OPTIMUS project.☆82Updated 2 years ago
- Implementation of Deepmind's RoboCat: "Self-Improving Foundation Agent for Robotic Manipulation" An next generation robot LLM☆87Updated 2 years ago
- Task planning over 3D scene graphs☆19Updated 3 years ago
- ☆61Updated 2 years ago
- [CoRL 2024] Official code for "Scaling Robot Policy Learning via Zero-Shot Labeling with Foundation Models"☆28Updated last year
- NeurIPS 2022 Paper "VLMbench: A Compositional Benchmark for Vision-and-Language Manipulation"☆98Updated 8 months ago