RogerQi / Tracking_SAM
Language/Clicking grounded SAM + VOS for real-time video object tracking
☆19 Updated 6 months ago
Alternatives and similar repositories for Tracking_SAM
Users that are interested in Tracking_SAM are comparing it to the libraries listed below
- Official implementation of GROOT, CoRL 2023 ☆62 Updated last year
- Code for Point Policy: Unifying Observations and Actions with Key Points for Robot Manipulation ☆75 Updated 3 weeks ago
- ☆69 Updated 9 months ago
- Official codebase of paper "ManiWAV: Learning Robot Manipulation from In-the-Wild Audio-Visual Data" ☆56 Updated last year
- ☆56 Updated 6 months ago
- [ICRA 2024] Dream2Real: Zero-Shot 3D Object Rearrangement with Vision-Language Models ☆69 Updated last year
- Official implementation of Crossing the Human-Robot Embodiment Gap with Sim-to-Real RL using One Human Demonstration ☆101 Updated 2 months ago
- ☆70 Updated 5 months ago
- [ICRA 2025] In-Context Imitation Learning via Next-Token Prediction ☆87 Updated 4 months ago
- Repo for Bring Your Own Vision-Language-Action (VLA) model, arxiv 2024 ☆30 Updated 6 months ago
- Code release for paper "Autonomous Improvement of Instruction Following Skills via Foundation Models" | CoRL 2024 ☆73 Updated 7 months ago
- Official Code for SGRv2 and SGR. ☆30 Updated 2 months ago
- Official Code Repo for GENIMA ☆74 Updated 10 months ago
- ☆34 Updated 3 weeks ago
- [CoRL 2024] ClutterGen: A Cluttered Scene Generator for Robot Learning ☆40 Updated 10 months ago
- ☆39 Updated 2 months ago
- ☆19 Updated 5 months ago
- This repository provides the sample code designed to interpret human demonstration videos and convert them into high-level tasks for robo… ☆42 Updated 9 months ago
- Accompanying codebase for paper "Touch begins where vision ends: Generalizable policies for contact-rich manipulation" ☆84 Updated last month
- ☆88 Updated last year
- Streaming Diffusion Policy: Fast Policy Synthesis with Variable Noise Diffusion Models ☆65 Updated 2 months ago
- Code base for See to Touch project: https://see-to-touch.github.io/ ☆52 Updated last year
- ViTacFormer: Learning Cross-Modal Representation for Visuo-Tactile Dexterous Manipulation ☆52 Updated 3 weeks ago
- ☆55 Updated 4 months ago
- A list of awesome and popular robot learning environments ☆111 Updated 11 months ago
- ☆120 Updated 2 years ago
- ☆36 Updated this week
- ☆72 Updated 9 months ago
- SDP ☆63 Updated 10 months ago
- ☆89 Updated 11 months ago