(CVPR 2026) Long-RVOS: A Comprehensive Benchmark for Long-term Referring Video Object Segmentation
☆27Feb 28, 2026Updated this week
Alternatives and similar repositories for Long_RVOS
Users that are interested in Long_RVOS are comparing it to the libraries listed below
Sorting:
- DisTime: Distribution-based Time Representation for Video Large Language Models.☆19Jul 10, 2025Updated 7 months ago
- ☆38Oct 16, 2025Updated 4 months ago
- THEORY OF SPACE: a benchmark for evaluating whether foundation models can actively explore under partial observability efficiently to bui…☆40Feb 27, 2026Updated last week
- [WIP] Code for LangToMo☆20Jun 25, 2025Updated 8 months ago
- Pytorch Implementation of "HandNeRF: Learning to Reconstruct Hand-Object Interaction Scene from a Single RGB Image", In ICRA 2024☆26Mar 27, 2024Updated last year
- (ICCV 2025) ReferDINO: Referring Video Object Segmentation with Visual Grounding Foundations☆129Nov 14, 2025Updated 3 months ago
- MAPLE infuses dexterous manipulation priors from egocentric videos into vision encoders, making their features well-suited for downstream…☆29Dec 9, 2025Updated 2 months ago
- contact planning for dexterous hand manipulation☆19Jul 8, 2023Updated 2 years ago
- CoRL25-"AimBot: A Simple Auxiliary Visual Cue to Enhance Spatial Awareness of Visuomotor Policies"☆43Aug 15, 2025Updated 6 months ago
- ☆29Dec 9, 2025Updated 2 months ago
- [CVPR 2026] TimeLens: Rethinking Video Temporal Grounding with Multimodal LLMs☆105Feb 26, 2026Updated last week
- 4RC: 4D Reconstruction via Conditional Querying Anytime and Anywhere☆89Feb 11, 2026Updated 3 weeks ago
- The open-source code for the NeurIPS 2025 paper, "Beyond the 80/20 Rule: High-Entropy Minority Tokens Drive Effective Reinforcement Learn…☆45Jan 5, 2026Updated 2 months ago
- [NeurIPS 2025] ARMesh: Autoregressive Mesh Generation via Next-Level-of-Detail Prediction☆61Jan 27, 2026Updated last month
- Implementation of Dream to Manipulate: Compositional World Models Empowering Robot Imitation Learning with Imagination☆36May 7, 2025Updated 9 months ago
- [ICRA 2024] WLST: Weak Labels Guided Self-training for Weakly-supervised Domain Adaptation on 3D Object Detection☆12Feb 6, 2024Updated 2 years ago
- Multi-step AI agents powered by Gemini 2.0 and the LangGraph framework. These agents orchestrate complex workflows and enhance their reas…☆10Dec 19, 2024Updated last year
- [CVPR2026] Code Release of MVInverse: Feedforward Multi-view Inverse Rendering in Seconds☆137Jan 22, 2026Updated last month
- [CVPR 2026] Diff4Splat: Controllable 4D Scene Generation with Latent Dynamic Reconstruction Models.☆79Feb 25, 2026Updated last week
- [ICRA'25] Code for "MonoDiff9D: Monocular Category-Level 9D Object Pose Estimation via Diffusion Model".☆36Jun 3, 2025Updated 9 months ago
- 🌴[CVPR 2024] OakInk2: A Dataset of Bimanual Hands-Object Manipulation in Complex Task Completion☆92Aug 11, 2025Updated 6 months ago
- Official repository for "Visual Generation Unlocks Human-Like Reasoning through Multimodal World Models", https://arxiv.org/abs/2601.1983…☆78Feb 13, 2026Updated 3 weeks ago
- ☆69Nov 5, 2025Updated 4 months ago
- 采用松灵机械臂在Mujoco环境下实现Yolo-world+Sam+Graspnet传统抓取方法☆15Sep 15, 2025Updated 5 months ago
- AI-native knowledge kernel for human/agent collaboration. Use it as a Knowledge Base, Wiki, Annotator, Research Tool, or Agentic Memory.☆29Updated this week
- ROS2 packages for dual arm setup of Kinova robot and control using MoveIt Servo and ArUco pose estimation☆10Jul 27, 2025Updated 7 months ago
- Martingale posterior neural networks for fast sequential decision making @ Neurips 2025☆23Nov 13, 2025Updated 3 months ago
- Software to enable data-rich collaboration from high-resolution display walls to your laptop☆16Updated this week
- ROS2 catestian_impedance_controller from PdZ☆11Oct 22, 2025Updated 4 months ago
- UR Robotic Arm with Robotiq 2-Finger Gripper for ROS2☆22Feb 27, 2026Updated last week
- [ICRA 2025] A Parameter-Efficient Tuning Framework for Language-guided Object Grounding and Robot Grasping☆11Feb 7, 2025Updated last year
- official repo for AGNOSTOS, a cross-task manipulation benchmark, and X-ICM method, a cross-task in-context manipulation (VLA) method☆61Nov 26, 2025Updated 3 months ago
- [ICLR 2026] This is an early exploration to introduce Interleaving Reasoning to Text-to-image Generation field and achieve the SoTA bench…☆87Jan 26, 2026Updated last month
- [CVPR 2025] Official implementation of the paper "SimMotionEdit: Text-Based Human Motion Editing with Motion Similarity Prediction"☆47Dec 11, 2025Updated 2 months ago
- (NeurIPS 2024) Official repository of paper "Grasp as You Say: Language-guided Dexterous Grasp Generation"☆50Oct 27, 2025Updated 4 months ago
- Benchmark evaluating ocean forecasting systems against reference datasets and observations.☆26Updated this week
- 单无人机对螺旋轨迹跟踪的实物实验☆10May 22, 2023Updated 2 years ago
- Fast, free, easy, and object-agnostic video anonymization☆11Dec 12, 2020Updated 5 years ago
- [IROS' 25] D4DGS-SLAM☆14Jun 16, 2025Updated 8 months ago