(CVPR 2026) Long-RVOS: A Comprehensive Benchmark for Long-term Referring Video Object Segmentation
☆33Feb 28, 2026Updated last month
Alternatives and similar repositories for Long_RVOS
Users that are interested in Long_RVOS are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- DisTime: Distribution-based Time Representation for Video Large Language Models.☆20Jul 10, 2025Updated 9 months ago
- [WIP] Code for LangToMo☆20Mar 19, 2026Updated 3 weeks ago
- Code and data for EMNLP 2023 paper "Grounding Visual Illusions in Language: Do Vision-Language Models Perceive Illusions Like Humans?"☆15Jan 25, 2024Updated 2 years ago
- CoRL25-"AimBot: A Simple Auxiliary Visual Cue to Enhance Spatial Awareness of Visuomotor Policies"☆45Aug 15, 2025Updated 8 months ago
- (ICCV 2025) ReferDINO: Referring Video Object Segmentation with Visual Grounding Foundations☆136Nov 14, 2025Updated 5 months ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Mixture of Global and Local Experts with Diffusion Transformer for Controllable Face Generation☆29Dec 10, 2025Updated 4 months ago
- Abductive discourse pipeline for multilingual metaphor interpretation☆10Mar 11, 2020Updated 6 years ago
- Label dialogue with Dialogue Acts and Adjacency Pairs☆12Jun 20, 2023Updated 2 years ago
- Code for EMNLP 2022 Paper DANLI: Deliberative Agent for Following Natural Language Instructions☆18May 1, 2025Updated 11 months ago
- Pytorch Implementation of "HandNeRF: Learning to Reconstruct Hand-Object Interaction Scene from a Single RGB Image", In ICRA 2024☆27Mar 27, 2024Updated 2 years ago
- A lightweight web-based annotation tool for labelling entity recognition data.☆23Aug 19, 2024Updated last year
- [ACL2026] Uni-MMMU : A Massive Multi-discipline Multimodal Unified Benchmark☆23Feb 13, 2026Updated 2 months ago
- Official code for "Rethinking Chain-of-Thought Reasoning for Videos"☆20Dec 14, 2025Updated 4 months ago
- [CVPR 2024] Selective, Interpretable and Motion Consistent Privacy Attribute Obfuscation for Action Recognition☆12Mar 20, 2024Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- MAPLE infuses dexterous manipulation priors from egocentric videos into vision encoders, making their features well-suited for downstream…☆30Dec 9, 2025Updated 4 months ago
- [CVPR 2026] TimeLens: Rethinking Video Temporal Grounding with Multimodal LLMs☆123Mar 12, 2026Updated last month
- [NeurIPS 2025] Panoptic Captioning: An Equivalence Bridge for Image and Text☆35Jan 31, 2026Updated 2 months ago
- [ICLR'25] Do Egocentric Video-Language Models Truly Understand Hand-Object Interactions?☆12Apr 11, 2025Updated last year
- ☆18Jul 8, 2025Updated 9 months ago
- A python program to extract the dominant colors of an image and to visualize their dominance.☆14Oct 24, 2017Updated 8 years ago
- PyTorch implementation of "HERO: Human Reaction Generation from Videos (ICCV 2025)"☆32Mar 27, 2026Updated 2 weeks ago
- zotero + notion☆18Sep 1, 2021Updated 4 years ago
- ☆15Jun 2, 2025Updated 10 months ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ☆41Oct 16, 2025Updated 5 months ago
- Official PyTorch implementation of CVPR2022 paper “Learning to Imagine: Diversify Memory for Incremental Learning using Unlabeled Data”☆13Jul 25, 2022Updated 3 years ago
- ☆28Aug 13, 2025Updated 8 months ago
- Data release for Step Differences in Instructional Video (CVPR24)☆14Jun 19, 2024Updated last year
- ☆11Oct 13, 2024Updated last year
- Annotations for the Mistake Detection benchmark of Assembly101☆11Aug 3, 2023Updated 2 years ago
- Implementation of Dream to Manipulate: Compositional World Models Empowering Robot Imitation Learning with Imagination☆36May 7, 2025Updated 11 months ago
- PyTorch implements `Image Super-Resolution Using Very Deep Residual Channel Attention Networks` paper.☆15Dec 6, 2022Updated 3 years ago
- (ECCV 2024) Official PyTorch implementation of paper "Progressive Pretext Task Learning for Human Trajectory Prediction"☆61Apr 25, 2025Updated 11 months ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- (CVPR2025 Highlight) Official repository of paper "Panorama Generation From NFoV Image Done Right"☆19May 29, 2025Updated 10 months ago
- (NeurIPS 2024) Official repository of paper "Grasp as You Say: Language-guided Dexterous Grasp Generation"☆54Mar 30, 2026Updated 2 weeks ago
- contact planning for dexterous hand manipulation☆19Jul 8, 2023Updated 2 years ago
- 4RC: 4D Reconstruction via Conditional Querying Anytime and Anywhere☆107Feb 11, 2026Updated 2 months ago
- ☆27Dec 9, 2025Updated 4 months ago
- [CVPR 2023] Better “CMOS” Produces Clearer Images: Learning Space-Variant Blur Estimation for Blind Image Super-Resolution☆10Mar 19, 2024Updated 2 years ago
- WBSR: Rethinking Imbalance in Image Super-Resolution for Efficient Inference☆13Oct 8, 2024Updated last year