(CVPR 2026) Long-RVOS: A Comprehensive Benchmark for Long-term Referring Video Object Segmentation
☆35Feb 28, 2026Updated 2 months ago
Alternatives and similar repositories for Long_RVOS
Users that are interested in Long_RVOS are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- DisTime: Distribution-based Time Representation for Video Large Language Models.☆20Jul 10, 2025Updated 9 months ago
- [WIP] Code for LangToMo☆21Mar 19, 2026Updated last month
- Code and data for EMNLP 2023 paper "Grounding Visual Illusions in Language: Do Vision-Language Models Perceive Illusions Like Humans?"☆15Jan 25, 2024Updated 2 years ago
- (ICCV 2025) ReferDINO: Referring Video Object Segmentation with Visual Grounding Foundations☆137Nov 14, 2025Updated 5 months ago
- CoRL25-"AimBot: A Simple Auxiliary Visual Cue to Enhance Spatial Awareness of Visuomotor Policies"☆45Aug 15, 2025Updated 8 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Mixture of Global and Local Experts with Diffusion Transformer for Controllable Face Generation☆32Dec 10, 2025Updated 4 months ago
- Abductive discourse pipeline for multilingual metaphor interpretation☆10Mar 11, 2020Updated 6 years ago
- Label dialogue with Dialogue Acts and Adjacency Pairs☆12Jun 20, 2023Updated 2 years ago
- ☆53Apr 22, 2026Updated 2 weeks ago
- Code for EMNLP 2022 Paper DANLI: Deliberative Agent for Following Natural Language Instructions☆18May 1, 2025Updated last year
- Pytorch Implementation of "HandNeRF: Learning to Reconstruct Hand-Object Interaction Scene from a Single RGB Image", In ICRA 2024☆27Mar 27, 2024Updated 2 years ago
- CoMA: Compositional Human Motion Generation with Multi-modal Agents☆15Jul 31, 2025Updated 9 months ago
- A lightweight web-based annotation tool for labelling entity recognition data.☆23Aug 19, 2024Updated last year
- [ACL2026] Uni-MMMU : A Massive Multi-discipline Multimodal Unified Benchmark☆24Apr 13, 2026Updated 3 weeks ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Official code for "Rethinking Chain-of-Thought Reasoning for Videos"☆20Dec 14, 2025Updated 4 months ago
- [CVPR 2024] Selective, Interpretable and Motion Consistent Privacy Attribute Obfuscation for Action Recognition☆12Mar 20, 2024Updated 2 years ago
- MAPLE infuses dexterous manipulation priors from egocentric videos into vision encoders, making their features well-suited for downstream…☆31Dec 9, 2025Updated 4 months ago
- [ICLR'25] Do Egocentric Video-Language Models Truly Understand Hand-Object Interactions?☆12Apr 11, 2025Updated last year
- [NeurIPS 2025] Panoptic Captioning: An Equivalence Bridge for Image and Text☆35Jan 31, 2026Updated 3 months ago
- ☆18Jul 8, 2025Updated 9 months ago
- A python program to extract the dominant colors of an image and to visualize their dominance.☆14Oct 24, 2017Updated 8 years ago
- PyTorch implementation of "HERO: Human Reaction Generation from Videos (ICCV 2025)"☆33Mar 27, 2026Updated last month
- zotero + notion☆19Sep 1, 2021Updated 4 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- [CVPR 2026] TimeLens: Rethinking Video Temporal Grounding with Multimodal LLMs☆129Apr 27, 2026Updated last week
- ☆15Jun 2, 2025Updated 11 months ago
- ☆41Oct 16, 2025Updated 6 months ago
- Official PyTorch implementation of CVPR2022 paper “Learning to Imagine: Diversify Memory for Incremental Learning using Unlabeled Data”☆13Jul 25, 2022Updated 3 years ago
- ☆28Aug 13, 2025Updated 8 months ago
- Data release for Step Differences in Instructional Video (CVPR24)☆14Jun 19, 2024Updated last year
- ☆11Oct 13, 2024Updated last year
- Implementation of Dream to Manipulate: Compositional World Models Empowering Robot Imitation Learning with Imagination☆37May 7, 2025Updated 11 months ago
- PyTorch implements `Image Super-Resolution Using Very Deep Residual Channel Attention Networks` paper.☆15Dec 6, 2022Updated 3 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- (ECCV 2024) Official PyTorch implementation of paper "Progressive Pretext Task Learning for Human Trajectory Prediction"☆61Apr 25, 2025Updated last year
- (CVPR2025 Highlight) Official repository of paper "Panorama Generation From NFoV Image Done Right"☆19May 29, 2025Updated 11 months ago
- Annotations for the Mistake Detection benchmark of Assembly101☆12Aug 3, 2023Updated 2 years ago
- contact planning for dexterous hand manipulation☆19Jul 8, 2023Updated 2 years ago
- ☆27Dec 9, 2025Updated 4 months ago
- [CVPR 2023] Better “CMOS” Produces Clearer Images: Learning Space-Variant Blur Estimation for Blind Image Super-Resolution☆10Mar 19, 2024Updated 2 years ago
- WBSR: Rethinking Imbalance in Image Super-Resolution for Efficient Inference☆13Oct 8, 2024Updated last year