(CVPR 2026) Long-RVOS: A Comprehensive Benchmark for Long-term Referring Video Object Segmentation
☆36Feb 28, 2026Updated 2 months ago
Alternatives and similar repositories for Long_RVOS
Users who are interested in Long_RVOS are comparing it to the libraries listed below.
- DisTime: Distribution-based Time Representation for Video Large Language Models.☆20Jul 10, 2025Updated 10 months ago
- [WIP] Code for LangToMo☆21Mar 19, 2026Updated 2 months ago
- Code and data for EMNLP 2023 paper "Grounding Visual Illusions in Language: Do Vision-Language Models Perceive Illusions Like Humans?"☆15Jan 25, 2024Updated 2 years ago
- (ICCV 2025) ReferDINO: Referring Video Object Segmentation with Visual Grounding Foundations☆139Nov 14, 2025Updated 6 months ago
- CoRL25-"AimBot: A Simple Auxiliary Visual Cue to Enhance Spatial Awareness of Visuomotor Policies"☆48Aug 15, 2025Updated 9 months ago
- Mixture of Global and Local Experts with Diffusion Transformer for Controllable Face Generation☆33Dec 10, 2025Updated 5 months ago
- The official repo for "LLMs Improving LLMs: Agentic Discovery for Test-Time Scaling"☆133May 15, 2026Updated last week
- Abductive discourse pipeline for multilingual metaphor interpretation☆10Mar 11, 2020Updated 6 years ago
- Label dialogue with Dialogue Acts and Adjacency Pairs☆12Jun 20, 2023Updated 2 years ago
- Code for EMNLP 2022 Paper DANLI: Deliberative Agent for Following Natural Language Instructions☆18May 1, 2025Updated last year
- Pytorch Implementation of "HandNeRF: Learning to Reconstruct Hand-Object Interaction Scene from a Single RGB Image", In ICRA 2024☆27Mar 27, 2024Updated 2 years ago
- CoMA: Compositional Human Motion Generation with Multi-modal Agents☆16Jul 31, 2025Updated 9 months ago
- A lightweight web-based annotation tool for labelling entity recognition data.☆23Aug 19, 2024Updated last year
- [ACL2026] Uni-MMMU : A Massive Multi-discipline Multimodal Unified Benchmark☆25Apr 13, 2026Updated last month
- Official code for "Rethinking Chain-of-Thought Reasoning for Videos"☆20Dec 14, 2025Updated 5 months ago
- MAPLE infuses dexterous manipulation priors from egocentric videos into vision encoders, making their features well-suited for downstream…☆32Dec 9, 2025Updated 5 months ago
- [CVPR 2024] Selective, Interpretable and Motion Consistent Privacy Attribute Obfuscation for Action Recognition☆12Mar 20, 2024Updated 2 years ago
- [ICLR'25] Do Egocentric Video-Language Models Truly Understand Hand-Object Interactions?☆13Apr 11, 2025Updated last year
- ☆60May 18, 2026Updated last week
- [NeurIPS 2025] Panoptic Captioning: An Equivalence Bridge for Image and Text☆37Jan 31, 2026Updated 3 months ago
- ☆18Jul 8, 2025Updated 10 months ago
- A python program to extract the dominant colors of an image and to visualize their dominance.☆14Oct 24, 2017Updated 8 years ago
- PyTorch implementation of "HERO: Human Reaction Generation from Videos (ICCV 2025)"☆33Mar 27, 2026Updated last month
- zotero + notion☆19Sep 1, 2021Updated 4 years ago
- ☆15Jun 2, 2025Updated 11 months ago
- [CVPR 2026] TimeLens: Rethinking Video Temporal Grounding with Multimodal LLMs☆134Apr 27, 2026Updated 3 weeks ago
- Official PyTorch implementation of CVPR2022 paper “Learning to Imagine: Diversify Memory for Incremental Learning using Unlabeled Data”☆13Jul 25, 2022Updated 3 years ago
- ☆28Aug 13, 2025Updated 9 months ago
- Data release for Step Differences in Instructional Video (CVPR24)☆14Jun 19, 2024Updated last year
- ☆11Oct 13, 2024Updated last year
- Implementation of Dream to Manipulate: Compositional World Models Empowering Robot Imitation Learning with Imagination☆37May 7, 2025Updated last year
- (ECCV 2024) Official PyTorch implementation of paper "Progressive Pretext Task Learning for Human Trajectory Prediction"☆61Apr 25, 2025Updated last year
- PyTorch implementation of the paper `Image Super-Resolution Using Very Deep Residual Channel Attention Networks`.☆15Dec 6, 2022Updated 3 years ago
- (CVPR2025 Highlight) Official repository of paper "Panorama Generation From NFoV Image Done Right"☆19May 29, 2025Updated 11 months ago
- Annotations for the Mistake Detection benchmark of Assembly101☆12Aug 3, 2023Updated 2 years ago
- contact planning for dexterous hand manipulation☆19Jul 8, 2023Updated 2 years ago
- ☆28Dec 9, 2025Updated 5 months ago
- ☆44Oct 16, 2025Updated 7 months ago
- [CVPR 2023] Better “CMOS” Produces Clearer Images: Learning Space-Variant Blur Estimation for Blind Image Super-Resolution☆10Mar 19, 2024Updated 2 years ago