(CVPR 2026) Long-RVOS: A Comprehensive Benchmark for Long-term Referring Video Object Segmentation
☆36Feb 28, 2026Updated 3 months ago
Alternatives and similar repositories for Long_RVOS
Users that are interested in Long_RVOS are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- DisTime: Distribution-based Time Representation for Video Large Language Models.☆21Jul 10, 2025Updated 11 months ago
- Security-native LLM system for AI-generated application security.☆253Jun 4, 2026Updated last week
- [WIP] Code for LangToMo☆21Mar 19, 2026Updated 2 months ago
- Code and data for EMNLP 2023 paper "Grounding Visual Illusions in Language: Do Vision-Language Models Perceive Illusions Like Humans?"☆15Jan 25, 2024Updated 2 years ago
- (ICCV 2025) ReferDINO: Referring Video Object Segmentation with Visual Grounding Foundations☆142Nov 14, 2025Updated 7 months ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- CoRL25-"AimBot: A Simple Auxiliary Visual Cue to Enhance Spatial Awareness of Visuomotor Policies"☆48Aug 15, 2025Updated 10 months ago
- Mixture of Global and Local Experts with Diffusion Transformer for Controllable Face Generation☆33Dec 10, 2025Updated 6 months ago
- The offical repo for "LLMs Improving LLMs: Agentic Discovery for Test-Time Scaling"☆165May 15, 2026Updated last month
- Abductive discourse pipeline for multilingual metaphor interpretation☆10Mar 11, 2020Updated 6 years ago
- Label dialogue with Dialogue Acts and Adjacency Pairs☆12Jun 20, 2023Updated 2 years ago
- Code for EMNLP 2022 Paper DANLI: Deliberative Agent for Following Natural Language Instructions☆18May 1, 2025Updated last year
- Pytorch Implementation of "HandNeRF: Learning to Reconstruct Hand-Object Interaction Scene from a Single RGB Image", In ICRA 2024☆27Mar 27, 2024Updated 2 years ago
- CoMA: Compositional Human Motion Generation with Multi-modal Agents☆16Jul 31, 2025Updated 10 months ago
- A lightweight web-based annotation tool for labelling entity recognition data.☆23Aug 19, 2024Updated last year
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- [ACL2026] Uni-MMMU : A Massive Multi-discipline Multimodal Unified Benchmark☆25Apr 13, 2026Updated 2 months ago
- Official code for "Rethinking Chain-of-Thought Reasoning for Videos"☆21Dec 14, 2025Updated 6 months ago
- MAPLE infuses dexterous manipulation priors from egocentric videos into vision encoders, making their features well-suited for downstream…☆33Dec 9, 2025Updated 6 months ago
- [CVPR 2024] Selective, Interpretable and Motion Consistent Privacy Attribute Obfuscation for Action Recognition☆12Mar 20, 2024Updated 2 years ago
- [ICLR'25] Do Egocentric Video-Language Models Truly Understand Hand-Object Interactions?☆13Apr 11, 2025Updated last year
- ☆84Jun 2, 2026Updated last week
- [NeurIPS 2025] Panoptic Captioning: An Equivalence Bridge for Image and Text☆38Jan 31, 2026Updated 4 months ago
- ☆18Jul 8, 2025Updated 11 months ago
- A python program to extract the dominant colors of an image and to visualize their dominance.☆14Oct 24, 2017Updated 8 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- PyTorch implementation of "HERO: Human Reaction Generation from Videos (ICCV 2025)"☆34Mar 27, 2026Updated 2 months ago
- zotero + notion☆19Sep 1, 2021Updated 4 years ago
- ☆15Jun 2, 2025Updated last year
- [CVPR 2026] TimeLens: Rethinking Video Temporal Grounding with Multimodal LLMs☆142Apr 27, 2026Updated last month
- Official PyTorch implementation of CVPR2022 paper “Learning to Imagine: Diversify Memory for Incremental Learning using Unlabeled Data”☆13Jul 25, 2022Updated 3 years ago
- ☆28Aug 13, 2025Updated 10 months ago
- Data release for Step Differences in Instructional Video (CVPR24)☆14Jun 19, 2024Updated last year
- ☆11Oct 13, 2024Updated last year
- Implementation of Dream to Manipulate: Compositional World Models Empowering Robot Imitation Learning with Imagination☆38May 7, 2025Updated last year
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- PyTorch implements `Image Super-Resolution Using Very Deep Residual Channel Attention Networks` paper.☆15Dec 6, 2022Updated 3 years ago
- (ECCV 2024) Official PyTorch implementation of paper "Progressive Pretext Task Learning for Human Trajectory Prediction"☆61Apr 25, 2025Updated last year
- (CVPR2025 Highlight) Official repository of paper "Panorama Generation From NFoV Image Done Right"☆19May 29, 2025Updated last year
- One Discrete Word for Visual Reasoning Overtakes Agentic and Latent Methods☆126Updated this week
- Annotations for the Mistake Detection benchmark of Assembly101☆12Aug 3, 2023Updated 2 years ago
- contact planning for dexterous hand manipulation☆19Jul 8, 2023Updated 2 years ago
- ☆28Dec 9, 2025Updated 6 months ago