bo-miao/HTR

[TCSVT 2024] Temporally Consistent Referring Video Object Segmentation with Hybrid Memory

☆19

Alternatives and similar repositories for HTR

Users interested in HTR are comparing it to the repositories listed below.

bo-miao / RefHuman
[NeurIPS 2024] Referring Human Pose and Mask Estimation In the Wild
☆45 · May 29, 2026 · Updated last month
SkyworkAI / DAQ-VS
Code For Our Work: DVIS-DAQ: Improving Video Segmentation via Dynamic Anchor Queries [ECCV-2024]
☆15 · Jul 11, 2024 · Updated 2 years ago
bo-miao / LangMap
LangMap: A Human-Verified Benchmark for Hierarchical Open-Vocabulary Goal Navigation
☆49 · Jun 3, 2026 · Updated last month
lxa9867 / R2VOS
Robust Referring Video Object Segmentation with Cyclic Structural Consistency [ICCV 2023]
☆30 · Mar 13, 2024 · Updated 2 years ago
wudongming97 / OnlineRefer
[ICCV 2023] OnlineRefer: A Simple Online Baseline for Referring Video Object Segmentation
☆58 · Oct 7, 2023 · Updated 2 years ago
cvlab-kaist / SOLA
Official implementation of "Referring Video Object Segmentation via Language Aligned Track Selection".
☆41 · Jun 2, 2025 · Updated last year
ut-vision / ActionVOS
[ECCV 2024 Oral] ActionVOS: Actions as Prompts for Video Object Segmentation
☆32 · Dec 4, 2024 · Updated last year
heshuting555 / DsHmp
[CVPR-2024] Decoupling Static and Hierarchical Motion Perception for Referring Video Segmentation
☆83 · Jul 24, 2024 · Updated 2 years ago
siyuanliii / SLAck
Official Implementation of ECCV2024 paper: SLAck
☆29 · Sep 18, 2024 · Updated last year
buxiangzhiren / VD-IT
Code for the paper "Exploring Pre-trained Text-to-Video Diffusion Models for Referring Video Object Segmentation", ECCV 2024
☆48 · Sep 28, 2024 · Updated last year
FreeformRobotics / OTS
[IROS 2021] Object-to-Scene: Learning to Transfer Object Knowledge to Indoor Scene Recognition.
☆38 · Apr 29, 2022 · Updated 4 years ago
GeWu-Lab / Ref-AVS
The official repo for "Ref-AVS: Refer and Segment Objects in Audio-Visual Scenes", ECCV 2024
☆50 · Oct 12, 2025 · Updated 9 months ago
shuheikurita / RefEgo
☆13 · Jul 20, 2024 · Updated 2 years ago
Lynne-Zheng-Linfang / GeoReF
The code for GeoReF: Geometric Alignment Across Shape Variation for Category-level Object Pose Refinement (CVPR 2024).
☆15 · Nov 20, 2024 · Updated last year
FreeformRobotics / BORM
[IROS 2021] Official implementation of paper: "BORM: Bayesian Object Relation Model for Indoor Scene Recognition"
☆12 · Jul 13, 2022 · Updated 4 years ago
rongfu-dsb / MPG-SAM2
[ICCV 2025] MPG-SAM 2: Adapting SAM 2 with Mask Priors and Global Context for Referring Video Object Segmentation
☆23 · Sep 5, 2025 · Updated 10 months ago
kagawa588 / GvSeg
This is the official implementation of "GvSeg: General and Task-Oriented Video Segmentation" (Accepted at ECCV 2024).
☆18 · Jul 15, 2024 · Updated 2 years ago
MegEngine / ECCV2022-RIFE
Official MegEngine Implementation of Real-Time Intermediate Flow Estimation for Video Frame Interpolation
☆30 · Jul 14, 2022 · Updated 4 years ago
fengguang94 / CEFNet
Encoder Fusion Network with Co-Attention Embedding for Referring Image Segmentation, CVPR2021
☆21 · Aug 17, 2021 · Updated 4 years ago
rkzheng99 / ViLLa
Video Reasoning Segmentation
☆26 · Nov 29, 2024 · Updated last year
Robertwyq / Object-Affinity
[TPAMI 2023] Object Affinity Learning: Towards Annotation-free Instance Segmentation
☆14 · Sep 14, 2023 · Updated 2 years ago
asudahkzj / Wnet
Wnet: Audio-Guided Video Object Segmentation via Wavelet-Based Cross-Modal Denoising Networks
☆24 · Sep 6, 2022 · Updated 3 years ago
SalesforceAIResearch / ActiveVideoPerception
Official Code for paper "Active Video Perception: Iterative Evidence Seeking for Agentic Long Video Understanding"
☆18 · Jun 2, 2026 · Updated last month
lyz21 / SPU-PMD
SPU-PMD: Self-Supervised Point Cloud Upsampling via Progressive Mesh Deformation (CVPR 2024)
☆13 · Nov 5, 2025 · Updated 8 months ago
intelligolabs / R2RIE-CE
[IROS 24] Official repository of "Mind the Error! Detection and Localization of Instruction Errors in Vision-and-Language Navigation". We…
☆19 · Apr 1, 2026 · Updated 3 months ago
RobertLuo1 / NeurIPS2023_SOC
[NeurIPS 2023] The official implementation of SOC: Semantic-Assisted Object Cluster for Referring Video Object Segmentation
☆33 · Mar 16, 2024 · Updated 2 years ago
GengzeZhou / SAME
[ICCV 2025] Official implementation of SAME: Learning Generic Language-Guided Visual Navigation with State-Adaptive Mixture of Experts
☆40 · Apr 3, 2026 · Updated 3 months ago
princeton-vl / InFlux
☆16 · Updated this week
cilinyan / VISA
[ECCV24] VISA: Reasoning Video Object Segmentation via Large Language Model
☆214 · Aug 5, 2024 · Updated last year
ChenHsing / VIDiff
☆39 · Dec 4, 2023 · Updated 2 years ago
wangsen99 / LMEE
(CVPR 26) Explore with Long-term Memory: A Benchmark and Multimodal LLM-based Reinforcement Learning Framework for Embodied Exploration
☆36 · Mar 8, 2026 · Updated 4 months ago
whcpumpkin / MO-DDN
☆16 · Dec 30, 2025 · Updated 6 months ago
mc-lan / ProxyCLIP
[ECCV2024] ProxyCLIP: Proxy Attention Improves CLIP for Open-Vocabulary Segmentation
☆120 · Mar 26, 2025 · Updated last year
yoxu515 / MITS
☆21 · Jul 25, 2024 · Updated 2 years ago
miriambellver / refvos
RefVOS
☆28 · Feb 3, 2021 · Updated 5 years ago
rkzheng99 / TMT-VIS
Taxonomy-aware Multi-dataset Joint Training for Video Instance Segmentation (NeurIPS 23)
☆12 · May 7, 2025 · Updated last year
3dlg-hcvc / langmonmap
☆17 · May 6, 2026 · Updated 2 months ago
aquastripe / DenseCLIP
An unofficial implementation for paper "DenseCLIP: Extract Free Dense Labels from CLIP"
☆24 · Jan 27, 2022 · Updated 4 years ago
appletea233 / AL-Ref-SAM2
[AAAI 2025] AL-Ref-SAM 2: Unleashing the Temporal-Spatial Reasoning Capacity of GPT for Training-Free Audio and Language Referenced Video…
☆93 · Dec 23, 2024 · Updated last year