wngkj / Lang2SegTrackLinks
This is an open source project that can track and segment specific objects in video streams by manual clicks, box selections, or text prompts.
☆143Updated 2 weeks ago
Alternatives and similar repositories for Lang2SegTrack
Users that are interested in Lang2SegTrack are comparing it to the libraries listed below
Sorting:
- [NeurIPS 2025] More Than Generation: Unifying Generation and Depth Estimation via Text-to-Image Diffusion Models☆212Updated 3 weeks ago
- [NeurIPS 2025] NAUTILUS: A Large Multimodal Model for Underwater Scene Understanding☆317Updated 2 weeks ago
- [NeurIPS 2025 (D&B)] Rethinking Evaluation of Infrared Small Target Detection☆204Updated last month
- ☆286Updated last month
- (ECCV 2024) Open-Vocabulary Camouflaged Object Segmentation☆154Updated 3 months ago
- A curated list of papers, code and resources pertaining to image composition/compositing or object insertion/addition/compositing, which …☆530Updated 2 months ago
- Official Pytorch implementation for ICML 2025 paper "Large Continual Instruction Assistant"☆65Updated 3 months ago
- Match-Stereo-Videos via Bidirectional Alignment (An update of BiDAStereo)☆82Updated 4 months ago
- ☆15Updated 11 months ago
- (IJCV 2024 & ACM MM 2021 Oral) Multi-Source Fusion and Automatic Predictor Selection for Zero-Shot Video Object Segmentation☆119Updated 3 years ago
- (CVPR 2024 & arXiv 2025) Power Battery Detection☆310Updated last month
- [CVPR 2025 Highlight] Official code for "Olympus: A Universal Task Router for Computer Vision Tasks"☆428Updated 5 months ago
- ☆206Updated 5 months ago
- [CVPR 2025] The code and model for our paper "Shadow Generation Using Diffusion Model with Geometry Prior", CVPR, 2025.☆139Updated 3 months ago
- [ICLR 2025] Ctrl-U: Robust Conditional Image Generation via Uncertainty-aware Reward Modeling☆82Updated 9 months ago
- A curated collection of AI+X papers published in Nature / Science / Cell / Lancet / Radiology and their flagship sub-journals☆135Updated last month
- The summary of code and paper for unified model towards context-dependent (CD) concept segmentation.☆117Updated 2 months ago
- [MM 2025] EventVAD: Training-Free Event-Aware Video Anomaly Detection☆505Updated 4 months ago
- NanoHTNet: Nano Human Topology Network for Efficient 3D Human Pose Estimation☆217Updated last month
- ☆385Updated 4 months ago
- Official code of the paper "Why and How: Knowledge-Guided Learning for Cross-Spectral Image Patch Matching"☆43Updated 9 months ago
- [ICCV2025 Highlight] DicFace: Dirichlet-Constrained Variational Codebook Learning for Temporally Coherent Video Face Restoration☆439Updated 3 months ago
- This is the source code for the ECCV paper "MTFormer: Multi-Task Learning via Transformer and Cross-Task Reasoning"☆200Updated 2 years ago
- https://www.kaggle.com/competitions/image-matching-challenge-2022☆45Updated 2 years ago
- ☆209Updated 4 months ago
- 基于 Qwen2-0.5B 以及 SigLIP 实现的轻量化多模态风格化问答大模型☆28Updated 3 months ago
- SCFlow2: Plug-and-Play Object Pose Refiner with Shape-Constraint Scene Flow, CVPR2025☆51Updated 3 months ago
- hybrid sfm with VIO Pose,RGB and depth data☆52Updated 2 years ago
- ☆67Updated 3 months ago
- [ICCV 2025] LocalDyGS : Multi-view Global Dynamic Scene Modeling through Adaptive Local Feature Decoupling☆105Updated last month