wngkj / Lang2SegTrack
An open-source project that tracks and segments specific objects in video streams, with targets selected by manual clicks, box selections, or text prompts.
☆149Updated last month
Alternatives and similar repositories for Lang2SegTrack
Users who are interested in Lang2SegTrack compare it to the repositories listed below.
- ☆314Updated 3 months ago
- [NeurIPS 2025] More Than Generation: Unifying Generation and Depth Estimation via Text-to-Image Diffusion Models☆215Updated 2 months ago
- Match-Stereo-Videos via Bidirectional Alignment (An update of BiDAStereo)☆83Updated last month
- 🌐 Forging Spatial Intelligence: A Roadmap of Multi-Modal Data Pre-Training for Autonomous Systems☆131Updated 3 weeks ago
- [CVPR 2025] The code and model for our paper "Shadow Generation Using Diffusion Model with Geometry Prior", CVPR, 2025.☆140Updated last month
- ☆207Updated 8 months ago
- [CVPR 2025 Highlight] Official code for "Olympus: A Universal Task Router for Computer Vision Tasks"☆428Updated 7 months ago
- A curated collection of AI+X papers published in Nature / Science / Cell / Lancet / Radiology and their flagship sub-journals☆136Updated 3 months ago
- [ICCV2025 Highlight] Stereo Any Video: Temporally Consistent Stereo Matching☆385Updated last month
- [NeurIPS 2025] NAUTILUS: A Large Multimodal Model for Underwater Scene Understanding☆350Updated last month
- A curated list of papers, code and resources pertaining to image composition/compositing or object insertion/addition/compositing, which …☆532Updated last month
- [NeurIPS 2025 (D&B)] Rethinking Evaluation of Infrared Small Target Detection☆352Updated 3 months ago
- [ICCV2025 Highlight] DicFace: Dirichlet-Constrained Variational Codebook Learning for Temporally Coherent Video Face Restoration☆446Updated 6 months ago
- [ICLR 2025] Ctrl-U: Robust Conditional Image Generation via Uncertainty-aware Reward Modeling☆82Updated 2 weeks ago
- ☆385Updated 6 months ago
- This is the source code for the ECCV paper "MTFormer: Multi-Task Learning via Transformer and Cross-Task Reasoning"☆199Updated 3 years ago
- The summary of code and paper for unified model towards context-dependent (CD) concept segmentation.☆119Updated 5 months ago
- Official implementation of "Enhancing Close-up Novel View Synthesis via Pseudo-labeling" [AAAI 2025]☆15Updated 9 months ago
- [CVPR 2024 Highlight] DiVa360 dataset☆94Updated 6 months ago
- Think with 3D: Geometric Imagination Grounded Spatial Reasoning from Limited Views☆181Updated last month
- Official Pytorch implementation for ICML 2025 paper "Large Continual Instruction Assistant"☆66Updated last month
- NanoHTNet: Nano Human Topology Network for Efficient 3D Human Pose Estimation☆219Updated 3 months ago
- [MM 2025] EventVAD: Training-Free Event-Aware Video Anomaly Detection☆517Updated 6 months ago
- [Accepted by Information Fusion] Official code of the paper "Relational Representation Learning Network for Cross-Spectral Image Patch Ma…☆33Updated 4 months ago
- https://www.kaggle.com/competitions/image-matching-challenge-2022☆45Updated 2 years ago
- [ICCV 2025] LocalDyGS : Multi-view Global Dynamic Scene Modeling through Adaptive Local Feature Decoupling☆110Updated 2 months ago
- Official code of the paper "Why and How: Knowledge-Guided Learning for Cross-Spectral Image Patch Matching"☆43Updated 11 months ago
- ☆207Updated 6 months ago
- Hybrid SfM with VIO pose, RGB, and depth data☆52Updated 2 years ago
- ☆15Updated last year