wngkj / Lang2SegTrack
This is an open-source project that tracks and segments specific objects in video streams using manual clicks, box selections, or text prompts.
☆145 Updated last month
Alternatives and similar repositories for Lang2SegTrack
Users that are interested in Lang2SegTrack are comparing it to the libraries listed below
- ☆293 Updated 2 months ago
- Match-Stereo-Videos via Bidirectional Alignment (An update of BiDAStereo) ☆83 Updated last week
- [NeurIPS 2025] More Than Generation: Unifying Generation and Depth Estimation via Text-to-Image Diffusion Models ☆214 Updated last month
- [CVPR 2025] The code and model for our paper "Shadow Generation Using Diffusion Model with Geometry Prior", CVPR, 2025. ☆139 Updated 2 weeks ago
- [ICCV2025 Highlight] Stereo Any Video: Temporally Consistent Stereo Matching ☆375 Updated last week
- [NeurIPS 2025] NAUTILUS: A Large Multimodal Model for Underwater Scene Understanding ☆341 Updated last month
- [NeurIPS 2025 (D&B)] Rethinking Evaluation of Infrared Small Target Detection ☆276 Updated 2 months ago
- GigaTrain: An Efficient and Scalable Training Framework for AI Models ☆252 Updated last week
- (ECCV 2024) Open-Vocabulary Camouflaged Object Segmentation ☆205 Updated 3 months ago
- https://www.kaggle.com/competitions/image-matching-challenge-2022 ☆45 Updated 2 years ago
- [ICLR 2025] Ctrl-U: Robust Conditional Image Generation via Uncertainty-aware Reward Modeling ☆82 Updated 9 months ago
- [CVPR 2025 Highlight] Official code for "Olympus: A Universal Task Router for Computer Vision Tasks" ☆427 Updated 6 months ago
- [ICCV2025 Highlight] DicFace: Dirichlet-Constrained Variational Codebook Learning for Temporally Coherent Video Face Restoration ☆444 Updated 4 months ago
- A curated list of papers, code and resources pertaining to image composition/compositing or object insertion/addition/compositing, which … ☆530 Updated 2 weeks ago
- This is the source code for the ECCV paper "MTFormer: Multi-Task Learning via Transformer and Cross-Task Reasoning" ☆200 Updated 3 years ago
- A curated collection of AI+X papers published in Nature / Science / Cell / Lancet / Radiology and their flagship sub-journals ☆136 Updated 2 months ago
- See&Trek: Training-Free Spatial Prompting for Multimodal Large Language Model ☆42 Updated this week
- ☆15 Updated last year
- [ICCV 2025] LocalDyGS : Multi-view Global Dynamic Scene Modeling through Adaptive Local Feature Decoupling ☆108 Updated 2 weeks ago
- ☆385 Updated 5 months ago
- [TIP 2025] ADStereo: Efficient Stereo Matching with Adaptive Downsampling and Disparity Alignment ☆48 Updated 6 months ago
- [CVPR 2024 Highlight] DiVa360 dataset ☆95 Updated 5 months ago
- (IJCV 2024 & ACM MM 2021 Oral) Multi-Source Fusion and Automatic Predictor Selection for Zero-Shot Video Object Segmentation ☆119 Updated 3 years ago
- Wan2.1 with Controlnet ☆178 Updated 8 months ago
- hybrid sfm with VIO Pose,RGB and depth data ☆52 Updated 2 years ago
- SCFlow2: Plug-and-Play Object Pose Refiner with Shape-Constraint Scene Flow, CVPR2025 ☆53 Updated 3 months ago
- (CVPR 2024 & arXiv 2025) Power Battery Detection ☆310 Updated 2 months ago
- Inspiring the Next Generation of Segment Anything Models: Comprehensively Evaluate SAM and SAM 2 with Diverse Prompts Towards Context-Dep… ☆574 Updated 3 months ago
- Text-to-3D Generation by 2D Editing ☆112 Updated 4 months ago
- The summary of code and paper for unified model towards context-dependent (CD) concept segmentation. ☆119 Updated 3 months ago