wngkj / Lang2SegTrackLinks
This is an open source project that can track and segment specific objects in video streams by manual clicks, box selections, or text prompts.
☆149Updated 2 weeks ago
Alternatives and similar repositories for Lang2SegTrack
Users that are interested in Lang2SegTrack are comparing it to the libraries listed below
Sorting:
- [NeurIPS 2025] More Than Generation: Unifying Generation and Depth Estimation via Text-to-Image Diffusion Models☆215Updated 2 months ago
- ☆303Updated 2 months ago
- [NeurIPS 2025] NAUTILUS: A Large Multimodal Model for Underwater Scene Understanding☆347Updated 2 weeks ago
- [NeurIPS 2025 (D&B)] Rethinking Evaluation of Infrared Small Target Detection☆341Updated 2 months ago
- 🌐 Forging Spatial Intelligence: A Roadmap of Multi-Modal Data Pre-Training for Autonomous Systems☆120Updated this week
- A curated list of papers, code and resources pertaining to image composition/compositing or object insertion/addition/compositing, which …☆533Updated last month
- Text-to-3D Generation by 2D Editing☆112Updated 5 months ago
- [CVPR 2025 Highlight] Official code for "Olympus: A Universal Task Router for Computer Vision Tasks"☆428Updated 7 months ago
- Match-Stereo-Videos via Bidirectional Alignment (An update of BiDAStereo)☆83Updated 3 weeks ago
- [ICCV2025 Highlight] Stereo Any Video: Temporally Consistent Stereo Matching☆381Updated 3 weeks ago
- A curated collection of AI+X papers published in Nature / Science / Cell / Lancet / Radiology and their flagship sub-journals☆136Updated 3 months ago
- [CVPR 2025] The code and model for our paper "Shadow Generation Using Diffusion Model with Geometry Prior", CVPR, 2025.☆139Updated last month
- [ICCV2025 Highlight] DicFace: Dirichlet-Constrained Variational Codebook Learning for Temporally Coherent Video Face Restoration☆446Updated 5 months ago
- The summary of code and paper for unified model towards context-dependent (CD) concept segmentation.☆119Updated 4 months ago
- [ICLR 2025] Ctrl-U: Robust Conditional Image Generation via Uncertainty-aware Reward Modeling☆82Updated 10 months ago
- [ICCV 2025] LocalDyGS : Multi-view Global Dynamic Scene Modeling through Adaptive Local Feature Decoupling☆108Updated last month
- Wan2.1 with Controlnet☆179Updated 9 months ago
- (CVPR 2024 & arXiv 2025) Power Battery Detection☆310Updated 3 months ago
- Think with 3D: Geometric Imagination Grounded Spatial Reasoning from Limited Views☆172Updated 3 weeks ago
- [CVPR 2024 Highlight] DiVa360 dataset☆95Updated 5 months ago
- This is the source code for the ECCV paper "MTFormer: Multi-Task Learning via Transformer and Cross-Task Reasoning"☆200Updated 3 years ago
- ☆386Updated 5 months ago
- [NeurIPS'2025] Official repository for "LiveStar: Live Streaming Assistant for Real-World Online Video Understanding"☆101Updated last month
- (IJCV 2024 & ACM MM 2021 Oral) Multi-Source Fusion and Automatic Predictor Selection for Zero-Shot Video Object Segmentation☆119Updated 3 years ago
- (TIP 2022) Joint Learning of Salient Object Detection, Depth Estimation and Contour Extraction☆109Updated 9 months ago
- NanoHTNet: Nano Human Topology Network for Efficient 3D Human Pose Estimation☆218Updated 2 months ago
- ☆207Updated 7 months ago
- hybrid sfm with VIO Pose,RGB and depth data☆52Updated 2 years ago
- (ECCV 2024) Open-Vocabulary Camouflaged Object Segmentation☆268Updated 4 months ago
- Official Pytorch implementation for ICML 2025 paper "Large Continual Instruction Assistant"☆66Updated last week