HongkLin / TIDELinks
[CVPR 2025] A Unified Image-Dense Annotation Generation Model for Underwater Scenes
☆49Updated 8 months ago
Alternatives and similar repositories for TIDE
Users that are interested in TIDE are comparing it to the libraries listed below
Sorting:
- [CVPR'25] Official implementation of "Semantic Library Adaptation: LoRA Retrieval and Fusion for Open-Vocabulary Semantic Segmentation"☆41Updated 2 months ago
- [AAAI 2025] Linear-complexity Visual Sequence Learning with Gated Linear Attention☆116Updated last year
- A curated list of resources focused on Visual AutoRegressive Modeling, makes GPT-style AR models surpass diffusion transformers in image …☆38Updated 9 months ago
- ☆12Updated 6 months ago
- [ICCV 2025] Revisiting Efficient Semantic Segmentation: Learning Offsets for Better Spatial and Class Feature Alignment☆46Updated last month
- [ACM MM 2025] EmbodiedOcc++: Boosting Embodied 3D Occupancy Prediction with Plane Regularization and Uncertainty Sampler☆23Updated 4 months ago
- [RA-L] Generate Weather with LLM. Code for "WeatherDG: LLM-assisted Procedural Weather Generation for Domain-Generalized Semantic Segment…☆47Updated 6 months ago
- [ICCV 2025 Oral] CorrCLIP: Reconstructing Patch Correlations in CLIP for Open-Vocabulary Semantic Segmentation☆54Updated 4 months ago
- ☆47Updated 4 months ago
- [NIPS24] Official Implementation of Unsupervised Modality Adaptation with Text-to-Image Diffusion Models for Semantic Segmentation☆20Updated last year
- [ICCV 2025] Official repository of the paper "Talking to DINO: Bridging Self-Supervised Vision Backbones with Language for Open-Vocabular…☆152Updated last month
- Unveiling the Potential of Segment Anything Model 2 for RGB-Thermal Semantic Segmentation with Language Guidance☆11Updated 2 weeks ago
- ☆22Updated 5 months ago
- [CVPR'2025] EntitySAM: Segment Everything in Video☆57Updated 4 months ago
- [ICML2025 Oral] ReferSplat: Referring Segmentation in 3D Gaussian Splatting☆122Updated 2 months ago
- [ICCV 2025] Boosting Multi-View Indoor 3D Object Detection via Adaptive 3D Volume Construction☆20Updated 2 months ago
- Code of our CVPR2024 paper - DiffusionMTL: Learning Multi-Task Denoising Diffusion Model from Partially Annotated Data☆59Updated last year
- Official code of DMA: Dense Multimodal Alignment for Open-Vocabulary 3D Scene Understanding, ECCV 2024☆30Updated last year
- The official implementation of "PixelThink: Towards Efficient Chain-of-Pixel Reasoning" (arXiv 2025)☆38Updated 6 months ago
- [NeurIPS 2025] Leveraging Depth and Language for Open-Vocabulary Domain-Generalized Semantic Segmentation☆22Updated last month
- Atlantis: Enabling Underwater Depth Estimation with Stable Diffusion (CVPR2024, Highlight)☆116Updated last year
- official repository of CVPR 2024 paper, RMem: Restricted Memory Banks Improve Video Object Segmentation☆52Updated 10 months ago
- Paper: UniGS: Unified Language-Image-3D Pretraining with Gaussian Splatting☆28Updated 6 months ago
- An open source codebase for object detection based on Jittor☆19Updated 9 months ago
- Official PyTorch implementation of GeoDiffusion in ICLR 2024 (https://arxiv.org/abs/2306.04607)☆96Updated 4 months ago
- ICCV 2025-PerLDiff: Controllable Street View Synthesis Using Perspective-Layout Diffusion Models☆51Updated 4 months ago
- Code & Weights for “Learning Robust Anymodal Segmentor with Unimodal and Cross-modal Distillation”☆14Updated last year
- [IJCV 2024]☆19Updated last year
- Offical implementation of "Re-Aligning Language to Visual Objects with an Agentic Workflow"☆31Updated 7 months ago
- ☆13Updated 7 months ago