HongkLin / TIDELinks
[CVPR 2025] A Unified Image-Dense Annotation Generation Model for Underwater Scenes
☆34Updated 4 months ago
Alternatives and similar repositories for TIDE
Users that are interested in TIDE are comparing it to the libraries listed below
Sorting:
- [CVPR'25] Official implementation of "Semantic Library Adaptation: LoRA Retrieval and Fusion for Open-Vocabulary Semantic Segmentation"☆30Updated 3 weeks ago
- Paper: UniGS: Unified Language-Image-3D Pretraining with Gaussian Splatting☆18Updated 2 months ago
- [CVPR 2025 Highlight🔥] Official code repository for "Inst3D-LMM: Instance-Aware 3D Scene Understanding with Multi-modal Instruction Tuni…☆103Updated last week
- officical code for ECCV 2024 paper "Global-Local Collaborative Inference with LLM for Lidar-Based Open-Vocabulary Detection"☆14Updated last year
- Point Could Mamba: Point Cloud Learning via State Space Model☆70Updated 7 months ago
- ScatterFormer: Efficient Voxel Transformer with Scattered Linear Attention (ECCV 2024)☆81Updated 2 months ago
- Official Repository for "MonoWAD: Weather-Adaptive Diffusion Model for Robust Monocular 3D Object Detection" (ECCV 2024)☆49Updated 9 months ago
- Official code of DMA: Dense Multimodal Alignment for Open-Vocabulary 3D Scene Understanding, ECCV 2024☆29Updated last year
- A curated list of resources focused on Visual AutoRegressive Modeling, makes GPT-style AR models surpass diffusion transformers in image …☆38Updated 5 months ago
- ☆38Updated last year
- [NIPS24] Official Implementation of Unsupervised Modality Adaptation with Text-to-Image Diffusion Models for Semantic Segmentation☆19Updated 9 months ago
- The official implementation of "PixelThink: Towards Efficient Chain-of-Pixel Reasoning" (arXiv 2025)☆36Updated 2 months ago
- ICCV 2025-PerLDiff: Controllable Street View Synthesis Using Perspective-Layout Diffusion Models☆49Updated 3 weeks ago
- Official PyTorch codes for "Open Vocabulary 3D Scene Understanding via Geometry Guided Self-Distillation", ECCV2024☆30Updated last year
- ☆10Updated 3 months ago
- [AAAI 2025] Linear-complexity Visual Sequence Learning with Gated Linear Attention☆110Updated last year
- Official PyTorch implementation of GeoDiffusion in ICLR 2024 (https://arxiv.org/abs/2306.04607)☆90Updated 3 weeks ago
- [AAAI 2025] Pre-Training a Density-Aware Pose Transformer for Robust LiDAR-based 3D Human Pose Estimation☆31Updated 2 months ago
- [IEEE RA-L 2025] Generate Weather with LLM. Code for "WeatherDG: LLM-assisted Procedural Weather Generation for Domain-Generalized Semant…☆40Updated 2 months ago
- [ICCV2025] Official repository of the paper "Talking to DINO: Bridging Self-Supervised Vision Backbones with Language for Open-Vocabulary…☆60Updated this week
- [CVPR 2024 Highlight] OpenESS: Event-Based Semantic Scene Understanding with Open Vocabularies☆54Updated last year
- [CVPR 2025] Official repository of the paper "Mask-Adapter: The Devil is in the Masks for Open-Vocabulary Segmentation"☆106Updated last month
- [NeurIPS 2024] TALoS: Enhancing Semantic Scene Completion via Test-time Adaptation on the Line of Sight☆31Updated 3 weeks ago
- [ICLR 2025] Semi-Supervised Vision-Centric 3D Occupancy World Model for Autonomous Driving☆41Updated 5 months ago
- [CVPR 2025 Highlight] Ev3DOD: Pushing the Temporal Boundaries of 3D Object Detection with Event Cameras☆16Updated last week
- [ACM MM2024] Official implementation of the paper "GOI: Find 3D Gaussians of Interest with an Optimizable Open-vocabulary Semantic-space …☆66Updated 9 months ago
- [AAAI 2025] GFlow: Recovering 4D World from Monocular Video☆48Updated 3 months ago
- The official PyTorch code for "Traffic Scene Parsing through the TSP6K Dataset".☆33Updated last month
- [ECCV'24] A novel weakly supervised framework for 3D object detection from 2D bounding boxes. It can easily extend to novel scenarios and…☆31Updated last year
- [CVPR 2024] 🏡Know Your Neighbors: Improving Single-View Reconstruction via Spatial Vision-Language Reasoning☆79Updated last year