☆110Jun 30, 2023Updated 2 years ago
Alternatives and similar repositories for AiT
Users that are interested in AiT are comparing it to the libraries listed below
Sorting:
- This is an official implementation of our CVPR 2023 paper "Revealing the Dark Secrets of Masked Image Modeling" on Depth Estimation.☆175Mar 27, 2023Updated 2 years ago
- Official implementation and data release of the paper "Visual Prompting via Image Inpainting".☆318Aug 7, 2023Updated 2 years ago
- [ICCV 2023] VPD is a framework that leverages the high-level and low-level knowledge of a pre-trained text-to-image diffusion model to do…☆537Dec 21, 2023Updated 2 years ago
- Official implementation of "Towards 3D Scene Reconstruction from Locally Scale-Aligned Monocular Video Depth (Boosting Monocular Depth Es…☆39Aug 7, 2022Updated 3 years ago
- The code of 'Towards Domain-agnostic depth completion'☆27Aug 4, 2022Updated 3 years ago
- Monocular Depth Estimation Toolbox based on MMSegmentation.☆968Jul 21, 2025Updated 7 months ago
- [TMM2023] URCDC-Depth: Uncertainty Rectified Cross-Distillation with CutFlip for Monocular Depth Estimation☆45Dec 16, 2023Updated 2 years ago
- ☆34Feb 20, 2025Updated last year
- GLPDepth PyTorch Implementation: Global-Local Path Networks for Monocular Depth Estimation with Vertical CutDepth☆199Mar 8, 2024Updated last year
- Code release for Deep Incubation (https://arxiv.org/abs/2212.04129)☆92Mar 16, 2023Updated 2 years ago
- [CVPR 2025] Test-Time Visual In-Context Tuning☆29Dec 31, 2025Updated 2 months ago
- VisionLLM Series☆1,138Feb 27, 2025Updated last year
- Simple Implementation of Pix2Seq model for object detection in PyTorch☆130Sep 2, 2023Updated 2 years ago
- [ICCV'23] Self-supervised Monocular Depth Estimation: Let’s Talk About The Weather☆106Jun 25, 2025Updated 8 months ago
- Voxel Field Fusion for 3D Object Detection (CVPR2022)☆103Jun 1, 2022Updated 3 years ago
- PyTorch Implementation of introducing diffusion approach to 3D depth perception ECCV 2024☆339Oct 31, 2025Updated 4 months ago
- ☆11Oct 20, 2023Updated 2 years ago
- ☆11Aug 10, 2023Updated 2 years ago
- Boundaries and Region Representation Fusion☆12Mar 24, 2023Updated 2 years ago
- SC-Depth (V1, V2, and V3) for Unsupervised Monocular Depth Estimation Webpage//jiawangbian.github.io/sc_depth_pl/☆483Oct 6, 2023Updated 2 years ago
- ☆231Dec 18, 2023Updated 2 years ago
- [CVPR2023] Code Release of Aligning Bag of Regions for Open-Vocabulary Object Detection☆184Oct 25, 2023Updated 2 years ago
- ☆14Nov 25, 2022Updated 3 years ago
- ☆24Jul 16, 2025Updated 7 months ago
- [ICLR 2024 (Spotlight)] "Frozen Transformers in Language Models are Effective Visual Encoder Layers"☆247Jan 17, 2024Updated 2 years ago
- ☆74Dec 8, 2022Updated 3 years ago
- ☆50Nov 10, 2023Updated 2 years ago
- Towards training VQ-VAE models robustly!☆93Jul 14, 2025Updated 7 months ago
- Source code for the Paper "Mind the Gap: Benchmarking Spatial Reasoning in Vision-Language Models"☆18Feb 1, 2026Updated last month
- ☆19Mar 28, 2022Updated 3 years ago
- ☆74Feb 8, 2025Updated last year
- [CVPR'23] AdaMAE: Adaptive Masking for Efficient Spatiotemporal Learning with Masked Autoencoders☆84Feb 2, 2024Updated 2 years ago
- Teach-DETR: Better Training DETR with Teachers☆31Mar 18, 2024Updated last year
- [ICLR 2023 Spotlight] Vision Transformer Adapter for Dense Predictions☆1,475Jun 3, 2025Updated 9 months ago
- Official PyTorch implementation of the paper "In-Context Learning Unlocked for Diffusion Models"☆413Mar 25, 2024Updated last year
- ☆285Aug 14, 2025Updated 6 months ago
- [CVPR 2023] Official Implementation of X-Decoder for generalized decoding for pixel, image and language☆1,343Oct 5, 2023Updated 2 years ago
- Dense Prediction Transformers☆2,313Dec 18, 2024Updated last year
- Official implementation of MARS: Mixture of Auto-Regressive Models for Fine-grained Text-to-image Synthesis☆86Jul 16, 2024Updated last year