☆72Mar 10, 2025Updated 11 months ago
Alternatives and similar repositories for STM-Evaluation
Users that are interested in STM-Evaluation are comparing it to the libraries listed below
Sorting:
- [CVPR 2023] implementation of Towards All-in-one Pre-training via Maximizing Multi-modal Mutual Information.☆91Jun 1, 2023Updated 2 years ago
- [ICME 2022] code for the paper, SimVit: Exploring a simple vision transformer with sliding windows.☆68Oct 11, 2022Updated 3 years ago
- ☆285Aug 14, 2025Updated 6 months ago
- Paper List for In-context Learning 🌷☆20Jan 3, 2023Updated 3 years ago
- ☆188Jan 2, 2025Updated last year
- [ECCV 2024] This is the official implementation of "Stitched ViTs are Flexible Vision Backbones".☆29Jan 23, 2024Updated 2 years ago
- Official Code of Paper "Reversible Column Networks" "RevColv2"☆265Sep 6, 2023Updated 2 years ago
- [CVPR 2023] RILS: Masked Visual Reconstruction in Language Semantic Space (https://arxiv.org/abs/2301.06958)☆44Sep 5, 2023Updated 2 years ago
- ☆41Sep 21, 2023Updated 2 years ago
- ☆25Jun 24, 2021Updated 4 years ago
- ☆16Jul 7, 2023Updated 2 years ago
- Official implementation and data release of the paper "Visual Prompting via Image Inpainting".☆318Aug 7, 2023Updated 2 years ago
- A practice for million-scale multi-domain universal object detection☆28Jun 13, 2024Updated last year
- [CVPR 2023 Highlight] InternImage: Exploring Large-Scale Vision Foundation Models with Deformable Convolutions☆2,793Mar 25, 2025Updated 11 months ago
- Mini-DALLE3: Interactive Text to Image by Prompting Large Language Models☆313Dec 28, 2023Updated 2 years ago
- (CVPR2023)Dense Distinct Query for End-to-End Object Detection☆264May 24, 2023Updated 2 years ago
- SOIT: Segmenting Objects with Instance-Aware Transformers☆14Jun 6, 2022Updated 3 years ago
- [ICCV2023] NoiseDet: Learning from Noisy Data for Semi-Superivsed 3D Object Detection☆21Feb 5, 2023Updated 3 years ago
- This is a offical PyTorch/GPU implementation of SupMAE.☆80Aug 30, 2022Updated 3 years ago
- This repo contains the code and configuration files for reproducing object detection results of FocalNets with DINO☆68Mar 10, 2023Updated 2 years ago
- ☆105Jul 7, 2023Updated 2 years ago
- 一个mmcv 的logger hook, 可以用来把模型结果推送到微信上☆21Oct 11, 2022Updated 3 years ago
- ☆20Feb 22, 2021Updated 5 years ago
- Scale-aware Automatic Augmentation for Object Detection (CVPR 2021)☆199Aug 24, 2022Updated 3 years ago
- Group R-CNN for Point-based Weakly Semi-supervised Object Detection (CVPR2022)☆138May 24, 2023Updated 2 years ago
- MetaFormer Baselines for Vision (TPAMI 2024)☆495Jun 1, 2024Updated last year
- Official Codes and Pretrained Models for RecursiveMix☆22Apr 24, 2023Updated 2 years ago
- ☆134Dec 22, 2023Updated 2 years ago
- Implementation of the paper ''Implicit Feature Refinement for Instance Segmentation''.☆20Oct 27, 2021Updated 4 years ago
- MMMG: A Massive, Multidisciplinary, Multi-Tier Generation Benchmark for Text-to-Image Reasoning [NeurIPS 2025 Poster]☆22Dec 10, 2025Updated 2 months ago
- ☆97Mar 23, 2021Updated 4 years ago
- A Close Look at Spatial Modeling: From Attention to Convolution☆92Dec 27, 2022Updated 3 years ago
- Code release for Deep Incubation (https://arxiv.org/abs/2212.04129)☆92Mar 16, 2023Updated 2 years ago
- VL-GPT: A Generative Pre-trained Transformer for Vision and Language Understanding and Generation☆86Sep 12, 2024Updated last year
- Official codes for ConMIM (ICLR 2023)☆58Feb 8, 2023Updated 3 years ago
- Implementation of Enhancing Your Trained DETRs with Box Refinement☆60Jul 26, 2023Updated 2 years ago
- [CVPR2023] This is an official implementation of paper "DETRs with Hybrid Matching".☆276Apr 14, 2023Updated 2 years ago
- [Preprint 2022] “Can We Solve 3D Vision Tasks Starting from A 2D Vision Transformer?” by Yi Wang, Zhiwen Fan, Tianlong Chen, Hehe Fan, Zh…☆63Jan 18, 2023Updated 3 years ago
- ☆60Apr 18, 2022Updated 3 years ago