OpenGVLab/STM-Evaluation

☆70

Alternatives and similar repositories for STM-Evaluation

Users who are interested in STM-Evaluation are comparing it to the repositories listed below.

OpenGVLab / M3I-Pretraining
[CVPR 2023] Implementation of "Towards All-in-one Pre-training via Maximizing Multi-modal Mutual Information".
☆91 · Jun 1, 2023 · Updated 3 years ago
donglixp / ICL_PaperList
Paper List for In-context Learning 🌷
☆19 · Jan 3, 2023 · Updated 3 years ago
HVision-NKU / Conv2Former
☆187 · Jan 2, 2025 · Updated last year
fundamentalvision / Uni-Perceiver
☆291 · Aug 14, 2025 · Updated 11 months ago
ucasligang / SimViT
[ICME 2022] Code for the paper "SimViT: Exploring a Simple Vision Transformer with Sliding Windows".
☆67 · Oct 11, 2022 · Updated 3 years ago
OpenGVLab / InternImage
[CVPR 2023 Highlight] InternImage: Exploring Large-Scale Vision Foundation Models with Deformable Convolutions
☆2,836 · Mar 25, 2025 · Updated last year
ziplab / SN-Netv2
[ECCV 2024] This is the official implementation of "Stitched ViTs are Flexible Vision Backbones".
☆29 · Jan 23, 2024 · Updated 2 years ago
zhechen / Deformable-DETR-REGO
☆41 · Sep 21, 2023 · Updated 2 years ago
megvii-research / RevCol
Official code for the papers "Reversible Column Networks" and "RevColv2"
☆266 · Sep 6, 2023 · Updated 2 years ago
zehuichen123 / NoiseDet
[ICCV 2023] NoiseDet: Learning from Noisy Data for Semi-Supervised 3D Object Detection
☆20 · Feb 5, 2023 · Updated 3 years ago
Zeqiang-Lai / Mini-DALLE3
Mini-DALLE3: Interactive Text to Image by Prompting Large Language Models
☆313 · Dec 28, 2023 · Updated 2 years ago
OpenGVLab / Awesome-DragGAN
Awesome-DragGAN: A curated list of papers, tutorials, and repositories related to DragGAN
☆83 · Nov 8, 2023 · Updated 2 years ago
OpenGVLab / DDPS
Official Implementation of "Denoising Diffusion Semantic Segmentation with Mask Prior Modeling"
☆76 · Jul 27, 2023 · Updated 2 years ago
X2FD / LVIS-INSTRUCT4V
☆134 · Dec 22, 2023 · Updated 2 years ago
OpenGVLab / InternLMM
☆16 · Jul 6, 2023 · Updated 3 years ago
JIA-Lab-research / SA-AutoAug
Scale-aware Automatic Augmentation for Object Detection (CVPR 2021)
☆198 · Aug 24, 2022 · Updated 3 years ago
zhiqi-li / WechatLogger
An mmcv logger hook that can push model results to WeChat.
☆21 · Oct 11, 2022 · Updated 3 years ago
hustvl / RILS
[CVPR 2023] RILS: Masked Visual Reconstruction in Language Semantic Space (https://arxiv.org/abs/2301.06958)
☆44 · Sep 5, 2023 · Updated 2 years ago
sail-sg / metaformer
MetaFormer Baselines for Vision (TPAMI 2024)
☆500 · Jun 1, 2024 · Updated 2 years ago
daijifeng001 / NSFC-LaTex
☆38 · Mar 24, 2023 · Updated 3 years ago
OpenGVLab / Siamese-Image-Modeling
[CVPR 2023] Implementation of Siamese Image Modeling for Self-Supervised Vision Representation Learning
☆41 · Jun 6, 2024 · Updated 2 years ago
enyac-group / supmae
This is an official PyTorch/GPU implementation of SupMAE.
☆80 · Aug 30, 2022 · Updated 3 years ago
AILab-CVC / VL-GPT
VL-GPT: A Generative Pre-trained Transformer for Vision and Language Understanding and Generation
☆86 · Sep 12, 2024 · Updated last year
amirbar / visual_prompting
Official implementation and data release of the paper "Visual Prompting via Image Inpainting".
☆319 · Aug 7, 2023 · Updated 2 years ago
OpenGVLab / DCNv4
[CVPR 2024] Deformable Convolution v4
☆743 · May 17, 2024 · Updated 2 years ago
OpenGVLab / Official-ConvMAE-Det
☆18 · Aug 23, 2022 · Updated 3 years ago
jshilong / DDQ
(CVPR 2023) Dense Distinct Query for End-to-End Object Detection
☆266 · May 24, 2023 · Updated 3 years ago
NVlabs / M2BEV
☆59 · Apr 18, 2022 · Updated 4 years ago
speedinghzl / ShuffleTransformer
☆25 · Jun 24, 2021 · Updated 5 years ago
LeapLabTHU / Deep-Incubation
Code release for Deep Incubation (https://arxiv.org/abs/2212.04129)
☆92 · Mar 16, 2023 · Updated 3 years ago
changlin31 / AutoProg
(CVPR 2022) Automated Progressive Learning for Efficient Training of Vision Transformers
☆25 · Feb 26, 2025 · Updated last year
FocalNet / FocalNet-DINO
This repo contains the code and configuration files for reproducing the object detection results of FocalNets with DINO.
☆68 · Mar 10, 2023 · Updated 3 years ago
OpenDriveLab / maskalign
[CVPR 2023] Official repository for the paper "Stare at What You See: Masked Image Modeling without Reconstruction"
☆71 · Jul 2, 2025 · Updated last year
jshilong / GroupRCNN
Group R-CNN for Point-based Weakly Semi-supervised Object Detection (CVPR 2022)
☆136 · May 24, 2023 · Updated 3 years ago
Megvii-BaseDetection / DeFCN
End-to-End Object Detection with Fully Convolutional Network
☆494 · Jan 10, 2022 · Updated 4 years ago
ChangyaoTian / ADDP
The official implementation of ADDP (ICLR 2024)
☆12 · Mar 27, 2024 · Updated 2 years ago
yuecao0119 / MMInstruct
[SCIS 2024] The official implementation of the paper "MMInstruct: A High-Quality Multi-Modal Instruction Tuning Dataset with Extensive Di…
☆64 · Nov 7, 2024 · Updated last year
TencentARC / TaCA
Official code for the paper "TaCA: Upgrading Your Visual Foundation Model with Task-agnostic Compatible Adapter".
☆16 · Jun 20, 2023 · Updated 3 years ago
TencentARC / ConMIM
Official code for ConMIM (ICLR 2023)
☆58 · Feb 8, 2023 · Updated 3 years ago