Traffic-X/ViT-CoMer

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/Traffic-X/ViT-CoMer)

Traffic-X / ViT-CoMer

Official implementation of the CVPR 2024 paper ViT-CoMer: Vision Transformer with Convolutional Multi-scale Feature Interaction for Dense Predictions.

☆347

Alternatives and similar repositories for ViT-CoMer

Users that are interested in ViT-CoMer are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

Traffic-X / Open-TransMind
View on GitHub
Official implementation of the CVPR paper Open-TransMind: A New Baseline and Benchmark for 1st Foundation Model Challenge of Intelligent …
☆28Jun 4, 2023Updated 3 years ago
czczup / ViT-Adapter
View on GitHub
[ICLR 2023 Spotlight] Vision Transformer Adapter for Dense Predictions
☆1,503Jun 3, 2025Updated last year
tue-mps / benchmark-vfm-ss
View on GitHub
☆37Jul 18, 2025Updated last year
Leiyi-HU / mona
View on GitHub
The official implementation of [CVPR 2025] "5%>100%: Breaking Performance Shackles of Full Fine-Tuning on Visual Recognition Tasks".
☆397Jun 23, 2025Updated last year
AFeng-x / SMT
View on GitHub
[ICCV2023] This is an official implementation for "Scale-Aware Modulation Meet Transformer".
☆215Aug 1, 2023Updated 2 years ago
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
DaiShiResearch / TransNeXt
View on GitHub
[CVPR 2024] Code release for TransNeXt model
☆575Jun 13, 2024Updated 2 years ago
Sense-X / Co-DETR
View on GitHub
[ICCV 2023] DETRs with Collaborative Hybrid Assignments Training
☆1,355Dec 29, 2024Updated last year
Traffic-X / MonoLSS
View on GitHub
Official implementation of the 3DV 2024 paper MonoLSS: Learnable Sample Selection For Monocular 3D Detection
☆42Sep 19, 2024Updated last year
PoTsui99 / UniRGB-IR
View on GitHub
Official repo for UniRGB-IR.
☆58Nov 28, 2025Updated 7 months ago
qhfan / RMT
View on GitHub
(CVPR2024)RMT: Retentive Networks Meet Vision Transformer
☆391Jul 29, 2024Updated last year
serdarch / SERNet-Former
View on GitHub
[CVPR 2024 Workshops] SERNet-Former: Semantic Segmentation by Efficient Residual Network with Attention-Boosting Gates and Attention-Fusi…
☆69Nov 28, 2024Updated last year
AIM-SKKU / CSCA
View on GitHub
Spatio-channel Attention Blocks for Cross-modal Crowd Counting -- Official Pytorch Implementation (ACCV'22, Oral)
☆28Dec 4, 2023Updated 2 years ago
AmirMansurian / AttnFD
View on GitHub
[WACV'26] Attention as Geometric Transformation: Revisiting Feature Distillation for Semantic Segmentation
☆43Jun 8, 2026Updated last month
Traffic-X / MonoUNI
View on GitHub
Official implementation of the NeurIPS 2023 paper MonoUNI: A Unified Vehicle and Infrastructure-side Monocular 3D Object Detection Networ…
☆63Jul 14, 2026Updated last week
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
OpenGVLab / DCNv4
View on GitHub
[CVPR 2024] Deformable Convolution v4
☆743May 17, 2024Updated 2 years ago
Atten4Vis / LW-DETR
View on GitHub
This repository is an official implementation of the paper "LW-DETR: A Transformer Replacement to YOLO for Real-Time Detection".
☆505Feb 18, 2025Updated last year
MzeroMiko / VMamba
View on GitHub
VMamba: Visual State Space Models，code is based on mamba
☆3,206Mar 7, 2025Updated last year
xzz777 / SCTNet
View on GitHub
Official implementation of SCTNet (AAAI2024)
☆322Jan 17, 2024Updated 2 years ago
Linwei-Chen / FreqFusion
View on GitHub
TPAMI：Frequency-aware Feature Fusion for Dense Image Prediction
☆502Nov 25, 2025Updated 7 months ago
GSavathrakis / ShipRS_H2OBB
View on GitHub
This is a module whose task is the automated transformation of Horizontal to Oriented Bounding Boxes for ship detection tasks
☆16Sep 5, 2024Updated last year
megvii-research / RevCol
View on GitHub
Official Code of Paper "Reversible Column Networks" "RevColv2"
☆266Sep 6, 2023Updated 2 years ago
xiuqhou / Relation-DETR
View on GitHub
[ECCV2024 Oral] Official implementation of the paper "Relation DETR: Exploring Explicit Position Relation Prior for Object Detection"
☆262Nov 24, 2024Updated last year
NiccoloCavagnero / PEM
View on GitHub
[CVPR 2024] PEM: Prototype-based Efficient MaskFormer for Image Segmentation
☆130Mar 10, 2025Updated last year
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
hyunwoo137 / MetaSeg
View on GitHub
Official Pytorch implementations for "MetaSeg: MetaFormer-based Global Contexts-aware Network for Efficient Semantic Segmentation" (WACV …
☆41Aug 30, 2024Updated last year
NVlabs / STL
View on GitHub
Official Pytorch Implementation of Self-emerging Token Labeling
☆35Mar 27, 2024Updated 2 years ago
NUST-Machine-Intelligence-Laboratory / VideoMAC
View on GitHub
☆19Mar 1, 2024Updated 2 years ago
luogen1996 / LWTransformer
View on GitHub
Lightweight Transformer for Multi-modal Tasks
☆16Dec 9, 2022Updated 3 years ago
SysCV / cascade-detr
View on GitHub
[ICCV'23] Cascade-DETR: Delving into High-Quality Universal Object Detection
☆100Sep 12, 2023Updated 2 years ago
shenyunhang / APE
View on GitHub
[CVPR 2024] Aligning and Prompting Everything All at Once for Universal Visual Perception
☆609May 8, 2024Updated 2 years ago
lanyunzhu99 / LLaFS
View on GitHub
☆31Nov 28, 2023Updated 2 years ago
OpenGVLab / PIIP
View on GitHub
[NeurIPS 2024 Spotlight ⭐️ & TPAMI 2025] Parameter-Inverted Image Pyramid Networks (PIIP)
☆113Aug 5, 2025Updated 11 months ago
facebookresearch / Mask2Former
View on GitHub
Code release for "Masked-attention Mask Transformer for Universal Image Segmentation"
☆3,415Jul 29, 2024Updated last year
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
ma-xu / Rewrite-the-Stars
View on GitHub
[CVPR 2024] Rewrite the Stars
☆459May 7, 2024Updated 2 years ago
OpenGVLab / InternImage
View on GitHub
[CVPR 2023 Highlight] InternImage: Exploring Large-Scale Vision Foundation Models with Deformable Convolutions
☆2,837Mar 25, 2025Updated last year
xinghaochen / DECO
View on GitHub
[ICLR 2025] Official PyTorch implementation of "DECO: Query-Based End-to-End Object Detection with ConvNets"
☆65Jan 23, 2025Updated last year
FoundationVision / GLEE
View on GitHub
[CVPR2024 Highlight]GLEE: General Object Foundation Model for Images and Videos at Scale
☆1,172Oct 21, 2024Updated last year
lyuwenyu / RT-DETR
View on GitHub
[CVPR 2024] Official RT-DETR (RTDETR paddle pytorch), Real-Time DEtection TRansformer, DETRs Beat YOLOs on Real-time Object Detection. 🔥…
☆5,394Jun 15, 2026Updated last month
roymiles / Simple-Recipe-Distillation
View on GitHub
[AAAI 2024] Understanding the Role of the Projector in Knowledge Distillation
☆20Feb 13, 2024Updated 2 years ago
IDEA-Research / detrex
View on GitHub
detrex is a research platform for DETR-based object detection, segmentation, pose estimation and other visual recognition tasks.
☆2,303Sep 11, 2025Updated 10 months ago