apple / ml-cvnets
CVNets: A library for training computer vision networks
☆1,793Updated last year
Related projects ⓘ
Alternatives and complementary repositories for ml-cvnets
- EfficientFormerV2 [ICCV 2023] & EfficientFormer [NeurIPs 2022]☆988Updated last year
- This is a collection of our NAS and Vision Transformer work.☆1,679Updated 3 months ago
- Code release for ConvNeXt model☆5,760Updated last year
- Code release for ConvNeXt V2 model☆1,519Updated 2 months ago
- Official DeiT repository☆4,053Updated 7 months ago
- PoolFormer: MetaFormer Is Actually What You Need for Vision (CVPR 2022 Oral)☆1,293Updated 5 months ago
- Scaling Up Your Kernels to 31x31: Revisiting Large Kernel Design in CNNs (CVPR 2022)☆868Updated 6 months ago
- This repository contains the official implementation of the research paper, "An Improved One millisecond Mobile Backbone".☆729Updated 2 years ago
- [ICCV2023 Best Paper Finalist] PyTorch implementation of DiffusionDet (https://arxiv.org/abs/2211.09788)☆2,089Updated last year
- A PyTorch implementation of "MobileViT: Light-weight, General-purpose, and Mobile-friendly Vision Transformer"☆502Updated 2 years ago
- Deformable DETR: Deformable Transformers for End-to-End Object Detection.☆3,228Updated 5 months ago
- This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows" on Object Detection and …☆1,806Updated last year
- detrex is a research platform for DETR-based object detection, segmentation, pose estimation and other visual recognition tasks.☆2,020Updated 2 months ago
- Collection of common code that's shared among different research projects in FAIR computer vision team.☆2,017Updated 3 weeks ago
- Neighborhood Attention Transformer, arxiv 2022 / CVPR 2023. Dilated Neighborhood Attention Transformer, arxiv 2022☆1,052Updated 5 months ago
- ☆812Updated last year
- ICCV2021, Tokens-to-Token ViT: Training Vision Transformers from Scratch on ImageNet☆1,149Updated last year
- Official implementation of PVT series☆1,722Updated 2 years ago
- [ICLR 2023] Official implementation of the paper "DINO: DETR with Improved DeNoising Anchor Boxes for End-to-End Object Detection"☆2,254Updated 3 months ago
- Per-Pixel Classification is Not All You Need for Semantic Segmentation (NeurIPS 2021, spotlight)☆1,352Updated 2 years ago
- Repository of Vision Transformer with Deformable Attention (CVPR2022) and DAT++: Spatially Dynamic Vision Transformerwith Deformable Atte…☆790Updated 6 months ago
- Semi-Supervised Learning, Object Detection, ICCV2021☆903Updated 5 months ago
- This is an official implementation for "SimMIM: A Simple Framework for Masked Image Modeling".☆925Updated 2 years ago
- EVA Series: Visual Representation Fantasies from BAAI☆2,293Updated 3 months ago
- RepVGG: Making VGG-style ConvNets Great Again☆3,327Updated last year
- Code release for "Masked-attention Mask Transformer for Universal Image Segmentation"☆2,549Updated 3 months ago
- Unofficial PyTorch implementation of Masked Autoencoders Are Scalable Vision Learners☆2,598Updated last year
- Implementation of the Swin Transformer in PyTorch.☆794Updated 3 years ago
- Pytorch reimplementation of the Vision Transformer (An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale)☆1,934Updated 2 years ago
- OpenMMLab Model Compression Toolbox and Benchmark.☆1,472Updated 4 months ago