facebookresearch / dinoLinks
PyTorch code for Vision Transformers training with the Self-Supervised learning method DINO
☆7,343Updated last year
Alternatives and similar repositories for dino
Users that are interested in dino are comparing it to the libraries listed below
Sorting:
- Official DeiT repository☆4,291Updated last year
- Code release for ConvNeXt model☆6,216Updated 2 years ago
- PyTorch implementation of MAE https//arxiv.org/abs/2111.06377☆8,124Updated last year
- ☆12,087Updated 9 months ago
- PyTorch code and models for the DINOv2 self-supervised learning method.☆11,980Updated 3 months ago
- Scenic: A Jax Library for Computer Vision Research and Beyond☆3,722Updated this week
- Code release for "Masked-attention Mask Transformer for Universal Image Segmentation"☆3,124Updated last year
- PyTorch implementation of MoCo: https://arxiv.org/abs/1911.05722☆5,101Updated 2 months ago
- Deformable DETR: Deformable Transformers for End-to-End Object Detection.☆3,815Updated last year
- OpenMMLab Self-Supervised Learning Toolbox and Benchmark☆3,293Updated 2 years ago
- VISSL is FAIR's library of extensible, modular and scalable components for SOTA Self-Supervised Learning with images.☆3,291Updated last year
- An open source implementation of CLIP.☆13,051Updated last month
- End-to-End Object Detection with Transformers☆14,935Updated last year
- This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".☆15,503Updated last year
- Official PyTorch implementation of SegFormer☆3,217Updated last year
- SimCLRv2 - Big Self-Supervised Models are Strong Semi-Supervised Learners☆4,410Updated 2 years ago
- [ICLR 2023] Official implementation of the paper "DINO: DETR with Improved DeNoising Anchor Boxes for End-to-End Object Detection"☆2,688Updated last year
- OpenMMLab Computer Vision Foundation☆6,326Updated 7 months ago
- Collect some papers about transformer with vision. Awesome Transformer with Computer Vision (CV)☆3,555Updated 11 months ago
- Unofficial PyTorch implementation of Masked Autoencoders Are Scalable Vision Learners☆2,678Updated 2 years ago
- Grounded Language-Image Pre-training☆2,550Updated last year
- CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image☆31,860Updated last year
- Pytorch reimplementation of the Vision Transformer (An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale)☆2,105Updated 3 years ago
- OpenMMLab Semantic Segmentation Toolbox and Benchmark.☆9,466Updated last year
- EVA Series: Visual Representation Fantasies from BAAI☆2,614Updated last year
- Official codebase used to develop Vision Transformer, SigLIP, MLP-Mixer, LiT and more.☆3,260Updated 6 months ago
- PyTorch implementation of SwAV https//arxiv.org/abs/2006.09882☆2,086Updated 2 years ago
- Dense Prediction Transformers☆2,286Updated 11 months ago
- Advanced AI Explainability for computer vision. Support for CNNs, Vision Transformers, Classification, Object detection, Segmentation, I…☆12,416Updated 8 months ago
- A deep learning library for video understanding research.☆3,510Updated last month