google-research / scenic

Scenic: A Jax Library for Computer Vision Research and Beyond

☆3,626

Alternatives and similar repositories for scenic

Users who are interested in scenic are comparing it to the libraries listed below.

google-research / big_vision
Official codebase used to develop Vision Transformer, SigLIP, MLP-Mixer, LiT and more.
☆3,055 Updated 2 months ago
baaivision / EVA
EVA Series: Visual Representation Fantasies from BAAI
☆2,547 Updated last year
facebookresearch / TimeSformer
The official PyTorch implementation of our paper "Is Space-Time Attention All You Need for Video Understanding?"
☆1,739 Updated last year
facebookresearch / deit
Official DeiT repository
☆4,241 Updated last year
microsoft / GLIP
Grounded Language-Image Pre-training
☆2,474 Updated last year
facebookresearch / dino
PyTorch code for Vision Transformers training with the Self-Supervised learning method DINO
☆7,039 Updated last year
facebookresearch / multimodal
TorchMultimodal is a PyTorch library for training state-of-the-art multimodal multi-task models at scale.
☆1,638 Updated this week
microsoft / Cream
This is a collection of our NAS and Vision Transformer work.
☆1,787 Updated last year
facebookresearch / ConvNeXt
Code release for ConvNeXt model
☆6,085 Updated 2 years ago
SwinTransformer / Video-Swin-Transformer
This is an official implementation for "Video Swin Transformers".
☆1,572 Updated 2 years ago
lucidrains / CoCa-pytorch
Implementation of CoCa, Contrastive Captioners are Image-Text Foundation Models, in PyTorch
☆1,164 Updated last year
facebookresearch / vissl
VISSL is FAIR's library of extensible, modular and scalable components for SOTA Self-Supervised Learning with images.
☆3,285 Updated last year
hila-chefer / Transformer-Explainability
[CVPR 2021] Official PyTorch implementation for Transformer Interpretability Beyond Attention Visualization, a novel method to visualize …
☆1,915 Updated last year
google-research / vision_transformer
☆11,649 Updated 5 months ago
facebookresearch / ConvNeXt-V2
Code release for ConvNeXt V2 model
☆1,804 Updated 11 months ago
MCG-NJU / VideoMAE
[NeurIPS 2022 Spotlight] VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training
☆1,553 Updated last year
facebookresearch / pytorchvideo
A deep learning library for video understanding research.
☆3,463 Updated 6 months ago
facebookresearch / hiera
Hiera: A fast, powerful, and simple hierarchical vision transformer.
☆1,010 Updated last year
ShoufaChen / DiffusionDet
[ICCV2023 Best Paper Finalist] PyTorch implementation of DiffusionDet (https://arxiv.org/abs/2211.09788)
☆2,193 Updated 2 years ago
DirtyHarryLYL / Transformer-in-Vision
Recent Transformer-based CV and related works.
☆1,335 Updated last year
open-mmlab / mmselfsup
OpenMMLab Self-Supervised Learning Toolbox and Benchmark
☆3,276 Updated 2 years ago
facebookresearch / mae
PyTorch implementation of MAE: https://arxiv.org/abs/2111.06377
☆7,937 Updated last year
jeonsworld / ViT-pytorch
PyTorch reimplementation of the Vision Transformer (An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale)
☆2,069 Updated 3 years ago
cmhungsteve / Awesome-Transformer-Attention
An ultimately comprehensive paper list of Vision Transformer/Attention, including papers, codes, and related websites
☆4,913 Updated last year
facebookresearch / Mask2Former
Code release for "Masked-attention Mask Transformer for Universal Image Segmentation"
☆2,911 Updated last year
apple / ml-cvnets
CVNets: A library for training computer vision networks
☆1,906 Updated last year
facebookresearch / moco-v3
PyTorch implementation of MoCo v3: https://arxiv.org/abs/2104.02057
☆1,281 Updated 3 years ago
facebookresearch / Detic
Code release for "Detecting Twenty-thousand Classes using Image-level Supervision".
☆1,961 Updated last year
jacobgil / vit-explain
Explainability for Vision Transformers
☆988 Updated 3 years ago
facebookresearch / moco
PyTorch implementation of MoCo: https://arxiv.org/abs/1911.05722
☆5,033 Updated last month