pliang279 / awesome-multimodal-mlLinks
Reading list for research topics in multimodal machine learning
☆6,504Updated 10 months ago
Alternatives and similar repositories for awesome-multimodal-ml
Users that are interested in awesome-multimodal-ml are comparing it to the libraries listed below
Sorting:
- A curated list of Multimodal Related Research.☆1,355Updated last year
- Collect some papers about transformer with vision. Awesome Transformer with Computer Vision (CV)☆3,495Updated 5 months ago
- This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".☆14,911Updated 10 months ago
- PyTorch implementation of MoCo: https://arxiv.org/abs/1911.05722☆4,998Updated last month
- ☆11,478Updated 3 months ago
- A PyTorch implementation of the Transformer model in "Attention is All You Need".☆9,242Updated last year
- A curated list of awesome self-supervised methods☆6,291Updated 11 months ago
- [CVPR 2021] Official PyTorch implementation for Transformer Interpretability Beyond Attention Visualization, a novel method to visualize …☆1,897Updated last year
- Recent Advances in Vision and Language PreTrained Models (VL-PTMs)☆1,152Updated 2 years ago
- Code for the ICML 2021 (long talk) paper: "ViLT: Vision-and-Language Transformer Without Convolution or Region Supervision"☆1,471Updated last year
- Acceptance rates for the major AI conferences☆4,513Updated 4 months ago
- PyTorch deep learning projects made easy.☆4,957Updated last year
- awesome grounding: A curated list of research papers in visual grounding☆1,078Updated 2 years ago
- Official DeiT repository☆4,216Updated last year
- AI conference deadline countdowns☆5,848Updated 9 months ago
- An ultimately comprehensive paper list of Vision Transformer/Attention, including papers, codes, and related websites☆4,886Updated 10 months ago
- 🎨 ML Visuals contains figures and templates which you can reuse and customize to improve your scientific writing.☆15,281Updated 2 years ago
- The easiest way to use deep metric learning in your application. Modular, flexible, and extensible. Written in PyTorch.☆6,168Updated 2 months ago
- PyTorch implementation of "Supervised Contrastive Learning" (and SimCLR incidentally)☆3,304Updated last year
- Advanced AI Explainability for computer vision. Support for CNNs, Vision Transformers, Classification, Object detection, Segmentation, I…☆11,783Updated 2 months ago
- TorchMultimodal is a PyTorch library for training state-of-the-art multimodal multi-task models at scale.☆1,613Updated this week
- Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Py…☆23,145Updated 3 months ago
- Awesome Knowledge-Distillation. 分类整理的知识蒸馏paper(2014-2021)。☆2,602Updated 2 years ago
- Pretrained ConvNets for pytorch: NASNet, ResNeXt, ResNet, InceptionV4, InceptionResnetV2, Xception, DPN, etc.☆9,092Updated 3 years ago
- Prompt Learning for Vision-Language Models (IJCV'22, CVPR'22)☆1,979Updated last year
- A comprehensive list of awesome contrastive self-supervised learning papers.☆1,276Updated 9 months ago
- A curated list of resources for Learning with Noisy Labels☆2,682Updated last month
- Recent Transformer-based CV and related works.☆1,333Updated last year
- A concise but complete full-attention transformer with a set of promising experimental features from various papers☆5,396Updated this week
- The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights --…☆34,507Updated this week