This repository contains an overview of important follow-up works based on the original Vision Transformer (ViT) by Google.
☆194Jan 3, 2022Updated 4 years ago
Alternatives and similar repositories for Vision-Transformer-papers
Users that are interested in Vision-Transformer-papers are comparing it to the libraries listed below
Sorting:
- ☆59Sep 23, 2022Updated 3 years ago
- ☆24Sep 2, 2022Updated 3 years ago
- Contains my experiments with the `big_vision` repo to train ViTs on ImageNet-1k.☆22Jan 16, 2023Updated 3 years ago
- ☆17Nov 4, 2022Updated 3 years ago
- Object recognition with Pepper using a deep learning model☆10Sep 16, 2021Updated 4 years ago
- ☆15Oct 24, 2023Updated 2 years ago
- This repository holds code and other relevant files for the NeurIPS 2022 tutorial: Foundational Robustness of Foundation Models.☆72Jan 13, 2023Updated 3 years ago
- ECCV2022,Bootstrapped Masked Autoencoders for Vision BERT Pretraining☆97Nov 2, 2022Updated 3 years ago
- ☆10May 24, 2020Updated 5 years ago
- GPU accelerated Perlin Noise in python☆11Oct 23, 2020Updated 5 years ago
- Course repository for the Spring 2023 COMP664 course "Deep Learning" at UNC☆14Apr 17, 2023Updated 2 years ago
- Official implementation of "In-style: Bridging Text and Uncurated Videos with Style Transfer for Cross-modal Retrieval." ICCV 2023☆11Oct 5, 2023Updated 2 years ago
- Implementation of MAXIM in TensorFlow.☆140Apr 8, 2025Updated 10 months ago
- Pytorch Seq2Seq framework☆27Feb 18, 2026Updated 2 weeks ago
- C语言扩展Python学习记录☆10Jun 3, 2018Updated 7 years ago
- noting something maybe useful in the future.☆10Jul 12, 2020Updated 5 years ago
- Code for ACL 2023 Oral Paper: ManagerTower: Aggregating the Insights of Uni-Modal Experts for Vision-Language Representation Learning☆12Aug 23, 2025Updated 6 months ago
- Transforming textual descriptions into process models using deep learning☆15May 16, 2019Updated 6 years ago
- ☆13Sep 27, 2022Updated 3 years ago
- My PhD manuscript LaTeX code and the slides for the defense☆11Feb 2, 2022Updated 4 years ago
- This repository including most of cnn visualizations techniques using pytorch☆14Apr 14, 2020Updated 5 years ago
- ☆14Dec 25, 2020Updated 5 years ago
- Turns educational youtube videos into blog posts☆16Feb 23, 2024Updated 2 years ago
- This repository hosts the code to port NumPy model weights of BiT-ResNets to TensorFlow SavedModel format.☆14Dec 21, 2021Updated 4 years ago
- Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities☆80Jan 7, 2026Updated last month
- A summarization of Transformer-based architectures for CV tasks, including image classification, object detection, segmentation, and Few-…☆114Jun 17, 2022Updated 3 years ago
- The official code of Linguistic More: Taking a Further Step toward Efficient and Accurate Scene Text Recognition (IJCAI2023)☆27Sep 3, 2023Updated 2 years ago
- A collection of 100 Deep Learning images and visualizations☆81Jul 13, 2021Updated 4 years ago
- CargoCoin is designed to be a smart contract, crypto currency platform, decentralising global trade and transport. The platform target is…☆13Aug 8, 2018Updated 7 years ago
- Bash script to install gcc-4.9.2 and boost-1.57 on CentOS 5.x, CentOS 6.x and Mac OS X. Languages: c++, c and go. Includes tcmalloc on …☆13May 7, 2016Updated 9 years ago
- A lightweight wrapper around https://github.com/facebookresearch/encodec that enables dynamic streamed reading, seeking, metadata and GPU…☆15May 5, 2024Updated last year