mukherjeesrijit / Vision-Transformer-ViT-from-scratch
This repository is an implementation of the ViT paper from scratch, with tutorials on the model, data loading, training, inference, fine-tuning, and application.
☆13Updated last year
Alternatives and similar repositories for Vision-Transformer-ViT-from-scratch
Users who are interested in Vision-Transformer-ViT-from-scratch are comparing it to the libraries listed below:
- Repository of notes, code and notebooks in Python for the book "Reinforcement Learning: An Introduction" by Richard S. Sutton and Andrew …☆34Updated 3 weeks ago
- Interactive textbook on state-space models☆197Updated last year
- ☆68Updated 2 years ago
- ☆27Updated last year
- Materials of the Nordic Probabilistic AI School 2023.☆90Updated last year
- Material for the "Probabilistic Machine Learning" Course at the University of Tübingen, Summer Term 2023☆179Updated last year
- ☆104Updated 3 weeks ago
- 📄Small Batch Size Training for Language Models☆60Updated 2 weeks ago
- ☆51Updated last year
- NUS CS5242 Neural Networks and Deep Learning, Xavier Bresson, 2025☆401Updated 4 months ago
- Python & Matlab code for the figures from the book "Learning Theory from First Principles" by Francis Bach☆124Updated last year
- Reliable, minimal and scalable library for pretraining foundation and world models☆58Updated last week
- Learning Deep Representations of Data Distributions☆267Updated this week
- The Automated LLM Speedrunning Benchmark measures how well LLM agents can reproduce previous innovations and discover new ones in languag…☆96Updated last month
- ☆63Updated 5 months ago
- [ICLR'25] Artificial Kuramoto Oscillatory Neurons☆101Updated last month
- Simple, minimal implementation of the Mamba SSM in one pytorch file. Using logcumsumexp (Heisen sequence).☆122Updated 10 months ago
- $100K or 100 Days: Trade-offs when Pre-Training with Academic Resources☆146Updated 3 months ago
- ☆30Updated 10 months ago
- Jax/Flax rewrite of Karpathy's nanoGPT☆60Updated 2 years ago
- Lightning-like training API for JAX with Flax☆42Updated 9 months ago
- Resources needed to start deep learning research. ML/DL/CV/NLP/ML-SYS/RL/Graphs/Maths/Med image lecture videos from professors at esteeme…☆87Updated last month
- ☆42Updated 8 months ago
- 11-785 Introduction to Deep Learning (IDeeL) website with logistics and select course materials☆63Updated last week
- ☆81Updated last year
- PyTorch Code for Energy-Based Transformers paper -- generalizable reasoning and scalable learning☆492Updated last week
- ☆38Updated last year
- My solutions to DLFC - Deep Learning: Foundations and Concepts☆88Updated 5 months ago
- We study toy models of skill learning.☆31Updated 7 months ago
- Unofficial but Efficient Implementation of "Mamba: Linear-Time Sequence Modeling with Selective State Spaces" in JAX☆88Updated last year