markriedl / transformer-walkthrough
A walkthrough of transformer architecture code
☆318 Updated 9 months ago
Related projects
Alternatives and complementary repositories for transformer-walkthrough
- All about the fundamental blocks of TF and JAX! ☆271 Updated 2 years ago
- MinT: Minimal Transformer Library and Tutorials ☆248 Updated 2 years ago
- Puzzles for exploring transformers ☆325 Updated last year
- A library to inspect and extract intermediate layers of PyTorch models. ☆470 Updated 2 years ago
- Host repository for the "Reproducible Deep Learning" PhD course ☆405 Updated 2 years ago
- A pure-functional implementation of a machine learning transformer model in Python/JAX ☆175 Updated 2 years ago
- An interactive exploration of Transformer programming. ☆246 Updated last year
- ☆391 Updated last month
- 100 exercises to learn JAX ☆569 Updated 2 years ago
- Lightning Bits: Engineering for Researchers repo ☆130 Updated 2 years ago
- The "tl;dr" on a few notable transformer papers (pre-2022). ☆189 Updated last year
- All about the fundamentals and working of Diffusion Models ☆152 Updated last year
- Named tensors with first-class dimensions for PyTorch ☆322 Updated last year
- Highly commented implementations of Transformers in PyTorch ☆128 Updated last year
- Helps you write algorithms in PyTorch that adapt to the available (CUDA) memory ☆428 Updated 2 months ago
- ☆139 Updated 3 months ago
- Seminar on Large Language Models (COMP790-101 at UNC Chapel Hill, Fall 2022) ☆308 Updated last year
- For optimization algorithm research and development. ☆449 Updated this week
- Package for extracting and mapping the results of every single tensor operation in a PyTorch model in one line of code. ☆482 Updated last week
- Representation Learning MSc course Summer Semester 2023 ☆70 Updated last year
- Resources from the EleutherAI Math Reading Group ☆51 Updated last month
- Notebooks for "Probabilistic Machine Learning" book ☆202 Updated 2 years ago
- A Machine Learning workflow for Slurm. ☆146 Updated 3 years ago
- What would you do with 1000 H100s... ☆903 Updated 10 months ago
- ☆751 Updated last week
- Official repository for CMU Machine Learning Department's 10721: "Philosophical Foundations of Machine Intelligence". ☆260 Updated last year
- 🤖 A PyTorch library of curated Transformer models and their composable components ☆866 Updated 7 months ago
- Automatic gradient descent ☆206 Updated last year