minitorch / Module-0
Module 0 - Fundamentals
☆99Updated 2 months ago
Related projects ⓘ
Alternatives and complementary repositories for Module-0
- MinT: Minimal Transformer Library and Tutorials☆248Updated 2 years ago
- A pure-functional implementation of a machine learning transformer model in Python/JAX☆175Updated 2 years ago
- ☆108Updated last year
- Resources from the EleutherAI Math Reading Group☆51Updated last month
- Docs☆143Updated last month
- Check if you have training samples in your test set☆64Updated 2 years ago
- ☆161Updated last year
- Code for the Shortformer model, from the ACL 2021 paper by Ofir Press, Noah A. Smith and Mike Lewis.☆145Updated 3 years ago
- An interactive exploration of Transformer programming.☆246Updated last year
- Silly twitter torch implementations.☆46Updated 2 years ago
- Named tensors with first-class dimensions for PyTorch☆322Updated last year
- Implementation of the GBST block from the Charformer paper, in Pytorch☆117Updated 3 years ago
- A library to inspect and extract intermediate layers of PyTorch models.☆470Updated 2 years ago
- An interpreter for RASP as described in the ICML 2021 paper "Thinking Like Transformers"☆284Updated 2 months ago
- ☆391Updated last month
- Puzzles for exploring transformers☆325Updated last year
- The official repository for our paper "The Devil is in the Detail: Simple Tricks Improve Systematic Generalization of Transformers". We s…☆66Updated last year
- A case study of efficient training of large language models using commodity hardware.☆68Updated 2 years ago
- A 🤗-style implementation of BERT using lambda layers instead of self-attention☆70Updated 4 years ago
- http://nlp.seas.harvard.edu/2018/04/03/attention.html☆63Updated 3 years ago
- ☆101Updated 3 years ago
- diagNNose is a Python library that facilitates a broad set of tools for analysing hidden activations of neural models.☆81Updated last year
- An implementation of masked language modeling for Pytorch, made as concise and simple as possible☆177Updated last year
- Implementation of Gated State Spaces, from the paper "Long Range Language Modeling via Gated State Spaces", in Pytorch☆95Updated last year
- ☆38Updated last year
- ☆187Updated 2 years ago
- Transformer Grammars: Augmenting Transformer Language Models with Syntactic Inductive Biases at Scale, TACL (2022)☆119Updated 3 weeks ago
- Framework-agnostic library for checking array/tensor shapes at runtime.☆47Updated 3 years ago
- Rax is a Learning-to-Rank library written in JAX.☆319Updated 3 weeks ago
- ☆334Updated 7 months ago