minitorch / Module-0
Module 0 - Fundamentals
☆101Updated 7 months ago
Alternatives and similar repositories for Module-0:
Users that are interested in Module-0 are comparing it to the libraries listed below
- A pure-functional implementation of a machine learning transformer model in Python/JAX☆177Updated 2 months ago
- MinT: Minimal Transformer Library and Tutorials☆253Updated 2 years ago
- ☆108Updated 2 years ago
- Docs☆143Updated 4 months ago
- Silly twitter torch implementations.☆46Updated 2 years ago
- Transformer Grammars: Augmenting Transformer Language Models with Syntactic Inductive Biases at Scale, TACL (2022)☆125Updated 5 months ago
- ☆102Updated 4 years ago
- Resources from the EleutherAI Math Reading Group☆53Updated last month
- ☆428Updated 5 months ago
- The official repository for our paper "The Devil is in the Detail: Simple Tricks Improve Systematic Generalization of Transformers". We s…☆67Updated 2 years ago
- An interactive exploration of Transformer programming.☆262Updated last year
- A case study of efficient training of large language models using commodity hardware.☆69Updated 2 years ago
- Seminar on Large Language Models (COMP790-101 at UNC Chapel Hill, Fall 2022)☆310Updated 2 years ago
- Repository containing code for "How to Train BERT with an Academic Budget" paper☆312Updated last year
- ☆153Updated 4 years ago
- Framework-agnostic library for checking array/tensor shapes at runtime.☆46Updated 3 years ago
- Puzzles for exploring transformers☆342Updated last year
- Named tensors with first-class dimensions for PyTorch☆320Updated last year
- Implementation of the GBST block from the Charformer paper, in Pytorch☆116Updated 3 years ago
- Code for the Shortformer model, from the ACL 2021 paper by Ofir Press, Noah A. Smith and Mike Lewis.☆146Updated 3 years ago
- Python Research Framework☆106Updated 2 years ago
- Neural Networks and the Chomsky Hierarchy☆205Updated last year
- Check if you have training samples in your test set☆64Updated 2 years ago
- ☆346Updated last year
- ☆166Updated last year
- ☆87Updated 2 years ago
- ☆85Updated 4 years ago
- Functional local implementations of main model parallelism approaches☆95Updated 2 years ago
- Implicit MLE: Backpropagating Through Discrete Exponential Family Distributions☆258Updated last year
- a lightweight transformer library for PyTorch☆71Updated 3 years ago