zomux / dlmonitorLinks
☆156Updated last month
Alternatives and similar repositories for dlmonitor
Users that are interested in dlmonitor are comparing it to the libraries listed below
Sorting:
- ☆64Updated 5 years ago
- GPT, but made only out of MLPs☆89Updated 4 years ago
- Code for Neural Arithmetic Units (ICLR) and Measuring Arithmetic Extrapolation Performance (SEDL|NeurIPS)☆145Updated 4 years ago
- ☆104Updated 4 years ago
- HetSeq: Distributed GPU Training on Heterogeneous Infrastructure☆106Updated 2 years ago
- ☆153Updated 5 years ago
- Docs☆143Updated 11 months ago
- An assignment on creating a minimalist neural network toolkit for CS11-747☆64Updated last year
- Unit Testing for pytorch, based on mltest☆312Updated 5 years ago
- Profile the GPU memory usage of every line in a Pytorch code☆83Updated 7 years ago
- Trains Transformer model variants. Data isn't shuffled between batches.☆143Updated 3 years ago
- Implementation for the Lookahead Optimizer.☆243Updated 3 years ago
- Browse the CS/AI/ML research paper graph☆51Updated 2 years ago
- a lightweight transformer library for PyTorch☆72Updated 3 years ago
- A collection of code snippets for my PyTorch Lightning projects☆107Updated 4 years ago
- This repository contains the results for the paper: "Descending through a Crowded Valley - Benchmarking Deep Learning Optimizers"☆184Updated 4 years ago
- [NeurIPS 2022] DataMUX: Data Multiplexing for Neural Networks☆60Updated 2 years ago
- PyTorch functions and utilities to make your life easier☆194Updated 4 years ago
- Training Transformer-XL on 128 GPUs☆141Updated 5 years ago
- ☆83Updated 5 years ago
- Configure Python functions explicitly and safely☆127Updated 11 months ago
- The official repository for our paper "The Devil is in the Detail: Simple Tricks Improve Systematic Generalization of Transformers". We s…☆67Updated 2 years ago
- A demonstration of the attention mechanism with some toy experiments and explanations.☆107Updated 7 years ago
- Toy implementations of some popular ML optimizers using Python/JAX☆44Updated 4 years ago
- Original PyTorch implementation of the Leap meta-learner (https://arxiv.org/abs/1812.01054) along with code for running the Omniglot expe…☆148Updated 2 years ago
- Neural Turing Machines in pytorch☆48Updated 3 years ago
- EfficientNet, MobileNetV3, MobileNetV2, MixNet, etc in JAX w/ Flax Linen and Objax☆128Updated last year
- Research boilerplate for PyTorch.☆149Updated 2 years ago
- A case study of efficient training of large language models using commodity hardware.☆68Updated 3 years ago
- A (possibly/eventually annotated?) collection of resources (books, demos, lectures, etc) that I personally like for various topics in mac…☆32Updated 6 years ago