zomux / dlmonitorLinks
☆159Updated 4 months ago
Alternatives and similar repositories for dlmonitor
Users that are interested in dlmonitor are comparing it to the libraries listed below
Sorting:
- ☆104Updated 5 years ago
- PyTorch functions and utilities to make your life easier☆195Updated 4 years ago
- Code for Neural Arithmetic Units (ICLR) and Measuring Arithmetic Extrapolation Performance (SEDL|NeurIPS)☆148Updated 4 years ago
- Configure Python functions explicitly and safely☆128Updated last year
- ☆65Updated 5 years ago
- This repository contains the results for the paper: "Descending through a Crowded Valley - Benchmarking Deep Learning Optimizers"☆184Updated 4 years ago
- Implementation for the Lookahead Optimizer.☆243Updated 3 years ago
- HetSeq: Distributed GPU Training on Heterogeneous Infrastructure☆106Updated 2 years ago
- A collection of code snippets for my PyTorch Lightning projects☆107Updated 5 years ago
- Loss Patterns of Neural Networks☆86Updated 4 years ago
- ☆153Updated 5 years ago
- a lightweight transformer library for PyTorch☆72Updated 4 years ago
- Neural Turing Machines in pytorch☆48Updated 4 years ago
- Differentiable Algorithms and Algorithmic Supervision.☆116Updated 2 years ago
- Profile the GPU memory usage of every line in a Pytorch code☆83Updated 7 years ago
- learning to search in pytorch☆110Updated 5 years ago
- ☆54Updated 5 years ago
- Browse the CS/AI/ML research paper graph☆51Updated 2 years ago
- Automatic GPU+CPU memory profiling, re-use and memory leaks detection using jupyter/ipython experiment containers☆230Updated 2 years ago
- GPT, but made only out of MLPs☆89Updated 4 years ago
- DeepOBS: A Deep Learning Optimizer Benchmark Suite☆109Updated 2 years ago
- The official repository for our paper "The Devil is in the Detail: Simple Tricks Improve Systematic Generalization of Transformers". We s…☆67Updated 3 years ago
- A smoother activation function (undergrad code)☆116Updated 5 years ago
- A demonstration of the attention mechanism with some toy experiments and explanations.☆107Updated 7 years ago
- Toy implementations of some popular ML optimizers using Python/JAX☆44Updated 4 years ago
- An assignment on creating a minimalist neural network toolkit for CS11-747☆64Updated 2 years ago
- Train ImageNet in 18 minutes on AWS☆134Updated last year
- Docs☆143Updated last year
- Code for scaling Transformers☆26Updated 5 years ago
- Unit Testing for pytorch, based on mltest☆312Updated 5 years ago