zomux / dlmonitor
☆148Updated 2 years ago
Alternatives and similar repositories for dlmonitor:
Users that are interested in dlmonitor are comparing it to the libraries listed below
- ☆102Updated 4 years ago
- ☆153Updated 4 years ago
- PyTorch functions and utilities to make your life easier☆195Updated 3 years ago
- A collection of code snippets for my PyTorch Lightning projects☆107Updated 4 years ago
- PyTorch dataset extended with map, cache etc. (tensorflow.data like)☆328Updated 2 years ago
- Unit Testing for pytorch, based on mltest☆311Updated 4 years ago
- a lightweight transformer library for PyTorch☆72Updated 3 years ago
- learning to search in pytorch☆110Updated 4 years ago
- The official repository for our paper "The Devil is in the Detail: Simple Tricks Improve Systematic Generalization of Transformers". We s…☆66Updated 2 years ago
- Configure Python functions explicitly and safely☆126Updated 2 months ago
- GPT, but made only out of MLPs☆88Updated 3 years ago
- ☆108Updated 2 years ago
- EfficientNet, MobileNetV3, MobileNetV2, MixNet, etc in JAX w/ Flax Linen and Objax☆127Updated last year
- ☆64Updated 4 years ago
- 🧀 Pytorch code for the Fromage optimiser.☆123Updated 6 months ago
- ☆28Updated 5 years ago
- A demonstration of the attention mechanism with some toy experiments and explanations.☆106Updated 6 years ago
- Neural Turing Machines in pytorch☆48Updated 3 years ago
- Research boilerplate for PyTorch.☆150Updated last year
- Tensor Shape Annotation Library (numpy, tensorflow, pytorch, ...)☆264Updated 4 years ago
- ☆67Updated last year
- Loss Patterns of Neural Networks☆83Updated 3 years ago
- This repository contains the results for the paper: "Descending through a Crowded Valley - Benchmarking Deep Learning Optimizers"☆180Updated 3 years ago
- HetSeq: Distributed GPU Training on Heterogeneous Infrastructure☆106Updated last year
- Understanding the Difficulty of Training Transformers☆328Updated 2 years ago
- Functional deep learning☆108Updated 2 years ago
- LM Pretraining with PyTorch/TPU☆134Updated 5 years ago
- Configuration classes enabling type-safe PyTorch configuration for Hydra apps☆210Updated 2 years ago
- Official code for the Stochastic Polyak step-size optimizer☆138Updated 7 months ago
- [NeurIPS 2022] DataMUX: Data Multiplexing for Neural Networks☆60Updated 2 years ago