archsyscall / aquvitaeLinks
Knowledge Distillation Toolkit
☆88Updated 5 years ago
Alternatives and similar repositories for aquvitae
Users that are interested in aquvitae are comparing it to the libraries listed below
Sorting:
- Provides a systematic and extensible way to build, train, evaluate, and tune deep learning models using PyTorch.☆94Updated last year
- High Performance Tensorflow Data Pipeline with State of Art Augmentations and low level optimizations.☆85Updated 3 years ago
- TF 2.x and PyTorch Lightning Callbacks for GPU monitoring☆92Updated 5 years ago
- Browse the CS/AI/ML research paper graph☆51Updated 2 years ago
- A pytorch based classification experiments template☆46Updated 4 years ago
- PyTorch Examples repo for "ReZero is All You Need: Fast Convergence at Large Depth"☆62Updated last year
- ☆104Updated 4 years ago
- ☆54Updated 5 years ago
- Torchélie is a set of utility functions, layers, losses, models, trainers and other things for PyTorch.☆110Updated last month
- All Model summary in PyTorch similar to `model.summary()` in Keras☆88Updated 6 years ago
- Convenient DL serving☆72Updated 4 years ago
- A deep learning library based on Pytorch focussed on low resource language research and robustness☆70Updated 3 years ago
- ☆14Updated 6 years ago
- Large dataset storage format for Pytorch☆45Updated 4 years ago
- Implementation of Rectified Adam in Keras☆70Updated 6 years ago
- Code for scaling Transformers☆26Updated 4 years ago
- Implementation of the LAMB optimizer for Keras from the paper "Reducing BERT Pre-Training Time from 3 Days to 76 Minutes"☆75Updated 6 years ago
- Quick modules to turn regular Neural Networks to Bayesian Neural Networks with Dropout.☆35Updated 4 years ago
- Creates a learning-curve plot for Jupyter/Colab notebooks that is updated in real-time.☆177Updated 3 years ago
- TF2.0 port for Augmix paper☆79Updated 5 years ago
- A 🤗-style implementation of BERT using lambda layers instead of self-attention☆69Updated 4 years ago
- A supplementary code for Beyond Vector Spaces: Compact Data Representation as Differentiable Weighted Graphs.☆47Updated 5 years ago
- Keras implementation of Padam from "Closing the Generalization Gap of Adaptive Gradient Methods in Training Deep Neural Networks"☆17Updated 7 years ago
- Simple Tensorflow implementation of "On The Variance Of The Adaptive Learning Rate And Beyond"☆97Updated 5 years ago
- The stand-alone training engine module for the ALOHA.eu project.☆15Updated 5 years ago
- Code for the Shortformer model, from the ACL 2021 paper by Ofir Press, Noah A. Smith and Mike Lewis.☆147Updated 4 years ago
- Configure Python functions explicitly and safely☆127Updated 10 months ago
- Machine-generated summaries and highlights of the every accepted paper at Thirty-second Conference on Neural Information Processing Syste…☆71Updated 6 years ago
- PyTorch implementation of the NIPS'17 paper Training Deep Networks without Learning Rates Through Coin Betting.☆37Updated 7 years ago
- Code repo for "Transformer on a Diet" paper☆31Updated 5 years ago