archsyscall / aquvitae
Knowledge Distillation Toolkit
☆88 Updated 4 years ago
Alternatives and similar repositories for aquvitae
Users who are interested in aquvitae are comparing it to the libraries listed below.
- A 🤗-style implementation of BERT using lambda layers instead of self-attention ☆69 Updated 4 years ago
- PyTorch Examples repo for "ReZero is All You Need: Fast Convergence at Large Depth" ☆62 Updated 10 months ago
- Code for scaling Transformers ☆26 Updated 4 years ago
- ☆47 Updated 4 years ago
- Unofficial PyTorch Implementation of EvoNorm ☆122 Updated 3 years ago
- Implementations of quasi-hyperbolic optimization algorithms. ☆102 Updated 5 years ago
- a lightweight transformer library for PyTorch ☆71 Updated 3 years ago
- PyTorch implementation of L2L execution algorithm ☆107 Updated 2 years ago
- A large scale study of Knowledge Distillation. ☆220 Updated 5 years ago
- Configure Python functions explicitly and safely ☆126 Updated 6 months ago
- Visualising the Transformer encoder ☆111 Updated 4 years ago
- TF 2.x and PyTorch Lightning Callbacks for GPU monitoring ☆92 Updated 4 years ago
- High Performance Tensorflow Data Pipeline with State of Art Augmentations and low level optimizations. ☆86 Updated 3 years ago
- ☆54 Updated 4 years ago
- Code for the Shortformer model, from the ACL 2021 paper by Ofir Press, Noah A. Smith and Mike Lewis. ☆147 Updated 3 years ago
- Distillation of BERT model with catalyst framework ☆78 Updated last year
- Large dataset storage format for Pytorch ☆45 Updated 3 years ago
- Parameterized fit and prediction harnesses for pytorch ☆40 Updated 4 years ago
- All Model summary in PyTorch similar to `model.summary()` in Keras ☆88 Updated 6 years ago
- Convenient DL serving ☆72 Updated 3 years ago
- ☆54 Updated 5 years ago
- Loss Patterns of Neural Networks ☆85 Updated 3 years ago
- 🤗 Transformers: State-of-the-art Natural Language Processing for TensorFlow 2.0 and PyTorch. ☆68 Updated 5 years ago
- Creates a learning-curve plot for Jupyter/Colab notebooks that is updated in real-time. ☆176 Updated 3 years ago
- PyTorch implementation of the NIPS'17 paper Training Deep Networks without Learning Rates Through Coin Betting. ☆37 Updated 7 years ago
- A fork of the official TPU models repo with fixes and a solution of the Kaggle Open Images 2019 Object Detection Challenge ☆49 Updated 5 years ago
- Simple stochastic weight averaging callback for Keras ☆63 Updated 3 years ago
- Keras/TF implementation of AdamW, SGDW, NadamW, Warm Restarts, and Learning Rate multipliers ☆167 Updated 3 years ago
- A collection of code snippets for my PyTorch Lightning projects ☆107 Updated 4 years ago
- Train ImageNet in 18 minutes on AWS ☆130 Updated last year