marload / aquvitae
Knowledge Distillation Toolkit
☆90Updated 4 years ago
Related projects ⓘ
Alternatives and complementary repositories for aquvitae
- PyTorch implementation of L2L execution algorithm☆106Updated last year
- High Performance Tensorflow Data Pipeline with State of Art Augmentations and low level optimizations.☆86Updated 2 years ago
- TF 2.x and PyTorch Lightning Callbacks for GPU monitoring☆92Updated 4 years ago
- A deep learning library based on Pytorch focussed on low resource language research and robustness☆69Updated 2 years ago
- Quick modules to turn regular Neural Networks to Bayesian Neural Networks with Dropout.☆35Updated 3 years ago
- Train ImageNet in 18 minutes on AWS☆126Updated 7 months ago
- A pytorch based classification experiments template☆46Updated 3 years ago
- Provides a systematic and extensible way to build, train, evaluate, and tune deep learning models using PyTorch.☆93Updated 4 months ago
- Code for the Shortformer model, from the ACL 2021 paper by Ofir Press, Noah A. Smith and Mike Lewis.☆145Updated 3 years ago
- TF2.0 port for Augmix paper☆79Updated 4 years ago
- ☆101Updated 3 years ago
- Official Pytorch Implementation of Length-Adaptive Transformer (ACL 2021)☆100Updated 4 years ago
- Large dataset storage format for Pytorch☆45Updated 3 years ago
- Convenient DL serving☆72Updated 3 years ago
- ☆143Updated last year
- Configure Python functions explicitly and safely☆126Updated last year
- PyTorch Model Compression☆230Updated last year
- Simple implementation of the LSUV initialization in PyTorch☆58Updated 9 months ago
- PyTorch Java demo☆36Updated 4 years ago
- ☆40Updated last year
- Code for scaling Transformers☆26Updated 3 years ago
- Loss Patterns of Neural Networks☆82Updated 3 years ago
- 🤗 Transformers: State-of-the-art Natural Language Processing for TensorFlow 2.0 and PyTorch.☆68Updated 4 years ago
- "Learning Rate Dropout" in PyTorch☆34Updated 4 years ago
- ☆52Updated 4 years ago
- a lightweight transformer library for PyTorch☆72Updated 3 years ago
- Minimal implementation of adaptive gradient clipping (https://arxiv.org/abs/2102.06171) in TensorFlow 2.☆80Updated 3 years ago
- A general, modular, and programmable architecture search framework☆121Updated last year