archsyscall / aquvitae
Knowledge Distillation Toolkit
☆88 · Updated 4 years ago
Alternatives and similar repositories for aquvitae
Users that are interested in aquvitae are comparing it to the libraries listed below
- High Performance Tensorflow Data Pipeline with State of Art Augmentations and low level optimizations. ☆86 · Updated 3 years ago
- Official Pytorch Implementation of Length-Adaptive Transformer (ACL 2021) ☆101 · Updated 4 years ago
- PyTorch implementation of L2L execution algorithm ☆107 · Updated 2 years ago
- Implementations of quasi-hyperbolic optimization algorithms. ☆102 · Updated 5 years ago
- Convenient DL serving ☆72 · Updated 3 years ago
- ☆28 · Updated 6 years ago
- Configure Python functions explicitly and safely ☆126 · Updated 7 months ago
- Code for scaling Transformers ☆26 · Updated 4 years ago
- A pytorch based classification experiments template ☆46 · Updated 4 years ago
- A large scale study of Knowledge Distillation. ☆220 · Updated 5 years ago
- PyTorch Examples repo for "ReZero is All You Need: Fast Convergence at Large Depth" ☆62 · Updated 11 months ago
- Unofficial PyTorch Implementation of EvoNorm ☆122 · Updated 3 years ago
- a lightweight transformer library for PyTorch ☆72 · Updated 3 years ago
- PyTorch DataLoader processed in multiple remote computation machines for heavy data processings ☆67 · Updated 5 years ago
- Yet Another Neural Network Library 🤔 ☆27 · Updated 2 months ago
- ☆40 · Updated last year
- PyTorchPipe (PTP) is a component-oriented framework for rapid prototyping and training of computational pipelines combining vision and la … ☆226 · Updated 5 years ago
- Provides a systematic and extensible way to build, train, evaluate, and tune deep learning models using PyTorch. ☆94 · Updated last year
- ☆14 · Updated 6 years ago
- TF 2.x and PyTorch Lightning Callbacks for GPU monitoring ☆92 · Updated 5 years ago
- A 🤗-style implementation of BERT using lambda layers instead of self-attention ☆69 · Updated 4 years ago
- Loss Patterns of Neural Networks ☆85 · Updated 3 years ago
- TF2.0 port for Augmix paper ☆79 · Updated 5 years ago
- Implements sharpness-aware minimization (https://arxiv.org/abs/2010.01412) in TensorFlow 2. ☆60 · Updated 3 years ago
- Code for the Shortformer model, from the ACL 2021 paper by Ofir Press, Noah A. Smith and Mike Lewis. ☆147 · Updated 3 years ago
- ☆54 · Updated 5 years ago
- Swish Activation - PyTorch CUDA Implementation ☆37 · Updated 5 years ago
- ☆35 · Updated 5 years ago
- Train ImageNet in 18 minutes on AWS ☆130 · Updated last year
- Implementation of Rectified Adam in Keras ☆70 · Updated 5 years ago