sseung0703 / Lightweighting_Cookbook
This project attempts to build neural network training and lightweighting cookbook including three kinds of lightweighting solutions, i.e., knowledge distillation, filter pruning, and quantization.
☆22Updated 2 years ago
Alternatives and similar repositories for Lightweighting_Cookbook:
Users that are interested in Lightweighting_Cookbook are comparing it to the libraries listed below
- Tensorflow2 training code with jit compiling on multi-GPU.☆17Updated 4 years ago
- Learning Features with Parameter-Free Layers, ICLR 2022☆85Updated last year
- Simple implementation of muP, based on Spectral Condition for Feature Learning. The implementation is SGD only, dont use it for Adam☆73Updated 8 months ago
- Jax/Flax implementation of DeiT and DeiT-III (ViT)☆16Updated 3 months ago
- Reproduction of Vision Transformer in Tensorflow2. Train from scratch and Finetune.☆48Updated 3 years ago
- Official repository for Automated Learning Rate Scheduler for Large-Batch Training (8th ICML Workshop on AutoML)☆40Updated 3 years ago
- A minimal implementation of Gaussian process regression in PyTorch☆61Updated 2 years ago
- Serving Example of CodeGen-350M-Mono-GPTJ on Triton Inference Server with Docker and Kubernetes☆20Updated last year
- The OTHER pytorch boilerplate☆44Updated last week
- 이화여대 강의자료☆28Updated last year
- read 1 paper everyday (only weekday)☆56Updated 3 years ago
- my useful torch lightning training template☆32Updated 2 years ago
- Automatic Mixed Precision Tutorials using pytorch. Based on PyTorch 1.6 Official Features, implement classification codebase using custo…☆88Updated 4 years ago
- (ICML 2022) Official PyTorch implementation of “Blurs Behave Like Ensembles: Spatial Smoothings to Improve Accuracy, Uncertainty, and Rob…☆78Updated 2 years ago
- The official repository for <Autoencoding Under Normalization Constraints> (Yoon, Noh and Park, ICML 2021).☆42Updated last year
- ☆186Updated 2 years ago
- Information and Materials for the Deep Learning Course☆30Updated 2 years ago
- Simple llama usage example☆48Updated 2 years ago
- ☆83Updated last year
- Tensorflow Implementation of "Slowing Down the Weight Norm Increase in Momentum-based Optimizers"☆47Updated 3 years ago
- Reproducing the reformer with tf2☆16Updated 4 years ago
- Implementation of Mega, the Single-head Attention with Multi-headed EMA architecture that currently holds SOTA on Long Range Arena☆204Updated last year
- A GPT, made only of MLPs, in Jax☆57Updated 3 years ago
- PyTorch Model Compression☆231Updated 2 years ago
- [ICLR 2023] RC-MAE☆51Updated last year
- A PyTorch Implementation of the Luna: Linear Unified Nested Attention☆41Updated 3 years ago
- Yet Another Reinforcement Learning Tutorial☆73Updated 2 years ago
- A performance library for machine learning applications.☆183Updated last year
- Tensorflow implementation of "Tackling the Generative Learning Trilemma with Denoising Diffusion GANs" (ICLR 2022 Spotlight)☆20Updated 2 years ago
- Unofficial but Efficient Implementation of "Mamba: Linear-Time Sequence Modeling with Selective State Spaces" in JAX☆83Updated last year