AminRezaei0x443 / PyTorch-LIT
Lite Inference Toolkit (LIT) for PyTorch
☆161Updated 2 years ago
Related projects ⓘ
Alternatives and complementary repositories for PyTorch-LIT
- The "tl;dr" on a few notable transformer papers (pre-2022).☆189Updated last year
- Check if you have training samples in your test set☆64Updated 2 years ago
- My implementation of DeepMind's Perceiver☆61Updated 3 years ago
- Memory Efficient Attention (O(sqrt(n)) for Jax and PyTorch☆179Updated last year
- A fastai/PyTorch package for unpaired image-to-image translation.☆133Updated last year
- Functional deep learning☆106Updated 2 years ago
- Python Research Framework☆107Updated 2 years ago
- A library to inspect and extract intermediate layers of PyTorch models.☆470Updated 2 years ago
- Swarm training framework using Haiku + JAX + Ray for layer parallel transformer language models on unreliable, heterogeneous nodes☆237Updated last year
- XtremeDistil framework for distilling/compressing massive multilingual neural network models to tiny and efficient models for AI at scale☆153Updated 11 months ago
- An alternative to convolution in neural networks☆251Updated 7 months ago
- Library for 8-bit optimizers and quantization routines.☆714Updated 2 years ago
- ADAS is short for Adaptive Step Size, it's an optimizer that unlike other optimizers that just normalize the derivative, it fine-tunes th…☆85Updated 3 years ago
- RUDOLPH: One Hyper-Tasking Transformer can be creative as DALL-E and GPT-3 and smart as CLIP☆253Updated last year
- JAX implementation of VQGAN☆89Updated 2 years ago
- A simple library that implements CLIP guided loss in PyTorch.☆77Updated 2 years ago
- graftr: an interactive shell to view and edit PyTorch checkpoints.☆110Updated 4 years ago
- Accelerated NLP pipelines for fast inference on CPU. Built with Transformers and ONNX runtime.☆126Updated 3 years ago
- Babysit your preemptible TPUs☆84Updated last year
- A pure-functional implementation of a machine learning transformer model in Python/JAX☆175Updated 2 years ago
- Implementation of the Adan (ADAptive Nesterov momentum algorithm) Optimizer in Pytorch☆247Updated 2 years ago
- HetSeq: Distributed GPU Training on Heterogeneous Infrastructure☆106Updated last year
- Simple Annotated implementation of GPT-NeoX in PyTorch☆111Updated 2 years ago
- Learning to Initialize Neural Networks for Stable and Efficient Training☆136Updated 2 years ago
- A case study of efficient training of large language models using commodity hardware.☆68Updated 2 years ago
- My repo for training neural nets using pytorch-lightning and hydra☆214Updated 3 months ago
- Memory mapped numpy arrays of varying shapes☆286Updated 5 months ago
- Implementation of the specific Transformer architecture from PaLM - Scaling Language Modeling with Pathways - in Jax (Equinox framework)☆185Updated 2 years ago
- Open Source Photos Platform Powered by PyTorch☆137Updated 2 years ago
- Official code for "Distributed Deep Learning in Open Collaborations" (NeurIPS 2021)☆116Updated 2 years ago