yk / patch-torch-save
Patches the torch.save function with arbitrary code that gets executed upon torch.load.
☆71 Updated 2 years ago
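The description above relies on the fact that torch.save and torch.load are built on Python's pickle format, which lets a serialized object name a callable to run at load time. The following is a minimal sketch of that general mechanism using only the standard library; it is not the repository's actual code. The class name Payload and the harmless call to sorted are assumptions chosen for illustration.

```python
import pickle


class Payload:
    """Hypothetical stand-in for an object smuggled into a saved file."""

    def __reduce__(self):
        # pickle stores "call sorted([3, 1, 2])" instead of the object's state.
        # A harmless builtin is used here; any importable callable would work.
        return (sorted, ([3, 1, 2],))


blob = pickle.dumps(Payload())   # serializing does not run the callable
result = pickle.loads(blob)      # deserializing does
print(result)
```

Loading the bytes returns the result of the call rather than a Payload instance, which is why checkpoints from untrusted sources should be treated as executable code. Recent PyTorch versions provide a weights_only option on torch.load that restricts what the unpickler may construct.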
Alternatives and similar repositories for patch-torch-save:
Users who are interested in patch-torch-save compare it with the libraries listed below:
- Scripts to prep PC for development use after OS installs ☆37 Updated last week
- Lightning HPO & Training Studio App ☆18 Updated last year
- Gzip and nearest neighbors for text classification ☆56 Updated last year
- A case study of efficient training of large language models using commodity hardware. ☆68 Updated 2 years ago
- Check if you have training samples in your test set ☆64 Updated 2 years ago
- Implementation of the specific Transformer architecture from PaLM - Scaling Language Modeling with Pathways - in Jax (Equinox framework) ☆186 Updated 2 years ago
- Resources from the EleutherAI Math Reading Group ☆52 Updated 2 months ago
- Common Python utilities and GitHub Actions in Lightning Ecosystem ☆52 Updated this week
- Lite Inference Toolkit (LIT) for PyTorch ☆161 Updated 3 years ago
- Keras style progressbar for Pytorch (PK Bar) ☆32 Updated 9 months ago
- The spiritual successor to knockknock for PyTorch Lightning, get notified when your training ends ☆76 Updated 2 months ago
- MinT: Minimal Transformer Library and Tutorials ☆252 Updated 2 years ago
- HomebrewNLP in JAX flavour for maintainable TPU-Training ☆48 Updated last year
- Train vision models using JAX and 🤗 transformers ☆96 Updated 3 weeks ago
- My implementation of DeepMind's Perceiver ☆61 Updated 3 years ago
- My own attempt at a long context genomics model, leveraging recent advances in long context attention modeling (Flash Attention + other h… ☆52 Updated last year
- One stop shop for all things carp ☆59 Updated 2 years ago
- A pure-functional implementation of a machine learning transformer model in Python/JAX ☆177 Updated 2 weeks ago
- Named tensors with first-class dimensions for PyTorch ☆321 Updated last year
- Exploring finetuning public checkpoints on filtered 8K sequences on Pile ☆115 Updated last year
- [WIP] A 🔥 interface for running code in the cloud ☆86 Updated last year
- An alternative to convolution in neural networks ☆254 Updated 10 months ago
- Distributed skorch on Ray Train ☆57 Updated 2 years ago
- Latent Diffusion Language Models ☆68 Updated last year
- Babysit your preemptible TPUs ☆85 Updated 2 years ago
- Train fastai models faster (and other useful tools) ☆64 Updated 8 months ago