andravin / spioLinks
Experimental CUDA kernel framework unifying typed dimensions, NVRTC JIT specialization, and ML‑guided tuning.
☆46Updated last week
Alternatives and similar repositories for spio
Users that are interested in spio are comparing it to the libraries listed below
Sorting:
- Just some miscellaneous utility functions / decorators / modules related to Pytorch and Accelerate to help speed up implementation of new…☆126Updated last year
- Little article showing how to load pytorch's models with linear memory consumption☆34Updated 3 years ago
- ☆92Updated last year
- Minimal sharded dataset loaders, decoders, and utils for multi-modal document, image, and text datasets.☆160Updated last year
- ☆133Updated 2 years ago
- Simplify Your Visual Data Ops. Find and visualize issues with your computer vision datasets such as duplicates, anomalies, data leakage, …☆69Updated 8 months ago
- Contains my experiments with the `big_vision` repo to train ViTs on ImageNet-1k.☆22Updated 3 years ago
- Context Manager to profile the forward and backward times of PyTorch's nn.Module☆83Updated 2 years ago
- ☆75Updated 3 years ago
- Utilities for PyTorch distributed☆25Updated 11 months ago
- CIFAR-10 speedruns: 94% in 2.6 seconds and 96% in 27 seconds☆349Updated 2 months ago
- The AdEMAMix Optimizer: Better, Faster, Older.☆186Updated last year
- Fast, Modern, and Low Precision PyTorch Optimizers☆120Updated last month
- ☆51Updated last year
- TorchFix - a linter for PyTorch-using code with autofix support☆152Updated 5 months ago
- A library that contains a rich collection of performant PyTorch model metrics, a simple interface to create new metrics, a toolkit to fac…☆247Updated 2 weeks ago
- supporting pytorch FSDP for optimizers☆84Updated last year
- ☆59Updated last year
- Hacks for PyTorch☆19Updated 2 years ago
- Implementation of the proposed Adam-atan2 from Google Deepmind in Pytorch☆134Updated 3 months ago
- Supercharge Your PyTorch Image Models: Bag of Tricks to 8x Faster Inference with ONNX Runtime & Optimizations☆23Updated last year
- Timm model explorer☆42Updated last year
- FID computation in Jax/Flax.☆29Updated last year
- VIT inference in triton because, why not?☆35Updated last year
- Presents comprehensive benchmarks of XLA-compatible pre-trained models in Keras.☆37Updated 2 years ago
- Lightning HPO & Training Studio App☆19Updated 2 years ago
- Mobile Viewer for W&B, built on top of Flutter.☆40Updated last year
- Implementation of fused cosine similarity attention in the same style as Flash Attention☆220Updated 2 years ago
- Experiment of using Tangent to autodiff triton☆82Updated 2 years ago
- Focused on fast experimentation and simplicity☆80Updated last year