andravin / spioLinks
Experimental CUDA kernel framework unifying typed dimensions, NVRTC JIT specialization, and ML‑guided tuning.
☆46Updated this week
Alternatives and similar repositories for spio
Users that are interested in spio are comparing it to the libraries listed below
Sorting:
- Just some miscellaneous utility functions / decorators / modules related to Pytorch and Accelerate to help speed up implementation of new…☆126Updated last year
- Little article showing how to load pytorch's models with linear memory consumption☆34Updated 3 years ago
- TorchFix - a linter for PyTorch-using code with autofix support☆152Updated 5 months ago
- Supercharge Your PyTorch Image Models: Bag of Tricks to 8x Faster Inference with ONNX Runtime & Optimizations☆24Updated last year
- Context Manager to profile the forward and backward times of PyTorch's nn.Module☆83Updated 2 years ago
- Minimal sharded dataset loaders, decoders, and utils for multi-modal document, image, and text datasets.☆160Updated last year
- The AdEMAMix Optimizer: Better, Faster, Older.☆186Updated last year
- Simplify Your Visual Data Ops. Find and visualize issues with your computer vision datasets such as duplicates, anomalies, data leakage, …☆69Updated 9 months ago
- Timm model explorer☆42Updated last year
- ☆92Updated last year
- A library that contains a rich collection of performant PyTorch model metrics, a simple interface to create new metrics, a toolkit to fac…☆246Updated last week
- FID computation in Jax/Flax.☆29Updated last year
- Contains my experiments with the `big_vision` repo to train ViTs on ImageNet-1k.☆22Updated 3 years ago
- CIFAR-10 speedruns: 94% in 2.6 seconds and 96% in 27 seconds☆352Updated 2 months ago
- Mobile Viewer for W&B, built on top of Flutter.☆40Updated last year
- Lightning HPO & Training Studio App☆19Updated 2 years ago
- ☆51Updated last year
- VIT inference in triton because, why not?☆36Updated last year
- Implementation of the proposed Adam-atan2 from Google Deepmind in Pytorch☆135Updated 3 months ago
- Presents comprehensive benchmarks of XLA-compatible pre-trained models in Keras.☆37Updated 2 years ago
- Hacks for PyTorch☆19Updated 2 years ago
- Fast, Modern, and Low Precision PyTorch Optimizers☆124Updated last month
- Cyclemoid implementation for PyTorch☆90Updated 3 years ago
- Implementation of fused cosine similarity attention in the same style as Flash Attention☆220Updated 2 years ago
- A library for unit scaling in PyTorch☆133Updated 7 months ago
- Focused on fast experimentation and simplicity☆80Updated last year
- ☆75Updated 3 years ago
- ☆39Updated last year
- Utilities for PyTorch distributed☆25Updated 11 months ago
- Explorations into the recently proposed Taylor Series Linear Attention☆100Updated last year