andravin / spioLinks
Spio (SPEE-oh) - Experimental CUDA kernel framework unifying typed dimensions, NVRTC JIT specialization, and ML‑guided tuning.
☆46Updated this week
Alternatives and similar repositories for spio
Users that are interested in spio are comparing it to the libraries listed below
Sorting:
- Just some miscellaneous utility functions / decorators / modules related to Pytorch and Accelerate to help speed up implementation of new…☆126Updated last year
- The AdEMAMix Optimizer: Better, Faster, Older.☆186Updated last year
- Minimal sharded dataset loaders, decoders, and utils for multi-modal document, image, and text datasets.☆160Updated last year
- TorchFix - a linter for PyTorch-using code with autofix support☆152Updated 4 months ago
- ☆92Updated last year
- ☆59Updated last year
- Mobile Viewer for W&B, built on top of Flutter.☆39Updated last year
- Little article showing how to load pytorch's models with linear memory consumption☆34Updated 3 years ago
- Context Manager to profile the forward and backward times of PyTorch's nn.Module☆83Updated 2 years ago
- FID computation in Jax/Flax.☆29Updated last year
- Utilities for PyTorch distributed☆25Updated 10 months ago
- Timm model explorer☆42Updated last year
- Contains my experiments with the `big_vision` repo to train ViTs on ImageNet-1k.☆22Updated 2 years ago
- Supercharge Your PyTorch Image Models: Bag of Tricks to 8x Faster Inference with ONNX Runtime & Optimizations☆23Updated last year
- Cyclemoid implementation for PyTorch☆90Updated 3 years ago
- Fast, Modern, and Low Precision PyTorch Optimizers☆119Updated last week
- Attempt to make multiple residual streams from Bytedance's Hyper-Connections paper accessible to the public☆124Updated this week
- ☆75Updated 3 years ago
- Implementation of the proposed Adam-atan2 from Google Deepmind in Pytorch☆134Updated 2 months ago
- A library that contains a rich collection of performant PyTorch model metrics, a simple interface to create new metrics, a toolkit to fac…☆245Updated 2 weeks ago
- Focused on fast experimentation and simplicity☆78Updated last year
- Code and weights for the paper "Cluster and Predict Latents Patches for Improved Masked Image Modeling"☆125Updated 2 weeks ago
- ☆16Updated last year
- ☆51Updated last year
- ☆133Updated 2 years ago
- CIFAR-10 speedruns: 94% in 2.6 seconds and 96% in 27 seconds☆340Updated last month
- Easily run PyTorch on multiple GPUs & machines☆56Updated last month
- A lightweight library designed to accelerate the process of training PyTorch models by providing a minimal, but extensible training loop …☆193Updated this week
- Explorations into the recently proposed Taylor Series Linear Attention☆100Updated last year
- Train vision models using JAX and 🤗 transformers☆100Updated 3 weeks ago