andravin / spioLinks
Experimental CUDA kernel framework unifying typed dimensions, NVRTC JIT specialization, and ML‑guided tuning.
☆45Updated this week
Alternatives and similar repositories for spio
Users that are interested in spio are comparing it to the libraries listed below
Sorting:
- Just some miscellaneous utility functions / decorators / modules related to Pytorch and Accelerate to help speed up implementation of new…☆125Updated last year
- Minimal sharded dataset loaders, decoders, and utils for multi-modal document, image, and text datasets.☆160Updated last year
- Supercharge Your PyTorch Image Models: Bag of Tricks to 8x Faster Inference with ONNX Runtime & Optimizations☆23Updated last year
- Timm model explorer☆42Updated last year
- ☆59Updated last year
- Fast, Modern, and Low Precision PyTorch Optimizers☆116Updated 3 months ago
- Utilities for PyTorch distributed☆25Updated 9 months ago
- Implementation of the proposed Adam-atan2 from Google Deepmind in Pytorch☆134Updated last month
- A lightweight library designed to accelerate the process of training PyTorch models by providing a minimal, but extensible training loop …☆192Updated 6 months ago
- The AdEMAMix Optimizer: Better, Faster, Older.☆186Updated last year
- ☆91Updated last year
- Implementation of Infini-Transformer in Pytorch☆113Updated 11 months ago
- ☆134Updated 2 years ago
- TorchFix - a linter for PyTorch-using code with autofix support☆152Updated 3 months ago
- Mobile Viewer for W&B, built on top of Flutter.☆39Updated last year
- supporting pytorch FSDP for optimizers☆84Updated last year
- Contains my experiments with the `big_vision` repo to train ViTs on ImageNet-1k.☆22Updated 2 years ago
- ☆75Updated 3 years ago
- Little article showing how to load pytorch's models with linear memory consumption☆34Updated 3 years ago
- VIT inference in triton because, why not?☆32Updated last year
- Implementation of a Light Recurrent Unit in Pytorch☆49Updated last year
- Simplify Your Visual Data Ops. Find and visualize issues with your computer vision datasets such as duplicates, anomalies, data leakage, …☆69Updated 7 months ago
- Easily run PyTorch on multiple GPUs & machines☆54Updated last week
- Focused on fast experimentation and simplicity☆75Updated 11 months ago
- Code release for "Dropout Reduces Underfitting"☆317Updated 2 years ago
- An implementation of PSGD Kron second-order optimizer for PyTorch☆97Updated 4 months ago
- Notebooks to demonstrate TimmWrapper☆16Updated 10 months ago
- Official implementation of the paper: "ZClip: Adaptive Spike Mitigation for LLM Pre-Training".☆139Updated 3 weeks ago
- Context Manager to profile the forward and backward times of PyTorch's nn.Module☆83Updated 2 years ago
- [ICCV25] Official Implementation of LeGrad☆83Updated last year