lernapparat / torchhacksLinks

Hacks for PyTorch

☆19

Alternatives and similar repositories for torchhacks

Users that are interested in torchhacks are comparing it to the libraries listed below

Sorting:

srush / triton-autodiff
Experiment of using Tangent to autodiff triton
☆80Updated last year
lucidrains / autoregressive-linear-attention-cuda
CUDA implementation of autoregressive linear attention, with all the latest research findings
☆46Updated 2 years ago
drisspg / transformer_nuggets
A place to store reusable transformer components of my own creation or found on the interwebs
☆62Updated 2 weeks ago
graphcore-research / out-of-the-box-fp8-training
Demo of the unit_scaling library, showing how a model can be easily adapted to train in FP8.
☆46Updated last year
DeMoriarty / custom_matmul_kernels
Customized matrix multiplication kernels
☆57Updated 3 years ago
pytorch / torchdistx
Torch Distributed Experimental
☆117Updated last year
ahennequ / pytorch-custom-mma
☆29Updated 3 years ago
UmerHA / triton_util
Make triton easier
☆49Updated last year
kshitij12345 / torchnnprofiler
Context Manager to profile the forward and backward times of PyTorch's nn.Module
☆83Updated 2 years ago
facebookexperimental / protoquant
Prototype routines for GPU quantization written using PyTorch.
☆21Updated 4 months ago
lianakoleva / no-libtorch-compile
☆21Updated 9 months ago
pytorch / maskedtensor
MaskedTensors for PyTorch
☆38Updated 3 years ago
amirzandieh / HyperAttention
Triton Implementation of HyperAttention Algorithm
☆48Updated last year
meta-pytorch / torchfix
TorchFix - a linter for PyTorch-using code with autofix support
☆151Updated 3 months ago
Zasder3 / open_clip_juwels
An open source implementation of CLIP.
☆33Updated 3 years ago
ezyang / torchdbg
PyTorch centric eager mode debugger
☆48Updated 11 months ago
MathInf / toroidal
a lightweight transformer library for PyTorch
☆72Updated 4 years ago
matthias-wright / jax-fid
FID computation in Jax/Flax.
☆29Updated last year
crypdick / timm-lr-scheduler-explorer
A dashboard for exploring timm learning rate schedulers
☆19Updated last year
ahennequ / cuda-tensorcores-register-mapping
☆19Updated 3 years ago
GindaChen / FlexFlashAttention3
FlexAttention w/ FlashAttention3 Support
☆27Updated last year
Quansight / pytest-pytorch
pytest plugin for a better developer experience when working with the PyTorch test suite
☆44Updated 3 years ago
facebookresearch / any4
Quantize transformers to any learned arbitrary 4-bit numeric format
☆49Updated 4 months ago
meta-pytorch / superblock
A block oriented training approach for inference time optimization.
☆33Updated last year
NVIDIA / free-threaded-python
No-GIL Python environment featuring NVIDIA Deep Learning libraries.
☆69Updated 7 months ago
HomebrewML / HomebrewNLP-torch
A case study of efficient training of large language models using commodity hardware.
☆68Updated 3 years ago
rwightman / imagenet-12k
ImageNet-12k subset of ImageNet-21k (fall11)
☆21Updated 2 years ago
facebookresearch / adaptive_scheduling
Experimental scripts for researching data adaptive learning rate scheduling.
☆22Updated 2 years ago
FrancescoSaverioZuppichini / Loading-huge-PyTorch-models-with-linear-memory-consumption
Little article showing how to load pytorch's models with linear memory consumption
☆34Updated 3 years ago
lucidrains / rela-transformer
Implementation of a Transformer using ReLA (Rectified Linear Attention) from https://arxiv.org/abs/2104.07012
☆49Updated 3 years ago