ppwwyyxx / RAM-multiprocess-dataloader

Demystify RAM Usage in Multi-Process Data Loaders

☆203

Alternatives and similar repositories for RAM-multiprocess-dataloader

Users who are interested in RAM-multiprocess-dataloader compare it to the libraries listed below.


kaiyuyue / torchshard
Slicing a PyTorch Tensor Into Parallel Shards
☆301 Updated 4 months ago
zhijian-liu / torchprofile
A general and accurate MACs / FLOPs profiler for PyTorch models
☆630 Updated 2 months ago
lucidrains / flash-cosine-sim-attention
Implementation of fused cosine similarity attention in the same style as Flash Attention
☆217 Updated 2 years ago
meta-pytorch / torcheval
A library that contains a rich collection of performant PyTorch model metrics, a simple interface to create new metrics, a toolkit to fac…
☆241 Updated 3 weeks ago
1adrianb / pytorch-estimate-flops
Estimate/count FLOPS for a given neural network using pytorch
☆306 Updated 3 years ago
tmbdev-archive / webdataset-examples
Examples for the WebDataset PyTorch Dataset Library
☆51 Updated 4 years ago
lucidrains / triton-transformer
Implementation of a Transformer, but completely in Triton
☆275 Updated 3 years ago
ucbrise / actnn
ActNN: Reducing Training Memory Footprint via 2-Bit Activation Compressed Training
☆199 Updated 2 years ago
PhilJd / contiguous_pytorch_params
Accelerate training by storing parameters in one contiguous chunk of memory.
☆292 Updated 4 years ago
frgfm / torch-scan
Seamless analysis of your PyTorch models (RAM usage, FLOPs, MACs, receptive field, etc.)
☆222 Updated 6 months ago
DeMoriarty / TorchPQ
Approximate nearest neighbor search with product quantization on GPU in pytorch and cuda
☆227 Updated last year
NVIDIA / PyProf
A GPU performance profiling tool for PyTorch models
☆507 Updated 4 years ago
fuzihaofzh / cstl
The C++ Standard Template Library (STL) for Python.
☆24 Updated 2 years ago
lucidrains / memory-efficient-attention-pytorch
Implementation of a memory efficient multi-head attention as proposed in the paper, "Self-attention Does Not Need O(n²) Memory"
☆383 Updated 2 years ago
Lightning-AI / forked-pdb
Python pdb for multiple processes
☆59 Updated 4 months ago
facebookresearch / dropout
Code release for "Dropout Reduces Underfitting"
☆315 Updated 2 years ago
facebookresearch / SWAG
Official repository for "Revisiting Weakly Supervised Pre-Training of Visual Perception Models". https://arxiv.org/abs/2201.08371.
☆180 Updated 3 years ago
ivan-chai / torch-linear-assignment
Batch computation of the linear assignment problem on GPU.
☆94 Updated last month
kakaobrain / torchlars
A LARS implementation in PyTorch
☆352 Updated 5 years ago
fumihwh / onnx-pytorch
A code generator from ONNX to PyTorch code
☆141 Updated 2 years ago
NVIDIA / transformer-ls
Official PyTorch Implementation of Long-Short Transformer (NeurIPS 2021).
☆228 Updated 3 years ago
Alibaba-MIIL / Solving_ImageNet
Official PyTorch implementation of the paper: "Solving ImageNet: a Unified Scheme for Training any Backbone to Top Results" (2022)
☆193 Updated 2 years ago
zzd1992 / Image-Local-Attention
A better PyTorch implementation of image local attention which reduces the GPU memory by an order of magnitude.
☆140 Updated 3 years ago
fadel / pytorch_ema
Tiny PyTorch library for maintaining a moving average of a collection of parameters.
☆437 Updated last year
Stonesjtu / pytorch_memlab
Profiling and inspecting memory in pytorch
☆1,072 Updated last month
google-research / vmoe
☆680 Updated 2 months ago
pytorch / nestedtensor
[Prototype] Tools for the concurrent manipulation of variably sized Tensors.
☆251 Updated 2 years ago
facebookresearch / bitsandbytes
Library for 8-bit optimizers and quantization routines.
☆779 Updated 3 years ago
prigoyal / pytorch_memonger
Experimental ground for optimizing memory of pytorch models
☆367 Updated 7 years ago
vra / flopth
A simple program to calculate and visualize the FLOPs and Parameters of Pytorch models, with handy CLI and easy-to-use Python API.
☆131 Updated 10 months ago