Dynamic Tensor Rematerialization prototype (modified PyTorch) and simulator. Paper: https://arxiv.org/abs/2006.09616
☆133 · Jul 6, 2023 · Updated 2 years ago
Alternatives and similar repositories for dtr-prototype
Users that are interested in dtr-prototype are comparing it to the libraries listed below
- PSTensor provides a way to hack the memory management of tensors in TensorFlow and PyTorch by defining your own C++ Tensor Class. ☆10 · Feb 10, 2022 · Updated 4 years ago
- Re-implementation of the TASO compiler using equality saturation ☆138 · Jun 28, 2021 · Updated 4 years ago
- Thinking is hard - automate it ☆18 · Aug 24, 2022 · Updated 3 years ago
- Training neural networks in TensorFlow 2.0 with 5x less memory ☆137 · Feb 21, 2022 · Updated 4 years ago
- Equivalent and redundant mutant detection with e-graphs!!! ☆13 · Jun 14, 2023 · Updated 2 years ago
- Research and development for optimizing transformers ☆131 · Feb 16, 2021 · Updated 5 years ago
- PipeSwitch: Fast Pipelined Context Switching for Deep Learning Applications ☆127 · May 9, 2022 · Updated 3 years ago
- This is the release repository of superneurons ☆54 · Feb 13, 2021 · Updated 5 years ago
- DELTA-pytorch: DELTA: Dynamically Optimizing GPU Memory beyond Tensor Recomputation ☆12 · Apr 16, 2024 · Updated last year
- An external memory allocator example for PyTorch. ☆16 · Aug 10, 2025 · Updated 7 months ago
- Fine-grained GPU sharing primitives ☆147 · Jul 28, 2025 · Updated 7 months ago
- An experimental ahead of time compiler for Relay. ☆49 · Apr 21, 2020 · Updated 5 years ago
- A framework that helps implement swizzle GPU kernels ☆51 · Feb 29, 2020 · Updated 6 years ago
- PET: Optimizing Tensor Programs with Partially Equivalent Transformations and Automated Corrections ☆125 · Jun 23, 2022 · Updated 3 years ago
- ☆192 · Mar 28, 2023 · Updated 2 years ago
- Haskell experiments involving TVM AI framework ☆20 · Apr 26, 2019 · Updated 6 years ago
- MONeT framework for reducing memory consumption of DNN training ☆174 · May 4, 2021 · Updated 4 years ago
- Drop-in library for tracking the memory allocations of CUDA applications ☆14 · Nov 17, 2017 · Updated 8 years ago
- Depict GPU memory footprint during DNN training of PyTorch ☆11 · Nov 17, 2022 · Updated 3 years ago
- Term project for TaPL. A mini coq-like proof assistant. ☆17 · Jun 17, 2018 · Updated 7 years ago
- ☆23 · Apr 28, 2023 · Updated 2 years ago
- Slicing a PyTorch Tensor Into Parallel Shards ☆300 · Jun 7, 2025 · Updated 9 months ago
- Fairring (FAIR + Herring) is a plug-in for PyTorch that provides a process group for distributed training that outperforms NCCL at large … ☆66 · Mar 21, 2022 · Updated 4 years ago
- ☆36 · Mar 12, 2026 · Updated last week
- Race Condition Running ☆11 · Mar 14, 2026 · Updated last week
- An extension of TVMScript to write simple and high performance GPU kernels with tensorcore. ☆50 · Jul 23, 2024 · Updated last year
- TensorFlow and TVM integration ☆36 · Apr 27, 2020 · Updated 5 years ago
- Artifact for OSDI'23: MGG: Accelerating Graph Neural Networks with Fine-grained intra-kernel Communication-Computation Pipelining on Mult… ☆41 · Mar 17, 2024 · Updated 2 years ago
- ActNN: Reducing Training Memory Footprint via 2-Bit Activation Compressed Training ☆199 · Dec 22, 2022 · Updated 3 years ago
- WIP. Veloce is a low-code Ray-based parallelization library that makes machine learning computation novel, efficient, and heterogeneous. ☆17 · Aug 4, 2022 · Updated 3 years ago
- [MLSys 2021] IOS: Inter-Operator Scheduler for CNN Acceleration ☆199 · Apr 27, 2022 · Updated 3 years ago
- Visualize TVM Relay program graph ☆12 · Nov 19, 2019 · Updated 6 years ago
- PatrickStar enables Larger, Faster, Greener Pretrained Models for NLP and democratizes AI for everyone. ☆778 · Nov 18, 2025 · Updated 4 months ago
- ML model training for edge devices ☆168 · Sep 29, 2023 · Updated 2 years ago
- Boost hardware utilization for ML training workloads via Inter-model Horizontal Fusion ☆32 · May 15, 2024 · Updated last year
- ☆78 · May 4, 2021 · Updated 4 years ago
- The Tensor Algebra SuperOptimizer for Deep Learning ☆740 · Jan 26, 2023 · Updated 3 years ago
- Microsoft Collective Communication Library ☆387 · Sep 20, 2023 · Updated 2 years ago
- High performance RDMA-based distributed feature collection component for training GNN model on EXTREMELY large graph ☆55 · Jul 3, 2022 · Updated 3 years ago