uwsampl/dtr-prototype

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/uwsampl/dtr-prototype)

uwsampl / dtr-prototype

Dynamic Tensor Rematerialization prototype (modified PyTorch) and simulator. Paper: https://arxiv.org/abs/2006.09616

☆133

Alternatives and similar repositories for dtr-prototype

Users that are interested in dtr-prototype are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

feifeibear / PSTensor
View on GitHub
PSTensor provides a way to hack the memory management of tensors in TensorFlow and PyTorch by defining your own C++ Tensor Class.
☆10Feb 10, 2022Updated 4 years ago
uwplse / tensat
View on GitHub
Re-implementation of the TASO compiler using equality saturation
☆142Jun 28, 2021Updated 5 years ago
darchr / AutoTM
View on GitHub
Thinking is hard - automate it
☆18Aug 24, 2022Updated 3 years ago
parasj / checkmate
View on GitHub
Training neural networks in TensorFlow 2.0 with 5x less memory
☆137Feb 21, 2022Updated 4 years ago
bkushigian / cornelius
View on GitHub
Equivalent and redundant mutant detection with e-graphs!!!
☆13Jun 14, 2023Updated 3 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
netx-repo / PipeSwitch
View on GitHub
PipeSwitch: Fast Pipelined Context Switching for Deep Learning Applications
☆127May 9, 2022Updated 4 years ago
spcl / substation
View on GitHub
Research and development for optimizing transformers
☆132Feb 16, 2021Updated 5 years ago
linnanwang / superneurons-release
View on GitHub
this is the release repository of superneurons
☆54Feb 13, 2021Updated 5 years ago
TonyTangYu / pytorch
View on GitHub
DELTA-pytorch：DELTA: Dynamically Optimizing GPU Memory beyond Tensor Recomputation
☆12Apr 16, 2024Updated 2 years ago
SymbioticLab / Salus
View on GitHub
Fine-grained GPU sharing primitives
☆149Jul 28, 2025Updated 11 months ago
uwsampl / relay-aot
View on GitHub
An experimental ahead of time compiler for Relay.
☆49Apr 21, 2020Updated 6 years ago
mangpo / swizzle-inventor
View on GitHub
A framework that helps implementing swizzle GPU kernels
☆50Feb 29, 2020Updated 6 years ago
thu-pacman / PET
View on GitHub
PET: Optimizing Tensor Programs with Partially Equivalent Transformations and Automated Corrections
☆126Jun 23, 2022Updated 4 years ago
tlc-pack / relax
View on GitHub
☆193Mar 28, 2023Updated 3 years ago
End-to-end encrypted email - Proton Mail • Ad
Special offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
topal-team / rockmate
View on GitHub
☆37Mar 12, 2026Updated 4 months ago
utsaslab / MONeT
View on GitHub
MONeT framework for reducing memory consumption of DNN training
☆174May 4, 2021Updated 5 years ago
jkehne / cuda-malloc-hook
View on GitHub
Drop-in library for tracking the memory allocations of CUDA applications
☆14Nov 17, 2017Updated 8 years ago
feifeibear / PyTorchMemTracer
View on GitHub
Depict GPU memory footprint during DNN training of PyTorch
☆11Nov 17, 2022Updated 3 years ago
lsrcz / mini-prover
View on GitHub
Term project for TaPL. A mini coq-like proof assistant.
☆17Jun 17, 2018Updated 8 years ago
kaiyuyue / torchshard
View on GitHub
Slicing a PyTorch Tensor Into Parallel Shards
☆300Jun 7, 2025Updated last year
facebookresearch / fairring
View on GitHub
Fairring (FAIR + Herring) is a plug-in for PyTorch that provides a process group for distributed training that outperforms NCCL at large …
☆66Mar 21, 2022Updated 4 years ago
raceconditionrunning / raceconditionrunning.github.io
View on GitHub
Race Condition Running
☆11Updated this week
GeeeekExplorer / kkbot
View on GitHub
A Feishu/Lark AI agent bot
☆15Feb 27, 2026Updated 4 months ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
shriramsb / vdnn-plus-plus
View on GitHub
Implementation of vDNN++; an improvement over vDNN
☆18Dec 7, 2018Updated 7 years ago
nox-410 / tvm.tl
View on GitHub
An extention of TVMScript to write simple and high performance GPU kernels with tensorcore.
☆52Jul 23, 2024Updated last year
tobegit3hub / tftvm
View on GitHub
TensorFlow and TVM integration
☆36Apr 27, 2020Updated 6 years ago
YukeWang96 / MGG_OSDI23
View on GitHub
Artifact for OSDI'23: MGG: Accelerating Graph Neural Networks with Fine-grained intra-kernel Communication-Computation Pipelining on Mult…
☆40Mar 17, 2024Updated 2 years ago
ucbrise / actnn
View on GitHub
ActNN: Reducing Training Memory Footprint via 2-Bit Activation Compressed Training
☆199Dec 22, 2022Updated 3 years ago
ryantd / veloce
View on GitHub
WIP. Veloce is a low-code Ray-based parallelization library that makes machine learning computation novel, efficient, and heterogeneous.
☆17Aug 4, 2022Updated 3 years ago
mit-han-lab / inter-operator-scheduler
View on GitHub
[MLSys 2021] IOS: Inter-Operator Scheduler for CNN Acceleration
☆201Apr 27, 2022Updated 4 years ago
hcho3 / relayviz
View on GitHub
Visualize TVM Relay program graph
☆12Nov 19, 2019Updated 6 years ago
ShishirPatil / poet
View on GitHub
ML model training for edge devices
☆170Sep 29, 2023Updated 2 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
Tencent / PatrickStar
View on GitHub
PatrickStar enables Larger, Faster, Greener Pretrained Models for NLP and democratizes AI for everyone.
☆773Nov 18, 2025Updated 8 months ago
UofT-EcoSystem / hfta
View on GitHub
Boost hardware utilization for ML training workloads via Inter-model Horizontal Fusion
☆32May 15, 2024Updated 2 years ago
jiazhihao / TASO
View on GitHub
The Tensor Algebra SuperOptimizer for Deep Learning
☆742Jan 26, 2023Updated 3 years ago
zhuohan123 / terapipe
View on GitHub
☆79May 4, 2021Updated 5 years ago
amirgholami / ai_and_memory_wall
View on GitHub
AI and Memory Wall
☆228Mar 23, 2024Updated 2 years ago
HPCRL / ASPLOS_artifact
View on GitHub
☆13Nov 1, 2021Updated 4 years ago
quiver-team / quiver-feature
View on GitHub
High performance RDMA-based distributed feature collection component for training GNN model on EXTREMELY large graph
☆55Jul 3, 2022Updated 4 years ago