☆41Jun 18, 2021Updated 4 years ago
Alternatives and similar repositories for OptimalGradCheckpointing
Users that are interested in OptimalGradCheckpointing are comparing it to the libraries listed below
Sorting:
- PSTensor provides a way to hack the memory management of tensors in TensorFlow and PyTorch by defining your own C++ Tensor Class.☆10Feb 10, 2022Updated 4 years ago
- Universal Python binding for the LMDB 'Lightning' Database☆13Nov 7, 2017Updated 8 years ago
- 面试常见知识点整理☆11Oct 13, 2019Updated 6 years ago
- An external memory allocator example for PyTorch.☆16Aug 10, 2025Updated 6 months ago
- Our implementation of Shampoo optimizer based on https://arxiv.org/pdf/1802.09568.pdf☆12Dec 23, 2019Updated 6 years ago
- Successfully training approximations to full-rank matrices for efficiency in deep learning.☆17Jan 5, 2021Updated 5 years ago
- Fast Axiomatic Attribution for Neural Networks (NeurIPS*2021)☆16Updated this week
- MONeT framework for reducing memory consumption of DNN training☆174May 4, 2021Updated 4 years ago
- Research and development for optimizing transformers☆131Feb 16, 2021Updated 5 years ago
- [ICLR 2022] "Unified Vision Transformer Compression" by Shixing Yu*, Tianlong Chen*, Jiayi Shen, Huan Yuan, Jianchao Tan, Sen Yang, Ji Li…☆55Dec 1, 2023Updated 2 years ago
- A pytorch implementation of yolov3☆24Mar 25, 2019Updated 6 years ago
- Here we collect trick questions and failed tasks for open source LLMs to improve them.☆32Apr 20, 2023Updated 2 years ago
- PipeTransformer: Automated Elastic Pipelining for Distributed Training of Large-scale Models. ICML 2021☆55Jul 21, 2021Updated 4 years ago
- TVMScript kernel for deformable attention☆25Dec 15, 2021Updated 4 years ago
- Compares the DistilBERT and MobileBERT architectures for mobile deployments.☆33Oct 15, 2020Updated 5 years ago
- ☆37May 28, 2023Updated 2 years ago
- A Learnable LSH Framework for Efficient NN Training☆34Jul 22, 2021Updated 4 years ago
- Training neural networks in TensorFlow 2.0 with 5x less memory☆137Feb 21, 2022Updated 4 years ago
- PET: Optimizing Tensor Programs with Partially Equivalent Transformations and Automated Corrections☆125Jun 23, 2022Updated 3 years ago
- Fully-Polarized Ship Detection Dataset☆19Jan 18, 2024Updated 2 years ago
- 基于老年人互助养老模式的时间银行系统研究(程成)☆10Nov 18, 2014Updated 11 years ago
- Dynamic Tensor Rematerialization prototype (modified PyTorch) and simulator. Paper: https://arxiv.org/abs/2006.09616☆133Jul 6, 2023Updated 2 years ago
- NAS Benchmark in "Prioritized Architecture Sampling with Monto-Carlo Tree Search", CVPR2021☆37Aug 24, 2021Updated 4 years ago
- ☆43Jan 30, 2024Updated 2 years ago
- ☆11Apr 8, 2024Updated last year
- ☆10Jul 23, 2019Updated 6 years ago
- Identification of the Adversary from a Single Adversarial Example (ICML 2023)☆10Jul 15, 2024Updated last year
- ☆10May 18, 2024Updated last year
- Code for experiments on self-prediction as a way to measure introspection in LLMs☆16Dec 10, 2024Updated last year
- ☆13Updated this week
- Punch Out Model Synthesis - a program for constraint based tiling generation☆19Feb 1, 2026Updated last month
- Superpixels through Iterative CLEarcutting (SICLE) framework☆10Aug 25, 2023Updated 2 years ago
- A smartphone specs API powered with the most trusted phone information website gsm arena.☆16Feb 1, 2024Updated 2 years ago
- A bot that do auto search and gain points☆10Nov 2, 2023Updated 2 years ago
- Tools to cluster visually similar images into groups in an image dataset☆11Jul 29, 2022Updated 3 years ago
- ☆10Aug 15, 2022Updated 3 years ago
- Software library RLCM (recursively low-rank compressed matrices)☆14Apr 15, 2021Updated 4 years ago
- derived from https://github.com/wilfredinni/python-cheatsheet☆10Nov 8, 2023Updated 2 years ago
- A simple baseline for Person ReID, it achieves 3rd place in VisDA2020 challenge.☆38Aug 21, 2020Updated 5 years ago