MONeT framework for reducing memory consumption of DNN training
☆174May 4, 2021Updated 4 years ago
Alternatives and similar repositories for MONeT
Users that are interested in MONeT are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [NeurIPS'20] GradAug: A New Regularization Method for Deep Neural Networks☆93Dec 16, 2020Updated 5 years ago
- ActNN: Reducing Training Memory Footprint via 2-Bit Activation Compressed Training☆198Dec 22, 2022Updated 3 years ago
- ☆42Sep 8, 2023Updated 2 years ago
- Haskell experiments involving TVM AI framework☆20Apr 26, 2019Updated 6 years ago
- Training neural networks in TensorFlow 2.0 with 5x less memory☆137Feb 21, 2022Updated 4 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- PyTorch implementation of L2L execution algorithm☆109Jan 16, 2023Updated 3 years ago
- ☆41Jun 18, 2021Updated 4 years ago
- Dynamic Tensor Rematerialization prototype (modified PyTorch) and simulator. Paper: https://arxiv.org/abs/2006.09616☆133Jul 6, 2023Updated 2 years ago
- ☆23Apr 28, 2023Updated 2 years ago
- The implementation of "Shape Adaptor: A Learnable Resizing Module" [ECCV 2020].☆71Mar 10, 2021Updated 5 years ago
- PyTorch layer-by-layer model profiler☆606May 23, 2021Updated 4 years ago
- Using ideas from product quantization for state-of-the-art neural network compression.☆146Aug 14, 2021Updated 4 years ago
- sensAI: ConvNets Decomposition via Class Parallelism for Fast Inference on Live Data☆66Jul 25, 2024Updated last year
- Lightweight and Parallel Deep Learning Framework☆263Nov 26, 2022Updated 3 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- The code for Joint Neural Architecture Search and Quantization☆14Apr 10, 2019Updated 7 years ago
- Implementation of DropLoss for Long-Tail Instance Segmentation in Pytorch☆42Apr 14, 2021Updated 4 years ago
- Object detection on multiple datasets with an automatically learned unified label space.☆516Mar 8, 2024Updated 2 years ago
- The Tensor Algebra SuperOptimizer for Deep Learning☆740Jan 26, 2023Updated 3 years ago
- [MLSys 2021] IOS: Inter-Operator Scheduler for CNN Acceleration☆200Apr 27, 2022Updated 3 years ago
- Repository for NeurIPS 2020 Spotlight "AdaBelief Optimizer: Adapting stepsizes by the belief in observed gradients"☆1,070Aug 9, 2024Updated last year
- ☆145Jan 30, 2025Updated last year
- a playground for working with fully static tensors and automatic differentiation☆16Mar 18, 2021Updated 5 years ago
- A GPipe implementation in PyTorch☆862Jul 25, 2024Updated last year
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Unofficial PyTorch Implementation of EvoNorm☆123Aug 29, 2021Updated 4 years ago
- ☆16Sep 4, 2023Updated 2 years ago
- Research and development for optimizing transformers☆131Feb 16, 2021Updated 5 years ago
- PyTorch extensions for high performance and large scale training.☆3,404Apr 26, 2025Updated 11 months ago
- Post-training sparsity-aware quantization☆34Feb 26, 2023Updated 3 years ago
- ☆78May 4, 2021Updated 4 years ago
- A flexible and efficient deep neural network (DNN) compiler that generates high-performance executable from a DNN model description.☆999Sep 19, 2024Updated last year
- Code for "Bridging the Gap between f-GANs and Wasserstein GANs", ICML 2020☆14Jul 18, 2020Updated 5 years ago
- [ICIP 2019] : Official PyTorch implementation of the paper "What's There in The Dark" accepted in IEEE International Conference in Image …☆30Dec 15, 2022Updated 3 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- [ECCV 2022] SuperTickets: Drawing Task-Agnostic Lottery Tickets from Supernets via Jointly Architecture Searching and Parameter Pruning☆20Jul 7, 2022Updated 3 years ago
- Automatic Mapping Generation, Verification, and Exploration for ISA-based Spatial Accelerators☆123Oct 26, 2022Updated 3 years ago
- Official Pytorch Implementation of "TResNet: High-Performance GPU-Dedicated Architecture" (WACV 2021)☆478Dec 10, 2024Updated last year
- ☆13Nov 1, 2021Updated 4 years ago
- Alex Graves' Adaptive Computation Time in PyTorch☆14Jan 9, 2018Updated 8 years ago
- ☆24Jun 22, 2022Updated 3 years ago
- ☆41Apr 3, 2021Updated 5 years ago