MONeT framework for reducing memory consumption of DNN training
☆174May 4, 2021Updated 5 years ago
Alternatives and similar repositories for MONeT
Users that are interested in MONeT are comparing it to the libraries listed below.
- [NeurIPS'20] GradAug: A New Regularization Method for Deep Neural Networks☆93Dec 16, 2020Updated 5 years ago
- ActNN: Reducing Training Memory Footprint via 2-Bit Activation Compressed Training☆199Dec 22, 2022Updated 3 years ago
- ☆42Sep 8, 2023Updated 2 years ago
- Haskell experiments involving TVM AI framework☆20Apr 26, 2019Updated 7 years ago
- Training neural networks in TensorFlow 2.0 with 5x less memory☆137Feb 21, 2022Updated 4 years ago
- PyTorch implementation of L2L execution algorithm☆109Jan 16, 2023Updated 3 years ago
- ☆41Jun 18, 2021Updated 4 years ago
- Dynamic Tensor Rematerialization prototype (modified PyTorch) and simulator. Paper: https://arxiv.org/abs/2006.09616☆133Jul 6, 2023Updated 2 years ago
- ☆23Apr 28, 2023Updated 3 years ago
- The implementation of "Shape Adaptor: A Learnable Resizing Module" [ECCV 2020].☆71Mar 10, 2021Updated 5 years ago
- PyTorch layer-by-layer model profiler☆606May 23, 2021Updated 5 years ago
- Using ideas from product quantization for state-of-the-art neural network compression.☆146Aug 14, 2021Updated 4 years ago
- sensAI: ConvNets Decomposition via Class Parallelism for Fast Inference on Live Data☆65Jul 25, 2024Updated last year
- Lightweight and Parallel Deep Learning Framework☆262Nov 26, 2022Updated 3 years ago
- The code for Joint Neural Architecture Search and Quantization☆14Apr 10, 2019Updated 7 years ago
- Implementation of DropLoss for Long-Tail Instance Segmentation in Pytorch☆44Apr 14, 2021Updated 5 years ago
- Object detection on multiple datasets with an automatically learned unified label space.☆516Mar 8, 2024Updated 2 years ago
- The Tensor Algebra SuperOptimizer for Deep Learning☆743Jan 26, 2023Updated 3 years ago
- Implementation for ACProp (Momentum centering and asynchronous update for adaptive gradient methods, NeurIPS 2021)☆17Oct 11, 2021Updated 4 years ago
- [MLSys 2021] IOS: Inter-Operator Scheduler for CNN Acceleration☆201Apr 27, 2022Updated 4 years ago
- Repository for NeurIPS 2020 Spotlight "AdaBelief Optimizer: Adapting stepsizes by the belief in observed gradients"☆1,072Aug 9, 2024Updated last year
- ☆144Jan 30, 2025Updated last year
- a playground for working with fully static tensors and automatic differentiation☆16Mar 18, 2021Updated 5 years ago
- A GPipe implementation in PyTorch☆864Jul 25, 2024Updated last year
- Unofficial PyTorch Implementation of EvoNorm☆123Aug 29, 2021Updated 4 years ago
- ☆16Sep 4, 2023Updated 2 years ago
- Research and development for optimizing transformers☆132Feb 16, 2021Updated 5 years ago
- PyTorch extensions for high performance and large scale training.☆3,407Apr 26, 2025Updated last year
- Post-training sparsity-aware quantization☆34Feb 26, 2023Updated 3 years ago
- ☆78May 4, 2021Updated 5 years ago
- A flexible and efficient deep neural network (DNN) compiler that generates high-performance executable from a DNN model description.☆1,001Sep 19, 2024Updated last year
- Code for "Bridging the Gap between f-GANs and Wasserstein GANs", ICML 2020☆14Jul 18, 2020Updated 5 years ago
- [ECCV 2022] SuperTickets: Drawing Task-Agnostic Lottery Tickets from Supernets via Jointly Architecture Searching and Parameter Pruning☆20Jul 7, 2022Updated 3 years ago
- Automatic Mapping Generation, Verification, and Exploration for ISA-based Spatial Accelerators☆124Oct 26, 2022Updated 3 years ago
- Official Pytorch Implementation of "TResNet: High-Performance GPU-Dedicated Architecture" (WACV 2021)☆478Dec 10, 2024Updated last year
- ☆13Nov 1, 2021Updated 4 years ago
- Alex Graves' Adaptive Computation Time in PyTorch☆14Jan 9, 2018Updated 8 years ago
- ☆24Jun 22, 2022Updated 3 years ago
- ☆41Apr 3, 2021Updated 5 years ago