☆41Jun 18, 2021Updated 4 years ago
Alternatives and similar repositories for OptimalGradCheckpointing
Users that are interested in OptimalGradCheckpointing are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- PSTensor provides a way to hack the memory management of tensors in TensorFlow and PyTorch by defining your own C++ Tensor Class.☆10Feb 10, 2022Updated 4 years ago
- Code associated with the paper **SkipBERT: Efficient Inference with Shallow Layer Skipping**, at ACL 2022☆16Jun 22, 2022Updated 3 years ago
- An external memory allocator example for PyTorch.☆16Aug 10, 2025Updated 9 months ago
- Training neural networks in TensorFlow 2.0 with 5x less memory☆137Feb 21, 2022Updated 4 years ago
- 面试常见知识点整理☆11Oct 13, 2019Updated 6 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- MONeT framework for reducing memory consumption of DNN training☆174May 4, 2021Updated 5 years ago
- Successfully training approximations to full-rank matrices for efficiency in deep learning.☆16Jan 5, 2021Updated 5 years ago
- ☆19Jan 27, 2021Updated 5 years ago
- ☆30Sep 4, 2023Updated 2 years ago
- NAS Benchmark in "Prioritized Architecture Sampling with Monto-Carlo Tree Search", CVPR2021☆38Aug 24, 2021Updated 4 years ago
- Artifact for 'Register Optimizations for Stencils on GPUs'☆10Sep 18, 2018Updated 7 years ago
- Single RISC-V CPU attached on AMBA AHB with Instruction and Data memories.☆13Apr 18, 2026Updated last month
- Description and applications of OpenAI's paper about DALL-E (2021) and implementation of other (CLIP-guided) zero-shot text-to-image gene…☆33Aug 11, 2022Updated 3 years ago
- PolyMage is a domain-specific language and optimizing code generator for auto-parallelisation☆14Jul 15, 2016Updated 9 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Fast Axiomatic Attribution for Neural Networks (NeurIPS*2021)☆15Feb 24, 2026Updated 2 months ago
- Universal Python binding for the LMDB 'Lightning' Database☆13Nov 7, 2017Updated 8 years ago
- This code is for our ICML 2020 paper "On the Number of Linear Regions of Convolutional Neural Networks."☆13Aug 5, 2020Updated 5 years ago
- Just another yolo variant.☆24Jul 8, 2022Updated 3 years ago
- The official implementation of "DOTS: Decoupling Operation and Topology in Differentiable Architecture Search"☆20Apr 19, 2021Updated 5 years ago
- PipeTransformer: Automated Elastic Pipelining for Distributed Training of Large-scale Models. ICML 2021☆56Jul 21, 2021Updated 4 years ago
- Our implementation of Shampoo optimizer based on https://arxiv.org/pdf/1802.09568.pdf☆14Dec 23, 2019Updated 6 years ago
- We introduce FixEval , a dataset for competitive programming bug fixing along with a comprehensive test suite and show the necessity of e…☆26Aug 31, 2022Updated 3 years ago
- ☆14Oct 8, 2024Updated last year
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- [ACM MM 2023] Boosting Few-shot 3D Point Cloud Segmentation via Query-Guided Enhancement☆13May 17, 2024Updated 2 years ago
- Depict GPU memory footprint during DNN training of PyTorch☆11Nov 17, 2022Updated 3 years ago
- RF^2 is a federated recommendation learning simulation framework that can simulate realistic system-induced data heterogeneity and its ef…☆13Aug 2, 2022Updated 3 years ago
- The jiant toolkit for general-purpose text understanding models☆22Oct 8, 2020Updated 5 years ago
- GPT Demo with hybrid distributed training☆10Dec 1, 2022Updated 3 years ago
- ☆12Jun 28, 2021Updated 4 years ago
- Pytorch wrappers for the FINUFFT library☆16Nov 21, 2025Updated 5 months ago
- ☆11Jun 2, 2021Updated 4 years ago
- TaskMet Task-driven Metric Learning for Model Learning☆19Feb 9, 2024Updated 2 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Unified Language-driven Zero-shot Domain Adaptation (CVPR 2024)☆17Nov 28, 2024Updated last year
- NO-tifications: remove any notifications on Android☆18Jul 8, 2024Updated last year
- The code for performing MTL on object recognition with neural data☆16Nov 1, 2021Updated 4 years ago
- Artifact repository for paper Automatic Generation of High-Performance Quantized Machine Learning Kernels☆17Oct 13, 2020Updated 5 years ago
- Convert ANY IR to ONNX format☆28May 9, 2026Updated last week
- ☆64Apr 9, 2024Updated 2 years ago
- Fully-Polarized Ship Detection Dataset☆20Jan 18, 2024Updated 2 years ago