uw-mad-dash / AutoFreeze
☆14Updated 4 years ago
Alternatives and similar repositories for AutoFreeze
Users who are interested in AutoFreeze are comparing it to the libraries listed below.
- ☆22Updated 4 years ago
- DL Dataloader Benchmarks☆18Updated 4 months ago
- Fairring (FAIR + Herring) is a plug-in for PyTorch that provides a process group for distributed training that outperforms NCCL at large …☆65Updated 3 years ago
- ☆15Updated 3 years ago
- PipeTransformer: Automated Elastic Pipelining for Distributed Training of Large-scale Models. ICML 2021☆56Updated 3 years ago
- "Moshpit SGD: Communication-Efficient Decentralized Training on Heterogeneous Unreliable Devices", official implementation☆29Updated 4 months ago
- Code for paper "SWALP: Stochastic Weight Averaging for Low-Precision Training".☆62Updated 6 years ago
- Code for BlockSwap (ICLR 2020).☆33Updated 4 years ago
- [ECCV 2022] SuperTickets: Drawing Task-Agnostic Lottery Tickets from Supernets via Jointly Architecture Searching and Parameter Pruning☆20Updated 2 years ago
- Deadline-based hyperparameter tuning on RayTune.☆31Updated 5 years ago
- Repository to go along with the paper "Plumber: Diagnosing and Removing Performance Bottlenecks in Machine Learning Data Pipelines"☆10Updated 3 years ago
- Memory Optimizations for Deep Learning (ICML 2023)☆64Updated last year
- ☆14Updated 3 years ago
- sensAI: ConvNets Decomposition via Class Parallelism for Fast Inference on Live Data☆64Updated 10 months ago
- SelectiveBackprop accelerates training by dynamically prioritizing useful examples with high loss☆32Updated 5 years ago
- ☆22Updated 7 years ago
- Implementation of a Transformer using ReLA (Rectified Linear Attention) from https://arxiv.org/abs/2104.07012☆50Updated 3 years ago
- Development repository for integrating FlexFlow (A distributed deep learning framework that supports flexible parallelization strategies)…☆29Updated 3 years ago
- Distributed DataLoader For Pytorch Based On Ray☆24Updated 3 years ago
- Light-weight GPU kernel interface for graph operations☆15Updated 5 years ago
- ☆23Updated 6 years ago
- [JMLR'20] NeurIPS 2019 MicroNet Challenge Efficient Language Modeling, Champion☆40Updated 4 years ago
- A Learnable LSH Framework for Efficient NN Training☆31Updated 3 years ago
- Code for Large Graph Convolutional Network Training with GPU-Oriented Data Communication Architecture (accepted by PVLDB). The outdated wr…☆9Updated last year
- Implementation for ACProp (Momentum centering and asynchronous update for adaptive gradient methods, NeurIPS 2021)☆15Updated 3 years ago
- Distributed ML Optimizer☆32Updated 3 years ago
- DLPack for Tensorflow☆35Updated 5 years ago
- A study of the downstream instability of word embeddings☆12Updated 2 years ago
- PyTorch implementation of L2L execution algorithm☆107Updated 2 years ago
- An Attention Superoptimizer☆21Updated 4 months ago