📑 Dive into Big Model Training
☆115Dec 1, 2022Updated 3 years ago
Alternatives and similar repositories for Dive-into-Big-Model-Training
Users that are interested in Dive-into-Big-Model-Training are comparing it to the libraries listed below
Sorting:
- ☆14Aug 29, 2023Updated 2 years ago
- FTPipe and related pipeline model parallelism research.☆44May 16, 2023Updated 2 years ago
- Some microbenchmarks and design docs before commencement☆12Feb 1, 2021Updated 5 years ago
- Primo: Practical Learning-Augmented Systems with Interpretable Models☆19Dec 26, 2023Updated 2 years ago
- Federated reconnaissance mini-ImageNet benchmark and baseline models☆13Sep 2, 2021Updated 4 years ago
- 🕹 Implementation for the lesson Compiling Engineering(2020 Spring) in Peking University, adjusted from UCLA CS 132 Project.☆10Jun 21, 2020Updated 5 years ago
- Julia implementation of flash-attention operation for neural networks.☆11May 31, 2023Updated 2 years ago
- ☆10May 16, 2021Updated 4 years ago
- "Moshpit SGD: Communication-Efficient Decentralized Training on Heterogeneous Unreliable Devices", official implementation☆30Feb 4, 2025Updated last year
- ☆27May 31, 2023Updated 2 years ago
- A schedule language for large model training☆152Aug 21, 2025Updated 6 months ago
- ☆14Mar 29, 2020Updated 5 years ago
- Experiments for the NeurIPS 2021 paper "Cockpit: A Practical Debugging Tool for the Training of Deep Neural Networks"☆13Oct 25, 2021Updated 4 years ago
- A Sparse-tensor Communication Framework for Distributed Deep Learning☆13Nov 1, 2021Updated 4 years ago
- ComScribe is a tool to identify communication among all GPU-GPU and CPU-GPU pairs in a single-node multi-GPU system.☆27Jul 6, 2023Updated 2 years ago
- A Generic Resource-Aware Hyperparameter Tuning Execution Engine☆15Jan 8, 2022Updated 4 years ago
- Distributed DRL by Ray and TensorFlow Tutorial.☆10Dec 26, 2019Updated 6 years ago
- [ICML'25] Kernel-based Unsupervised Embedding Alignment for Enhanced Visual Representation in Vision-language Models☆21Sep 7, 2025Updated 6 months ago
- 基于FPGA实现用户态中断硬件机制与优化操作系统内核☆10Apr 1, 2025Updated 11 months ago
- An Attention Superoptimizer☆22Jan 20, 2025Updated last year
- ☆16Sep 4, 2023Updated 2 years ago
- This repository is the official implementation of 'EDEN: Communication-Efficient and Robust Distributed Mean Estimation for Federated Lea…☆14Aug 2, 2022Updated 3 years ago
- [ACM SoCC'22] Pisces: Efficient Federated Learning via Guided Asynchronous Training☆13Apr 28, 2025Updated 10 months ago
- ☆19Feb 15, 2023Updated 3 years ago
- Code for Double Blind CollaborativeLearning (DBCL)☆14May 14, 2021Updated 4 years ago
- Framework of pa code for THU compiler principle course.☆13Dec 18, 2019Updated 6 years ago
- A Cluster-Wide Model Manager to Accelerate DNN Training via Automated Training Warmup☆36Jan 9, 2023Updated 3 years ago
- MLCD-Seg is a zero-shot segmentation model from DeepGlint.☆17Jul 4, 2025Updated 8 months ago
- Fork of diux-dev/imagenet18☆16Oct 4, 2018Updated 7 years ago
- Switch-based Training Acceleration for Machine Learning (SwitchML)☆16Apr 13, 2021Updated 4 years ago
- ☆17Jul 5, 2022Updated 3 years ago
- Code of the COLING22 paper "uChecker: Masked Pretrained Language Models as Unsupervised Chinese Spelling Checkers"☆19Aug 17, 2022Updated 3 years ago
- Machine learning on serverless platform☆10Jul 2, 2022Updated 3 years ago
- SOTA Learning-augmented Systems☆37May 21, 2022Updated 3 years ago
- egraphs-good website☆18Oct 9, 2024Updated last year
- Mu: Microsecond Consensus for Microsecond Applications☆42Oct 12, 2020Updated 5 years ago
- Federated Few-shot Learning for Mobile NLP. Conditionally accepted by MobiCom'23.☆16Aug 18, 2023Updated 2 years ago
- Statistics on multilingual datasets☆17Jul 12, 2022Updated 3 years ago
- MATCH-TUNING☆15Aug 6, 2022Updated 3 years ago