qhliu26 / Dive-into-Big-Model-Training
Dive into Big Model Training
☆116 · Dec 1, 2022 · Updated 3 years ago
Alternatives and similar repositories for Dive-into-Big-Model-Training
Users who are interested in Dive-into-Big-Model-Training are comparing it to the repositories listed below.
- ☆14 · Aug 29, 2023 · Updated 2 years ago
- ☆15 · Apr 20, 2022 · Updated 3 years ago
- FTPipe and related pipeline model parallelism research. ☆44 · May 16, 2023 · Updated 2 years ago
- Some microbenchmarks and design docs before commencement ☆12 · Feb 1, 2021 · Updated 5 years ago
- Chaitin-Briggs register-allocation algorithm (LLVM back-end) ☆12 · Jan 6, 2016 · Updated 10 years ago
- Julia implementation of flash-attention operation for neural networks. ☆11 · May 31, 2023 · Updated 2 years ago
- Implementation for the course Compiling Engineering (Spring 2020) at Peking University, adapted from the UCLA CS 132 project. ☆10 · Jun 21, 2020 · Updated 5 years ago
- ☆10 · May 16, 2021 · Updated 4 years ago
- Paper list for acceleration of transformers ☆14 · Jul 1, 2023 · Updated 2 years ago
- Simple PyTorch profiler that combines DeepSpeed Flops Profiler and TorchInfo ☆11 · Feb 12, 2023 · Updated 3 years ago
- ☆27 · May 31, 2023 · Updated 2 years ago
- Code associated with the paper **Fine-tuning Language Models over Slow Networks using Activation Compression with Guarantees**. ☆28 · Apr 25, 2023 · Updated 2 years ago
- "Moshpit SGD: Communication-Efficient Decentralized Training on Heterogeneous Unreliable Devices", official implementation ☆30 · Feb 4, 2025 · Updated last year
- A schedule language for large model training ☆152 · Aug 21, 2025 · Updated 5 months ago
- ☆14 · Mar 29, 2020 · Updated 5 years ago
- Experiments for the NeurIPS 2021 paper "Cockpit: A Practical Debugging Tool for the Training of Deep Neural Networks" ☆13 · Oct 25, 2021 · Updated 4 years ago
- A Sparse-tensor Communication Framework for Distributed Deep Learning ☆13 · Nov 1, 2021 · Updated 4 years ago
- A Generic Resource-Aware Hyperparameter Tuning Execution Engine ☆15 · Jan 8, 2022 · Updated 4 years ago
- Pokedex for LLMs ☆14 · Apr 14, 2025 · Updated 10 months ago
- ☆17 · Mar 3, 2025 · Updated 11 months ago
- FPGA-based implementation of a user-mode interrupt hardware mechanism and an optimized operating system kernel ☆10 · Apr 1, 2025 · Updated 10 months ago
- Distributed DRL by Ray and TensorFlow Tutorial. ☆10 · Dec 26, 2019 · Updated 6 years ago
- An Attention Superoptimizer ☆22 · Jan 20, 2025 · Updated last year
- Python environment for Chinese Standard Mahjong on Botzone platform. ☆14 · Jan 18, 2021 · Updated 5 years ago
- ☆16 · Sep 4, 2023 · Updated 2 years ago
- Exploring connections between automatic differentiation and smooth infinitesimal analysis, or smooth algebras ☆17 · Sep 11, 2021 · Updated 4 years ago
- Framework of pa code for THU compiler principle course. ☆13 · Dec 18, 2019 · Updated 6 years ago
- Code for Double Blind Collaborative Learning (DBCL) ☆14 · May 14, 2021 · Updated 4 years ago
- [ACM SoCC'22] Pisces: Efficient Federated Learning via Guided Asynchronous Training ☆13 · Apr 28, 2025 · Updated 9 months ago
- ☆19 · Feb 15, 2023 · Updated 3 years ago
- This repository is the official implementation of 'EDEN: Communication-Efficient and Robust Distributed Mean Estimation for Federated Lea… ☆14 · Aug 2, 2022 · Updated 3 years ago
- A Cluster-Wide Model Manager to Accelerate DNN Training via Automated Training Warmup ☆35 · Jan 9, 2023 · Updated 3 years ago
- Code of the COLING22 paper "uChecker: Masked Pretrained Language Models as Unsupervised Chinese Spelling Checkers" ☆19 · Aug 17, 2022 · Updated 3 years ago
- Fork of diux-dev/imagenet18 ☆16 · Oct 4, 2018 · Updated 7 years ago
- Switch-based Training Acceleration for Machine Learning (SwitchML) ☆16 · Apr 13, 2021 · Updated 4 years ago
- DeepMatch: Practical Deep Packet Inspection in the Data Plane using Network Processors ☆15 · Dec 21, 2020 · Updated 5 years ago
- ☆17 · Jul 5, 2022 · Updated 3 years ago
- Machine learning on serverless platform ☆10 · Jul 2, 2022 · Updated 3 years ago
- ☆19 · May 4, 2023 · Updated 2 years ago