wyc-ruiker / CSE-599W-2018
My Assignment for CSE 599w http://dlsys.cs.washington.edu/
☆16Updated 5 years ago
Alternatives and similar repositories for CSE-599W-2018:
Users that are interested in CSE-599W-2018 are comparing it to the libraries listed below
- A simple deep learning framework that supports automatic differentiation and GPU acceleration.☆58Updated last year
- Tutorial code on how to build your own Deep Learning System in 2k Lines☆125Updated 8 years ago
- CS294; AI For Systems and Systems For AI☆224Updated 5 years ago
- ☆35Updated last year
- Dynamic Tensor Rematerialization prototype (modified PyTorch) and simulator. Paper: https://arxiv.org/abs/2006.09616☆132Updated last year
- A high-performance distributed deep learning system targeting large-scale and automated distributed training. If you have any interests, …☆110Updated last year
- Code base and slides for ECE408:Applied Parallel Programming On GPU.☆122Updated 3 years ago
- ☆45Updated 5 years ago
- An Efficient Pipelined Data Parallel Approach for Training Large Model☆75Updated 4 years ago
- pytorch源码阅读 0.2.0 版本☆90Updated 5 years ago
- ☆35Updated 2 years ago
- Simple CuDNN wrapper☆30Updated 9 years ago
- Distributed ML Training Benchmarks☆27Updated 2 years ago
- A baseline repository of Auto-Parallelism in Training Neural Networks☆144Updated 2 years ago
- ☆79Updated this week
- A high-performance distributed deep learning system targeting large-scale and automated distributed training.☆298Updated last week
- A way to use cuda to accelerate top k algorithm☆29Updated 7 years ago
- Deep Learning in pure C++☆27Updated 5 years ago
- Google Colab Notebooks for Udacity CS344 - Intro to Parallel Programming☆133Updated 4 years ago
- [MLSys 2021] IOS: Inter-Operator Scheduler for CNN Acceleration☆197Updated 3 years ago
- A PyTorch-like deep learning framework. Just for fun.☆154Updated last year
- [USENIX ATC '24] Accelerating the Training of Large Language Models using Efficient Activation Rematerialization and Optimal Hybrid Paral…☆52Updated 8 months ago
- BytePS examples (Vision, NLP, GAN, etc)☆19Updated 2 years ago
- PET: Optimizing Tensor Programs with Partially Equivalent Transformations and Automated Corrections☆120Updated 2 years ago
- ☆109Updated last year
- Lecture notes of Probability Theory.☆50Updated 6 years ago
- CUDA 6大并行计算模式 代码与笔记☆60Updated 4 years ago
- oneflow documentation☆68Updated 10 months ago
- A super light-weight deep learning library based on NumPy in PyTorch fashion.☆94Updated 3 years ago
- ☆142Updated 2 months ago