dlsyscourse / hw1
☆8 Updated 8 months ago
Alternatives and similar repositories for hw1
Users who are interested in hw1 are comparing it to the repositories listed below.
- ☆34 Updated last year
- [USENIX ATC '24] Accelerating the Training of Large Language Models using Efficient Activation Rematerialization and Optimal Hybrid Paral… ☆55 Updated 10 months ago
- GPTQ inference TVM kernel ☆40 Updated last year
- LLM theoretical performance analysis tools, supporting params, FLOPs, memory, and latency analysis. ☆92 Updated last week
- Explore Inter-layer Expert Affinity in MoE Model Inference ☆9 Updated last year
- A lightweight design for computation-communication overlap. ☆132 Updated last month
- A practical way of learning Swizzle ☆20 Updated 4 months ago
- ☆21 Updated last month
- Simple PyTorch graph capturing. ☆19 Updated 2 years ago
- Official repository for the paper DynaPipe: Optimizing Multi-task Training through Dynamic Pipelines ☆19 Updated last year
- Machine Learning Compiler Road Map ☆43 Updated last year
- Examples and exercises from the book Programming Massively Parallel Processors - A Hands-on Approach. David B. Kirk and Wen-mei W. Hwu (T…