dlsyscourse / hw2Links
☆8Updated 7 months ago
Alternatives and similar repositories for hw2
Users that are interested in hw2 are comparing it to the libraries listed below
Sorting:
- 使用 CUDA C++ 实现的 llama 模型推理框架☆57Updated 6 months ago
- Triton Documentation in Chinese Simplified / Triton 中文文档☆71Updated last month
- A practical way of learning Swizzle☆20Updated 4 months ago
- A PyTorch-like deep learning framework. Just for fun.☆154Updated last year
- Course materials for MIT6.5940: TinyML and Efficient Deep Learning Computing☆46Updated 4 months ago
- ☆23Updated 3 weeks ago
- Machine Learning Compiler Road Map☆43Updated last year
- llm theoretical performance analysis tools and support params, flops, memory and latency analysis.☆92Updated last week
- A light llama-like llm inference framework based on the triton kernel.☆122Updated this week
- some hpc project for learning☆22Updated 9 months ago
- 【HACKATHON 预备营】飞桨启航计划集训营☆16Updated last week
- ☆131Updated last month
- 分层解耦的深度学习推理引擎☆73Updated 3 months ago
- 飞桨护航计划集训营☆18Updated 2 weeks ago
- ☆134Updated last year
- my cs notes☆50Updated 7 months ago
- ☆276Updated 7 months ago
- Codes & examples for "CUDA - From Correctness to Performance"☆98Updated 7 months ago
- ☆14Updated 9 months ago
- My solutions to the assignments of CMU 10-714 Deep Learning Systems 2022☆37Updated last year
- Tutorials for writing high-performance GPU operators in AI frameworks.☆130Updated last year
- ☆238Updated 3 months ago
- b站上的课程☆75Updated last year
- deep learning framework from scratch☆28Updated 3 years ago
- [USENIX ATC '24] Accelerating the Training of Large Language Models using Efficient Activation Rematerialization and Optimal Hybrid Paral…☆55Updated 10 months ago
- Code base and slides for ECE408:Applied Parallel Programming On GPU.☆124Updated 3 years ago
- Summary of some awesome work for optimizing LLM inference☆73Updated this week
- ☆34Updated last year
- Decoding Attention is specially optimized for MHA, MQA, GQA and MLA using CUDA core for the decoding stage of LLM inference.☆36Updated 2 months ago
- ☆70Updated 2 years ago