Unakar / Efficient_AILinks
此项目是我个人对MIT 6.5940 课程作业的答案,学习笔记和心得。
☆14Updated last year
Alternatives and similar repositories for Efficient_AI
Users that are interested in Efficient_AI are comparing it to the libraries listed below
Sorting:
- Course materials for MIT6.5940: TinyML and Efficient Deep Learning Computing☆52Updated 7 months ago
- The repository has collected a batch of noteworthy MLSys bloggers (Algorithms/Systems)☆269Updated 7 months ago
- ☆54Updated 4 months ago
- ☆141Updated last month
- Puzzles for learning Triton, play it with minimal environment configuration!☆461Updated 8 months ago
- A comprehensive guide for beginners in the field of data management and artificial intelligence.☆393Updated 4 months ago
- All Homeworks for TinyML and Efficient Deep Learning Computing 6.5940 • Fall • 2023 • https://efficientml.ai☆177Updated last year
- Learning material for CMU10-714: Deep Learning System☆268Updated last year
- 飞桨护航计划集训营☆20Updated 2 weeks ago
- A repository sharing the literatures about large language models☆99Updated last month
- Codes & examples for "CUDA - From Correctness to Performance"☆104Updated 10 months ago
- This repository serves as a comprehensive survey of LLM development, featuring numerous research papers along with their corresponding co…☆183Updated 3 weeks ago
- ☆35Updated 3 months ago
- llm theoretical performance analysis tools and support params, flops, memory and latency analysis.☆102Updated last month
- Code release for book "Efficient Training in PyTorch"☆83Updated 4 months ago
- 本仓库是关于大模型面试中常见面试试题和面试经验的整理。这里收集了各类与大模型相关的面试题目,并提供详细的解答和分析。本仓库由上海交大交影社区维护☆100Updated last year
- A tiny yet powerful LLM inference system tailored for researching purpose. vLLM-equivalent performance with only 2k lines of code (2% of …☆240Updated 2 months ago
- Awesome-LLM-KV-Cache: A curated list of 📙Awesome LLM KV Cache Papers with Codes.☆351Updated 5 months ago
- 📰 Must-read papers on KV Cache Compression (constantly updating 🤗).☆513Updated 3 weeks ago
- Implement custom operators in PyTorch with cuda/c++☆69Updated 2 years ago
- The blog, read report and code example for AGI/LLM related knowledge.☆41Updated 6 months ago
- HPC-Lab for High Performance Computing course, 2023 Spring , Tsinghua Universit. 高性能计算导论 @ THU.☆25Updated 2 years ago
- 中科大计算机学院部分课程的试卷☆78Updated 3 weeks ago
- my cs notes☆54Updated 10 months ago
- Triton Documentation in Chinese Simplified / Triton 中文文档☆80Updated 4 months ago
- The dataset and baseline code for ASC23 LLM inference optimization challenge.☆32Updated last year
- A PyTorch-like deep learning framework. Just for fun.☆156Updated last year
- Summary of some awesome work for optimizing LLM inference☆101Updated 2 months ago
- 🤖FFPA: Extend FlashAttention-2 with Split-D, ~O(1) SRAM complexity for large headdim, 1.8x~3x↑🎉 vs SDPA EA.☆211Updated 2 weeks ago
- learning how CUDA works☆300Updated 5 months ago