Unakar / Efficient_AILinks
此项目是我个人对MIT 6.5940 课程作业的答案,学习笔记和心得。
☆14Updated last year
Alternatives and similar repositories for Efficient_AI
Users that are interested in Efficient_AI are comparing it to the libraries listed below
Sorting:
- Course materials for MIT6.5940: TinyML and Efficient Deep Learning Computing☆59Updated 8 months ago
- The repository has collected a batch of noteworthy MLSys bloggers (Algorithms/Systems)☆278Updated 8 months ago
- Learning material for CMU10-714: Deep Learning System☆276Updated last year
- Codes & examples for "CUDA - From Correctness to Performance"☆111Updated 11 months ago
- 飞桨护航计划集训营☆21Updated last month
- A repository sharing the literatures about large language models☆102Updated 2 months ago
- ☆77Updated this week
- All Homeworks for TinyML and Efficient Deep Learning Computing 6.5940 • Fall • 2023 • https://efficientml.ai☆181Updated last year
- A comprehensive guide for beginners in the field of data management and artificial intelligence.☆433Updated 5 months ago
- ☆143Updated 2 months ago
- Puzzles for learning Triton, play it with minimal environment configuration!☆518Updated this week
- The dataset and baseline code for ASC23 LLM inference optimization challenge.☆32Updated last year
- 🏆🏆 「大模型」All in one & All from scratch. 🌍🌍 收集、清洗数据,训练Tokenizer,预训练、SFT、GRPO!☆29Updated last month
- 中科大计算机学院部分课程的试卷☆76Updated 2 months ago
- My solutions to the assignments of CMU 10-714 Deep Learning Systems 2022☆40Updated last year
- Implement custom operators in PyTorch with cuda/c++☆71Updated 2 years ago
- This repository serves as a comprehensive survey of LLM development, featuring numerous research papers along with their corresponding co…☆205Updated last month
- Awesome-LLM-KV-Cache: A curated list of 📙Awesome LLM KV Cache Papers with Codes.☆364Updated 6 months ago
- This repository organizes materials, recordings, and schedules related to AI-infra learning meetings.☆155Updated this week
- my cs notes☆56Updated 11 months ago
- A single-file educational implementation for understanding vLLM's core concepts and running LLM inference.☆22Updated 3 months ago
- Sharing my research toolchain☆85Updated last year
- 简单的Mac中文指南☆22Updated 2 years ago
- Curated collection of papers in MoE model inference☆265Updated last week
- 本仓库是关于大模型面试中常见面试试题和面试经验的整理。这里收集了各类与大模型相关的面试题目,并提供详细的解答和分析。本仓库由上海交大交影社区维护☆103Updated last year
- 🔥 How to efficiently and effectively compress the CoTs or directly generate concise CoTs during inference while maintaining the reasonin…☆62Updated 4 months ago
- Summary of some awesome work for optimizing LLM inference☆110Updated 3 months ago
- Lab 5 project of MIT-6.5940, deploying LLaMA2-7B-chat on one's laptop with TinyChatEngine.☆18Updated last year
- 注释的nano_vllm仓库,并且完成了MiniCPM4的适配以及注册新模型的功能☆75Updated last month
- llm theoretical performance analysis tools and support params, flops, memory and latency analysis.☆107Updated 2 months ago