Unakar / Efficient_AI

This project contains my personal answers to the MIT 6.5940 course assignments, along with my study notes and reflections.

☆14

Alternatives and similar repositories for Efficient_AI

Users who are interested in Efficient_AI are comparing it to the libraries listed below.

PKUFlyingPig / MIT6.5940_TinyML
Course materials for MIT6.5940: TinyML and Efficient Deep Learning Computing
☆47 Updated 5 months ago
MLSys-Learner-Resources / Awesome-MLSys-Blogger
The repository has collected a batch of noteworthy MLSys bloggers (Algorithms/Systems)
☆245 Updated 5 months ago
NJUDeepEngine / llm-course-lecture
☆54 Updated 2 months ago
WesKwong / FLMMS
Federated Learning Multi-Machine Simulator: A Docker-based federated learning framework for simulating multi-machine training
☆9 Updated last year
interestingLSY / CUDA-From-Correctness-To-Performance-Code
Code & examples for "CUDA - From Correctness to Performance"
☆100 Updated 8 months ago
sast-summer-training-2023 / sast-summer-training-2023.github.io
Summer Training 2023, SAST 9.
☆42 Updated last year
PKU-DAIR / Starter-Guide
A comprehensive guide for beginners in the field of data management and artificial intelligence.
☆302 Updated 2 months ago
mdy666 / mdy_triton
☆137 Updated last month
Strivin0311 / llms-learning
A repository sharing literature on large language models
☆94 Updated 3 weeks ago
SiriusNEO / Triton-Puzzles-Lite
Puzzles for learning Triton; play with minimal environment configuration!
☆367 Updated 6 months ago
lcpu-club / hpc-wiki
Wiki for HPC
☆114 Updated 5 months ago
ThisisBillhe / ZipCache
[NeurIPS 2024] The official implementation of ZipCache: Accurate and Efficient KV Cache Quantization with Salient Token Identification
☆22 Updated 2 months ago
Joining-AI / LLM_Interview_Prepare
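This repository collects common interview questions and interview experiences for large language model interviews. It gathers a wide range of LLM-related interview questions and provides detailed answers and analysis. The repository is maintained by the Shanghai Jiao Tong University Jiaoying community (交影社区).
☆95 Updated 10 months ago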
mit-han-lab / Block-Sparse-Attention
A sparse attention kernel supporting mixed sparse patterns
☆238 Updated 4 months ago
hyperai / triton-cn
Triton documentation in Simplified Chinese
☆71 Updated 2 months ago
TreeAI-Lab / Awesome-KV-Cache-Management
This repository serves as a comprehensive survey of LLM development, featuring numerous research papers along with their corresponding co…
☆154 Updated last week
ZonePG / cs-notes
My CS notes
☆51 Updated 8 months ago
mit-han-lab / x-attention
XAttention: Block Sparse Attention with Antidiagonal Scoring
☆166 Updated this week
mdy666 / Qwen-Native-Sparse-Attention
qwen-nsa
☆67 Updated 2 months ago
JerryYin777 / Cr_Research_Toolchain
Sharing my research toolchain
☆84 Updated last year
interestingLSY / swiftLLM
A tiny yet powerful LLM inference system tailored for research purposes. vLLM-equivalent performance with only 2k lines of code (2% of …
☆224 Updated 2 weeks ago
TonyCrane / slide-template
TonyCrane's slide template for reveal-md
☆99 Updated 11 months ago
October2001 / Awesome-KV-Cache-Compression
📰 Must-read papers on KV Cache Compression (constantly updating 🤗).
☆459 Updated this week
luliyucoordinate / cute-flash-attention
Implement Flash Attention using CuTe.
☆87 Updated 6 months ago
PKUFlyingPig / CMU10-714
Learning material for CMU10-714: Deep Learning Systems
☆256 Updated last year
hhnqqq / MyTransformers
Personal Transformer model training library
☆22 Updated this week
NVlabs / Fast-dLLM
Official implementation of "Fast-dLLM: Training-free Acceleration of Diffusion LLM by Enabling KV Cache and Parallel Decoding"
☆233 Updated 2 weeks ago
kcxain / dlsys
My solutions to the assignments of CMU 10-714 Deep Learning Systems 2022
☆38 Updated last year
KaiLv69 / DuoDecoding
DuoDecoding: Hardware-aware Heterogeneous Speculative Decoding with Dynamic Multi-Sequence Drafting
☆15 Updated 3 months ago
star-hengxing / cs149-xmake
CS149 xmake version
☆41 Updated last year