pittisl / Generative-AI-TutorialLinks
A subjective learning guide for generative AI research
β88Updated last year
Alternatives and similar repositories for Generative-AI-Tutorial
Users that are interested in Generative-AI-Tutorial are comparing it to the libraries listed below
Sorting:
- Survey Paper List - Efficient LLM and Foundation Modelsβ258Updated last year
- a curated list of high-quality papers on resource-efficient LLMs π±β150Updated 9 months ago
- β102Updated last year
- Systems for GenAIβ148Updated 8 months ago
- [TKDE'25] The official GitHub page for the survey paper "A Survey on Mixture of Experts in Large Language Models".β464Updated 4 months ago
- Awesome-LLM-KV-Cache: A curated list of πAwesome LLM KV Cache Papers with Codes.β401Updated 9 months ago
- Federated Learning Systems Paper Listβ75Updated last year
- All Homeworks for TinyML and Efficient Deep Learning Computing 6.5940 β’ Fall β’ 2023 β’ https://efficientml.aiβ188Updated 2 years ago
- Curated collection of papers in MoE model inferenceβ314Updated 2 months ago
- [ICMLβ24] Official code for the paper "Revisiting Zeroth-Order Optimization for Memory-Efficient LLM Fine-Tuning: A Benchmark ".β120Updated 5 months ago
- A curated list of early exiting (LLM, CV, NLP, etc)β69Updated last year
- β611Updated 7 months ago
- This repository serves as a comprehensive survey of LLM development, featuring numerous research papers along with their corresponding coβ¦β258Updated 2 weeks ago
- Since the emergence of chatGPT in 2022, the acceleration of Large Language Model has become increasingly important. Here is a list of papβ¦β282Updated 9 months ago
- paper and its code for AI Systemβ341Updated last week
- Official Repo for "SplitQuant / LLM-PQ: Resource-Efficient LLM Offline Serving on Heterogeneous GPUs via Phase-Aware Model Partition and β¦β35Updated 3 months ago
- Official implementation for Yuan & Liu & Zhong et al., KV Cache Compression, But What Must We Give in Return? A Comprehensive Benchmark oβ¦β87Updated 9 months ago
- [ICLR 2025] PEARL: Parallel Speculative Decoding with Adaptive Draft Lengthβ137Updated last month
- Course materials for MIT6.5940: TinyML and Efficient Deep Learning Computingβ63Updated 11 months ago
- π° Must-read papers on KV Cache Compression (constantly updating π€).β622Updated 2 months ago
- [NeurIPS 2024] Implementation of paper - D-LLM: A Token Adaptive Computing Resource Allocation Strategy for Large Language Modelsβ23Updated 8 months ago
- https://tacc.ust.hkβ82Updated 2 years ago
- [ACL'25] Code for ACL'25 paper "IRT-Router: Effective and Interpretable Multi-LLM Routing via Item Response Theory"β22Updated 10 months ago
- Awesome list for LLM quantizationβ370Updated 2 months ago
- [TMLR 2024] Efficient Large Language Models: A Surveyβ1,239Updated 5 months ago
- Summary of some awesome work for optimizing LLM inferenceβ150Updated 2 weeks ago
- β63Updated last year
- The repository has collected a batch of noteworthy MLSys bloggers (Algorithms/Systems)β307Updated 11 months ago
- DeepSeek Native Sparse Attention pytorch implementationβ109Updated last month
- PyTorch library for cost-effective, fast and easy serving of MoE models.β265Updated 2 months ago