Oyyko / USTC-Computer-ArchitectureLinks
USTC 体系结构 资料
☆13Updated 3 years ago
Alternatives and similar repositories for USTC-Computer-Architecture
Users that are interested in USTC-Computer-Architecture are comparing it to the libraries listed below
Sorting:
- ☆141Updated last week
- Self implementation of course projects for Computer Architecture 2022 Spring☆10Updated 3 years ago
- The code based on vLLM for the paper “ Cost-Efficient Large Language Model Serving for Multi-turn Conversations with CachedAttention”.☆11Updated last year
- [SIGMOD 2025] PQCache: Product Quantization-based KVCache for Long Context LLM Inference☆81Updated 2 weeks ago
- Homework assignments of Fundamental of Artificial Intelligence (USTC 2020 spring)☆19Updated 5 years ago
- 2022年龙芯杯个人赛 单发射110M(含icache)☆47Updated 3 years ago
- HPC-Lab for High Performance Computing course, 2023 Spring , Tsinghua Universit. 高性能计算导论 @ THU.☆24Updated 2 years ago
- WaferLLM: Large Language Model Inference at Wafer Scale☆78Updated last month
- InfiniGen: Efficient Generative Inference of Large Language Models with Dynamic KV Cache Management (OSDI'24)☆167Updated last year
- Github repository of HPCA 2025 paper "UniNDP: A Unified Compilation and Simulation Tool for Near DRAM Processing Architectures"☆16Updated 2 weeks ago
- ☆26Updated last year
- ☆13Updated 3 years ago
- 高级计算机体系结构2020,吴俊敏老师,中科大研究生课程☆73Updated last year
- ☆35Updated last year
- ☆12Updated last year
- ☆214Updated 2 months ago
- ☆54Updated 3 months ago
- Cheat papers for CS courses in USTC. USTC计算机半开卷大抄☆41Updated 4 years ago
- Large Language Model (LLM) Serving Paper and Resource List☆24Updated 7 months ago
- ArkVale: Efficient Generative LLM Inference with Recallable Key-Value Eviction (NIPS'24)☆50Updated last year
- 中国科学院大学(UCAS)2020年春季学期计算机组成原理实验课作业☆15Updated 3 years ago
- GoPTX: Fine-grained GPU Kernel Fusion by PTX-level Instruction Flow Weaving☆19Updated 4 months ago
- Summary of some awesome work for optimizing LLM inference☆151Updated 3 weeks ago
- Open-source Framework for HPCA2024 paper: Gemini: Mapping and Architecture Co-exploration for Large-scale DNN Chiplet Accelerators☆106Updated 7 months ago
- Repository for HPCGame 1st Problems.☆69Updated last year
- Flash Attention from Scratch on CUDA Ampere☆96Updated 3 months ago
- Code for "Computer Architecture" in 2020 Spring.☆28Updated 5 years ago
- 智能计算系统 AI Computing Systems 陈云霁☆185Updated 3 years ago
- Open-source implementation for "Helix: Serving Large Language Models over Heterogeneous GPUs and Network via Max-Flow"☆74Updated 2 months ago
- A graph pattern mining framework for large graphs on gpu.☆15Updated last year