Oyyko / USTC-Computer-Architecture
USTC computer architecture course materials
☆13Updated 3 years ago
Alternatives and similar repositories for USTC-Computer-Architecture
Users who are interested in USTC-Computer-Architecture are comparing it to the repositories listed below
- ☆137Updated last week
- Personal implementation of course projects for Computer Architecture, Spring 2022☆10Updated 3 years ago
- WaferLLM: Large Language Model Inference at Wafer Scale☆75Updated 3 weeks ago
- ☆52Updated 2 months ago
- The code based on vLLM for the paper “Cost-Efficient Large Language Model Serving for Multi-turn Conversations with CachedAttention”.☆11Updated last year
- ☆31Updated last year
- Open-source implementation for "Helix: Serving Large Language Models over Heterogeneous GPUs and Network via Max-Flow"☆73Updated last month
- PIM-DL: Expanding the Applicability of Commodity DRAM-PIMs for Deep Learning via Algorithm-System Co-Optimization☆33Updated last year
- GitHub repository for the HPCA 2025 paper "UniNDP: A Unified Compilation and Simulation Tool for Near DRAM Processing Architectures"☆15Updated 2 months ago
- [HPCA'24] Smart-Infinity: Fast Large Language Model Training using Near-Storage Processing on a Real System☆49Updated 4 months ago
- ☆79Updated 3 years ago
- [SIGMOD 2025] PQCache: Product Quantization-based KVCache for Long Context LLM Inference☆76Updated 3 weeks ago
- Advanced Computer Architecture 2020, instructor Wu Junmin, USTC graduate course☆73Updated last year
- A graph pattern mining framework for large graphs on GPU.☆14Updated 11 months ago
- UC Berkeley CS152 Computer Architecture and Engineering Labs☆26Updated 5 years ago
- ☆13Updated 3 years ago
- ☆12Updated last year
- Large Language Model (LLM) Serving Paper and Resource List☆24Updated 6 months ago
- ☆40Updated last year
- ☆61Updated last month
- InfiniGen: Efficient Generative Inference of Large Language Models with Dynamic KV Cache Management (OSDI'24)☆161Updated last year
- ☆14Updated last year
- ☆40Updated 2 years ago
- The Artifact of NeoMem: Hardware/Software Co-Design for CXL-Native Memory Tiering☆59Updated last year
- University of Chinese Academy of Sciences (UCAS) Spring 2020 Principles of Computer Organization lab assignments☆16Updated 3 years ago
- Artifact for paper "PIM is All You Need: A CXL-Enabled GPU-Free System for LLM Inference", ASPLOS 2025☆105Updated 6 months ago
- ☆207Updated last month
- ☆24Updated 9 months ago
- MAGIS: Memory Optimization via Coordinated Graph Transformation and Scheduling for DNN (ASPLOS'24)☆55Updated last year
- ☆23Updated last year