SiriusNEO / NightWizardLinks
SJTU CS2951 Computer Architecture Course Project, A Verilog HDL implemented RISC-V CPU.
☆10Updated 3 years ago
Alternatives and similar repositories for NightWizard
Users that are interested in NightWizard are comparing it to the libraries listed below
Sorting:
- MS108 Course Project, SJTU ACM Class.☆31Updated 2 years ago
- ☆78Updated last year
- ☆13Updated 2 years ago
- ☆123Updated this week
- A record of reading list on some MLsys popular topic☆16Updated 6 months ago
- A Compiler from "Mx* language" (A C++ & Java like language) to RV32I Assembly, with optimizations on LLVM IR. SJTU CS2966 Project.☆12Updated 2 years ago
- WaferLLM: Large Language Model Inference at Wafer Scale☆55Updated last week
- A RISC-V simulator☆36Updated 2 years ago
- GoPTX: Fine-grained GPU Kernel Fusion by PTX-level Instruction Flow Weaving☆18Updated last month
- Github repository of HPCA 2025 paper "UniNDP: A Unified Compilation and Simulation Tool for Near DRAM Processing Architectures"☆13Updated 3 weeks ago
- ☆187Updated last year
- ☆77Updated 3 years ago
- MAGIS: Memory Optimization via Coordinated Graph Transformation and Scheduling for DNN (ASPLOS'24)☆54Updated last year
- Artifact for paper "PIM is All You Need: A CXL-Enabled GPU-Free System for LLM Inference", ASPLOS 2025☆90Updated 4 months ago
- ☆24Updated last year
- InfiniGen: Efficient Generative Inference of Large Language Models with Dynamic KV Cache Management (OSDI'24)☆154Updated last year
- [HPCA'24] Smart-Infinity: Fast Large Language Model Training using Near-Storage Processing on a Real System☆49Updated 2 months ago
- Open-source Framework for HPCA2024 paper: Gemini: Mapping and Architecture Co-exploration for Large-scale DNN Chiplet Accelerators☆93Updated 4 months ago
- ☆12Updated 3 years ago
- NeuPIMs: NPU-PIM Heterogeneous Acceleration for Batched LLM Inferencing☆93Updated last year
- ☆17Updated 3 years ago
- [NeurIPS 2025] ClusterFusion: Expanding Operator Fusion Scope for LLM Inference via Cluster-Level Collective Primitive☆18Updated last week
- ☆89Updated last year
- ArkVale: Efficient Generative LLM Inference with Recallable Key-Value Eviction (NIPS'24)☆43Updated 9 months ago
- The code based on vLLM for the paper “ Cost-Efficient Large Language Model Serving for Multi-turn Conversations with CachedAttention”.☆11Updated last year
- ☆151Updated 7 months ago
- H2-LLM: Hardware-Dataflow Co-Exploration for Heterogeneous Hybrid-Bonding-based Low-Batch LLM Inference☆63Updated 5 months ago
- ☆52Updated last year
- UC Berkeley CS152 Computer Architecture and Engineering Labs☆25Updated 5 years ago
- ☆47Updated last week