hwang2006 / CUDA-Accelerated-ComputingLinks
☆11Updated 9 months ago
Alternatives and similar repositories for CUDA-Accelerated-Computing
Users that are interested in CUDA-Accelerated-Computing are comparing it to the libraries listed below
Sorting:
- NeuPIMs: NPU-PIM Heterogeneous Acceleration for Batched LLM Inferencing☆108Updated last year
- ATLAHS: An Application-centric Network Simulator Toolchain for AI, HPC, and Distributed Storage☆70Updated this week
- ☆224Updated 3 months ago
- PrIM (Processing-In-Memory benchmarks) is the first benchmark suite for a real-world processing-in-memory (PIM) architecture. PrIM is dev…☆169Updated last year
- ☆166Updated last year
- Advanced Matrix Extensions (AMX) Guide☆109Updated 4 years ago
- ☆66Updated 7 months ago
- Artifact for paper "PIM is All You Need: A CXL-Enabled GPU-Free System for LLM Inference", ASPLOS 2025☆124Updated 9 months ago
- LLMServingSim: A HW/SW Co-Simulation Infrastructure for LLM Inference Serving at Scale☆178Updated 6 months ago
- A repository where GPU applications are aggregated using a common build flow that supports multiple CUDA versions.☆91Updated last week
- WaferLLM: Large Language Model Inference at Wafer Scale☆87Updated last month
- ☆28Updated last year
- A Cycle-level simulator for M2NDP☆33Updated 5 months ago
- ☆58Updated last year
- A highly-flexible GPU simulator for AMD GPUs.☆214Updated last week
- UPMEM LLM Framework allows profiling PyTorch layers and functions and simulate those layers/functions with a given hardware profile.☆37Updated 6 months ago
- GPGPU-Sim provides a detailed simulation model of a contemporary GPU running CUDA and/or OpenCL workloads and now includes an integrated…☆67Updated 2 weeks ago
- A flexible, high-performance, user-friendly computer architecture simulator engine☆98Updated this week
- Processing-In-Memory (PIM) Simulator☆221Updated last year
- Artifact from "Hardware Compute Partitioning on NVIDIA GPUs". THIS IS A FORK OF BAKITAS REPO. I AM NOT ONE OF THE AUTHORS OF THE PAPER.☆55Updated 2 months ago
- PIMeval simulator and PIMbench suite☆44Updated 2 months ago
- ☆16Updated 2 years ago
- OSDI 2023 Welder, deeplearning compiler☆32Updated 2 years ago
- NEO is a LLM inference engine built to save the GPU memory crisis by CPU offloading☆84Updated 7 months ago
- An open-source simulator framework for neural processing units☆37Updated last week
- libsmctrl论文的复现,添加了python端接口,可以在python端灵活调用接口来分配计算资源☆12Updated last year
- TileFlow is a performance analysis tool based on Timeloop for fusion dataflows☆66Updated last year
- [HPCA'24] Smart-Infinity: Fast Large Language Model Training using Near-Storage Processing on a Real System☆52Updated 6 months ago
- MultiPIM: A Detailed and Configurable Multi-Stack Processing-In-Memory Simulator☆56Updated 4 years ago
- HyFiSS: A Hybrid Fidelity Stall-Aware Simulator for GPGPUs☆39Updated last year