ChenyangZhang-cs / iMLBenchLinks
iMLBench is a machine learning benchmark suite targeting CPU-GPU integrated architectures.
☆11Updated 4 years ago
Alternatives and similar repositories for iMLBench
Users that are interested in iMLBench are comparing it to the libraries listed below
Sorting:
- ☆210Updated last month
- A pattern-based algorithmic autotuner for graph processing on GPUs.☆31Updated 6 months ago
- TLB Benchmarks☆35Updated 8 years ago
- Multi-GPU dynamic scheduler using PGAS style cross-GPU communication☆29Updated 2 years ago
- Fine-grained GPU sharing primitives☆147Updated 5 months ago
- ☆36Updated last year
- Dorylus: Affordable, Scalable, and Accurate GNN Training☆76Updated 4 years ago
- Virtual Memory Abstraction for Serverless Architectures☆49Updated 3 years ago
- ☆41Updated 2 years ago
- A benchmarking suite for heterogeneous systems. The primary goal of this project is to improve and update aspects of existing benchmarkin…☆43Updated last year
- ☆24Updated 2 years ago
- ☆31Updated last year
- [USENIX ATC 2021] Exploring the Design Space of Page Management for Multi-Tiered Memory Systems☆48Updated 3 years ago
- Pond: CXL-Based Memory Pooling Systems for Cloud Platforms (ASPLOS'23)☆216Updated last year
- Code for paper "Engineering a High-Performance GPU B-Tree" accepted to PPoPP 2019☆57Updated 3 years ago
- Paella: Low-latency Model Serving with Virtualized GPU Scheduling☆66Updated last year
- ☆32Updated 5 years ago
- A LogGOPS (LogP, LogGP, LogGPS) Simulator and Simulation Framework☆14Updated last year
- PetPS: Supporting Huge Embedding Models with Tiered Memory☆33Updated last year
- ☆23Updated 2 years ago
- A GPU-accelerated DNN inference serving system that supports instant kernel preemption and biased concurrent execution in GPU scheduling.☆44Updated 3 years ago
- Hydra adds resilience and high availability to remote memory solutions.☆33Updated 3 years ago
- A User-Transparent Block Cache Enabling High-Performance Out-of-Core Processing with In-Memory Programs☆75Updated last month
- A Memory-Disaggregated Managed Runtime.☆67Updated 4 years ago
- ☆56Updated 4 years ago
- Rcmp: Reconstructing RDMA-based Memory Disaggregation via CXL☆60Updated 2 years ago
- Tartan: Evaluating Modern GPU Interconnect via a Multi-GPU Benchmark Suite☆68Updated 7 years ago
- ☆25Updated 3 years ago
- PipeSwitch: Fast Pipelined Context Switching for Deep Learning Applications☆127Updated 3 years ago
- GPUDirect Async support for IB Verbs☆134Updated 3 years ago