brucechin / HardwareTest
Hardware test for CPU, GPU, I/O, and memory bandwidth performance
☆25Updated 6 years ago
Alternatives and similar repositories for HardwareTest:
Users who are interested in HardwareTest are comparing it to the libraries listed below.
- Run SPEC CPU2006 on Linux with an Intel, ARM, or PowerPC processor.☆25Updated 6 years ago
- LeapIO: Efficient and Portable Virtual NVMe Storage on ARM SoCs (ASPLOS'20)☆28Updated 3 years ago
- Near-storage compute aware file system and FPGA operator pipelines.☆29Updated 3 years ago
- Emulating DMA Engines on GPUs for Performance and Portability☆38Updated 9 years ago
- Benchmark suite containing cache filtered traces for use with Ramulator. These include some of the workloads used in our SIGMETRICS 2019 …☆19Updated 4 years ago
- User Space NVMe Driver☆23Updated 8 years ago
- QStack, a high-concurrency-and-low-latency user-level TCP stack for multicore systems, which can handle TCP concurrent connection in 10 m…☆19Updated last year
- Enhanced PQOS (Intel RDT Software) with DDIO-related Functionalities☆15Updated 2 years ago
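- Notes and thoughts from reading various papers (focused on computer architecture, machine learning systems, deep learning, and computer vision)☆24Updated 5 years ago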
- A disaggregated memory orchestration system that virtualizes cluster-wide memory to scale data-intensive, large-memory workloads in virtu…☆13Updated 5 years ago
- Linux kernel source tree of developing SMDK kernel for CXL Memory☆10Updated last year
- PyTorch compilation tutorial covering TorchScript, torch.fx, and Slapo☆18Updated last year
- Artifact of ASPLOS'23 paper entitled: GRACE: A Scalable Graph-Based Approach to Accelerating Recommendation Model Inference☆17Updated last year
- Artifacts for SOSP'19 paper Optimizing Deep Learning Computation with Automatic Generation of Graph Substitutions☆21Updated 2 years ago
- An external memory allocator example for PyTorch.☆14Updated 3 years ago
- STREAMer: Benchmarking remote volatile and non-volatile memory bandwidth☆16Updated last year
- GPUDirect example☆58Updated 3 years ago
- Scalable GPU Kernel Fission/Fusion Transformation for Memory-Bound Kernels☆13Updated 9 years ago
- SmartNIC☆14Updated 6 years ago
- Persistent Collectives X: a collective communication library for high-performance, low-cost persistent collectives over RDMA devices.☆14Updated 6 years ago
- Tacker: Tensor-CUDA Core Kernel Fusion for Improving the GPU Utilization while Ensuring QoS☆19Updated 3 weeks ago
- High-performance eBPF implementation in hardware.☆27Updated 2 years ago
- A memory profiler for NVIDIA GPUs to explore memory inefficiencies in GPU-accelerated applications.☆25Updated 4 months ago
- Official repository for "IPDPS'24 QSync: Quantization-Minimized Synchronous Distributed Training Across Hybrid Devices".☆19Updated last year
- ☆60Updated 3 weeks ago
- CacheDirector - Sending Packets to the Right Slice by Exploiting Intel Last-Level Cache Addressing☆12Updated 5 years ago
- Reading seminar in Harvard Cloud Networking and Systems Group☆16Updated 2 years ago
- An NVMe Device Simulation Library.☆51Updated 2 years ago
- Mellanox libibverbs☆60Updated 5 years ago
- A source-to-source compiler for optimizing CUDA dynamic parallelism by aggregating launches☆15Updated 5 years ago