wali-ku / BWLOCK-GPU

Protecting Real-Time GPU Kernels on Integrated CPU-GPU SoC Platforms

☆12

Alternatives and similar repositories for BWLOCK-GPU

Users who are interested in BWLOCK-GPU are comparing it to the repositories listed below.

heechul / palloc
DRAM Bank-Aware Kernel Memory Allocator
☆42 Updated 4 months ago
NVlabs / sassifi
An Architecture-level Fault Injection Tool for GPU Application Resilience Evaluations
☆17 Updated 5 years ago
NUCAR-DEV / Hetero-Mark
A Benchmark Suite for Heterogeneous System Computation
☆53 Updated 3 months ago
vancemiller / CUDA-preemption
Experiments evaluating preemption on the NVIDIA Pascal architecture
☆17 Updated 8 years ago
CoffeeBeforeArch / nvbit_tools
☆13 Updated 4 years ago
travisdowns / x86-loop-test
ASM methods to test small loop performance on x86
☆13 Updated 5 years ago
ARM-software / synchronization-benchmarks
Collection of synchronization micro-benchmarks and traces from infrastructure applications
☆41 Updated last week
ucb-bar / ccbench
Memory System Microbenchmarks
☆62 Updated 2 years ago
sderek / CUDAAdvisor
CUDAAdvisor: a GPU profiling tool
☆49 Updated 6 years ago
hipacc / hipacc
A domain-specific language and compiler for image processing
☆76 Updated 4 years ago
fpgasystems / doppiodb
doppioDB - A hardware accelerated database
☆49 Updated 8 years ago
boschresearch / ros2_response_time_analysis
Supplementary source code for the ECRTS 2019 paper 'Response-Time Analysis of ROS 2 Processing Chains under Reservation-Based Scheduling'
☆28 Updated 4 years ago
mattsinc / heterosync
HeteroSync is a benchmark suite for performing fine-grained synchronization on tightly coupled GPUs
☆30 Updated 8 months ago
CPFL / gdev
First-Class GPU Resource Management: Device Drivers, Runtimes, and CUDA Compilers for Nouveau.
☆48 Updated 7 years ago
bthies / streamit
The StreamIt compiler infrastructure.
☆71 Updated 8 years ago
adwaitjog / mafia
MAFIA: Multiple Application Framework for GPU architectures
☆27 Updated 3 years ago
lightsighter / CudaDMA
Emulating DMA Engines on GPUs for Performance and Portability
☆40 Updated 10 years ago
apc-llc / nvcc-llvm-ir
Enabling on-the-fly manipulations with LLVM IR code of CUDA sources
☆111 Updated last month
dimdano / faiss-fpga
An FPGA integration and acceleration of the popular FAISS framework for approximate similarity search
☆23 Updated 5 years ago
NicolasDenoyelle / Locality-Aware-Roofline-Model
Instantiate the Cache Aware Roofline Model on single socket and multisocket systems.
☆27 Updated 6 years ago
lashgar / ipmacc
IPMACC is a framework for translating OpenACC for C API to CUDA, OpenCL, and Intel ISPC.
☆13 Updated 2 years ago
pannotia / pannotia
Pannotia v0.9 is a suite of OpenCL graph applications
☆24 Updated 7 years ago
escalab / GPTPU
GPTPU for SC 2021
☆52 Updated 2 years ago
xyzsam / mallacc
Mallacc: Accelerating Memory Allocation
☆13 Updated 7 years ago
zhiyisun / enable_arm_pmu
Enable user-mode access to ARMv7/Linux performance counters
☆42 Updated 8 years ago
b-shi / PMC-PMI
Performance Counter Measurements at the cycle granularity
☆18 Updated 3 years ago
Multi2Sim / m2s-bench-amdsdk-2.5-src
AMD Software Development Kit 2.5 Sources
☆10 Updated 9 years ago
pnnl / COMET
☆41 Updated 2 weeks ago
spcl / haystack
Haystack is an analytical cache model that, given a program, computes the number of cache misses.
☆46 Updated 5 years ago
iml130 / mlir-emitc
Conversions to MLIR EmitC
☆128 Updated 5 months ago