IBM / stomp
STOMP: Scheduling Techniques Optimization in heterogeneous Multi-Processors
☆20 Updated 4 months ago
Alternatives and similar repositories for stomp:
Users who are interested in stomp compare it to the repositories listed below:
- Mini-ERA is a simplified still-representative version of the main ERA workload. ☆14 Updated 2 years ago
- A graph linear algebra overlay ☆50 Updated last year
- Simple SAT solver with CDCL implemented in Python ☆16 Updated 2 years ago
- CHO is a benchmark suite for OpenCL FPGA Accelerators ☆18 Updated 7 years ago
- SMASH is a hardware-software cooperative mechanism that enables highly-efficient indexing and storage of sparse matrices. The key idea of… ☆15 Updated 4 years ago
- Implementation of the HYPE hypergraph partitioner. ☆18 Updated 5 years ago
- Productive and portable performance programming across spatial architectures (FPGAs, etc.) and vector architectures (GPUs, etc.) ☆30 Updated 8 months ago
- The implementation for maximum clique enumeration algorithm ☆11 Updated 8 years ago
- A benchmark suite for Graph Machine Learning ☆18 Updated 3 months ago
- An Architecture-level Fault Injection Tool for GPU Application Resilience Evaluations ☆16 Updated 4 years ago
- SparseP is the first open-source Sparse Matrix Vector Multiplication (SpMV) software package for real-world Processing-In-Memory (PIM) ar… ☆70 Updated 2 years ago
- MAFIA: Multiple Application Framework for GPU architectures ☆25 Updated 3 years ago
- FPGA-based HyperLogLog Accelerator ☆12 Updated 4 years ago
- [CF ’20] Verified Instruction-Level Energy Consumption Measurement for NVIDIA GPUs ☆15 Updated 4 years ago
- SST Macro Element Library ☆35 Updated 3 months ago
- ☆36 Updated 3 years ago
- GenStore is the first in-storage processing system designed for genome sequence analysis that greatly reduces both data movement and comp… ☆12 Updated 2 years ago
- Home of ALP/GraphBLAS and ALP/Pregel, featuring shared- and distributed-memory auto-parallelisation of linear algebraic and vertex-centri… ☆25 Updated this week
- A Deep-Reinforcement-Learning-Based Scheduler for FPGA HLS ☆14 Updated 3 years ago
- Streaming Message Interface: High-Performance Distributed Memory Programming on Reconfigurable Hardware ☆16 Updated 2 years ago
- IronMan+alpha: Graph Neural Network and Reinforcement Learning in High-Level Synthesis ☆24 Updated 2 years ago
- Convert C files into Verilog ☆16 Updated 6 years ago
- mini is mini ☆19 Updated 5 years ago
- DATuner Repository ☆18 Updated 6 years ago
- Global Memory and Threading runtime system ☆23 Updated 8 months ago
- Near-storage compute aware file system and FPGA operator pipelines. ☆29 Updated 2 years ago
- PyTorch compilation tutorial covering TorchScript, torch.fx, and Slapo ☆19 Updated last year
- https://arxiv.org/abs/1706.04972 ☆42 Updated 6 years ago
- Concurrent CPU-GPU Programming using Task Models ☆100 Updated 5 years ago
- BLAS implementation for Intel FPGA ☆76 Updated 4 years ago