escalab/SIMD2

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/escalab/SIMD2)

escalab / SIMD2

☆31

Alternatives and similar repositories for SIMD2

Users that are interested in SIMD2 are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

escalab / GPTPU
View on GitHub
GPTPU for SC 2021
☆52Mar 22, 2023Updated 3 years ago
flashmobwalk / flashmob
View on GitHub
FlashMob is a shared-memory random walk system.
☆33Jul 7, 2023Updated 3 years ago
hpcgarage / cuASR
View on GitHub
cuASR: CUDA Algebra for Semirings
☆49Aug 22, 2022Updated 3 years ago
gunrock / mini
View on GitHub
mini is mini
☆20Jan 19, 2020Updated 6 years ago
PASSIONLab / distributed_sddmm
View on GitHub
Distributed SDDMM Kernel
☆12Jul 8, 2022Updated 4 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
vickiegpt / wiki
View on GitHub
A place to store my knowledge base
☆12Apr 27, 2026Updated 2 months ago
regehr / pldi22-llvm-tutorial
View on GitHub
outline and links for PLDI 2022 tutorial
☆17Jun 13, 2022Updated 4 years ago
harvard-edge / Gables
View on GitHub
☆15Apr 3, 2020Updated 6 years ago
Lin-Mao / DrGPUM
View on GitHub
A memory profiler for NVIDIA GPUs to explore memory inefficiencies in GPU-accelerated applications.
☆36May 30, 2026Updated last month
microideax / T-DLA
View on GitHub
☆20Dec 3, 2019Updated 6 years ago
KireinaHoro / rocket-zynqmp
View on GitHub
☆13Jan 20, 2021Updated 5 years ago
GraphBLAS / python-suitesparse-graphblas
View on GitHub
Python CFFI Binding around SuiteSparse:GraphBLAS
☆24Apr 27, 2026Updated 2 months ago
FPGA-MAFIA / fpga_mafia
View on GitHub
Designing a Multi-Agent Fabric Integration Architecture to run on de10-lite FPGA.
☆18Apr 28, 2026Updated 2 months ago
UCLA-SEAL / HeteroGen
View on GitHub
HeteroGen: transpiling C to heterogeneous HLS code with automated test generation and program repair (ASPLOS 2022)
☆16Sep 25, 2024Updated last year
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
pku-liang / AMOS
View on GitHub
Automatic Mapping Generation, Verification, and Exploration for ISA-based Spatial Accelerators
☆125Oct 26, 2022Updated 3 years ago
uuudown / SBNN
View on GitHub
Singular Binarized Neural Network based on GPU Bit Operations (see our SC-19 paper)
☆17Dec 9, 2020Updated 5 years ago
jinzh-hust / GraphM
View on GitHub
An efficient storage system for concurrent graph processing
☆10Feb 1, 2021Updated 5 years ago
Accelergy-Project / micro22-sparseloop-artifact
View on GitHub
MICRO22 artifact evaluation for Sparseloop
☆48Aug 8, 2022Updated 3 years ago
spcl / mlir-dace
View on GitHub
Data-Centric MLIR dialect
☆47Oct 16, 2023Updated 2 years ago
lenLRX / AmpereSparseMatmul
View on GitHub
study of Ampere' Sparse Matmul
☆18Jan 10, 2021Updated 5 years ago
concept-inversion / C-SAW
View on GitHub
A Framework for Graph Sampling and Random Walk on GPUs.
☆38Feb 3, 2025Updated last year
xxyux / SpInfer
View on GitHub
SpInfer: Leveraging Low-Level Sparsity for Efficient Large Language Model Inference on GPUs
☆68Mar 25, 2025Updated last year
NGIOproject / PMTutorial
View on GitHub
Slides and exercises for persistent memory programming tutorial
☆14Nov 14, 2022Updated 3 years ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
gevico / cosim-gpu
View on GitHub
a QEMU + gem5 co-simulation framework for AMD MI300X GPU research.
☆58Updated this week
weiT1993 / CutQC
View on GitHub
☆33Nov 11, 2024Updated last year
Jokeren / GPA
View on GitHub
GPU Performance Advisor
☆66Jul 25, 2022Updated 3 years ago
IST-DASLab / gemm-int8
View on GitHub
High Performance Int8 GEMM Kernels for SM80 and later GPUs.
☆23Mar 11, 2025Updated last year
ustcadsl / GraphWalker
View on GitHub
☆19Jul 1, 2020Updated 6 years ago
escalab / RTSpMSpM
View on GitHub
☆25Apr 13, 2025Updated last year
PolyArch / dsagen2
View on GitHub
Domain-Specific Architecture Generator 2
☆26Oct 2, 2022Updated 3 years ago
MoZeWei / moTuner
View on GitHub
☆10May 12, 2022Updated 4 years ago
ucb-bar / cva6-wrapper
View on GitHub
Wrapper for ETH Ariane Core
☆21Sep 2, 2025Updated 10 months ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
chhzh123 / ptc-tutorial
View on GitHub
PyTorch compilation tutorial covering TorchScript, torch.fx, and Slapo
☆17Mar 13, 2023Updated 3 years ago
IronySuzumiya / NiuDianNao
View on GitHub
A simple cycle-accurate DaDianNao simulator
☆13Mar 27, 2019Updated 7 years ago
google / iopddl
View on GitHub
Supplemental materials for The ASPLOS 2025 / EuroSys 2025 Contest on Intra-Operator Parallelism for Distributed Deep Learning
☆25May 12, 2025Updated last year
gunrock / graphblast
View on GitHub
High-Performance Linear Algebra-based Graph Primitives on GPUs
☆238Jul 2, 2021Updated 5 years ago
mitdbg / imputedb
View on GitHub
A database with automatic dynamic imputation of missing values.
☆11Nov 2, 2017Updated 8 years ago
infovillasimius / flows
View on GitHub
Network Flows Optimization - Shortest Path, Max Flow and Min Cost Flow Algorithms in Python
☆11Sep 13, 2019Updated 6 years ago
YukeWang96 / TC-GNN_ATC23
View on GitHub
Artifact for USENIX ATC'23: TC-GNN: Bridging Sparse GNN Computation and Dense Tensor Cores on GPUs.
☆58Oct 16, 2023Updated 2 years ago