escalab/RTSpMSpM

☆25

Alternatives and similar repositories for RTSpMSpM

Users who are interested in RTSpMSpM are comparing it to the libraries listed below.

ubc-aamodt-group / vulkan-sim
Vulkan-Sim is a GPU architecture simulator for Vulkan ray tracing based on GPGPU-Sim and Mesa.
☆82 · Jan 31, 2025 · Updated last year

Deep-Learning-Profiling-Tools / fasten
☆14 · Apr 24, 2024 · Updated 2 years ago

getianao / ngAP
ngAP's artifact for ASPLOS'24
☆25 · Jul 29, 2025 · Updated 11 months ago

csl-iisc / MGVM-MICRO2022
☆12 · Oct 25, 2022 · Updated 3 years ago

NeuraChip / neurachip
NeuraChip Accelerator Simulator
☆16 · Apr 26, 2024 · Updated 2 years ago

SNU-ARC / MERCI
☆18 · May 8, 2021 · Updated 5 years ago

SYSU-SCC / sysu-scc-spack-repo
Spack package repository maintained by Student Cluster Competition Team @ Sun Yat-sen University.
☆16 · Aug 20, 2025 · Updated 11 months ago

ZhangJingrong / gpu_topK_benchmark
GPU TopK Benchmark
☆18 · Dec 19, 2024 · Updated last year

Leo9660 / HedraRAG_AE
Artifact Evaluation for SOSP 2025
☆21 · Aug 16, 2025 · Updated 11 months ago

bigwater / gpunfa-artifact
☆19 · Nov 21, 2022 · Updated 3 years ago

ShaoqiangLu / DFVG
DFVG: A Heterogeneous Architecture for Speculative Decoding with Draft-on-FPGA and Verify-on-GPU.
☆25 · Nov 26, 2025 · Updated 7 months ago

araij / rabbit_order
☆49 · Jan 30, 2026 · Updated 5 months ago

monellz / FlashTensor
☆19 · Mar 4, 2025 · Updated last year

guqiqi / Samoyeds
Samoyeds: Accelerating MoE Models with Structured Sparsity Leveraging Sparse Tensor Cores (EuroSys'25)
☆16 · Jul 17, 2025 · Updated last year

ParCoreLab / aCG
GPU-accelerated linear solvers based on the conjugate gradient (CG) method, supporting NVIDIA and AMD GPUs with GPU-aware MPI, NCCL, RCCL…
☆16 · Mar 14, 2026 · Updated 4 months ago

redbird-arch / isca2025-chimera-artifact
Artifact of Chimera
☆18 · May 6, 2025 · Updated last year

c3sr / tcu_scope
☆50 · Jun 27, 2019 · Updated 7 years ago

Guangxuan-Xiao / SPMM-CUDA
☆13 · Jun 23, 2022 · Updated 4 years ago

accel-sim / accel-sim-framework
This is the top-level repository for the Accel-Sim framework.
☆626 · Mar 24, 2026 · Updated 3 months ago

pulp-platform / hwpe-mac-engine
An example Hardware Processing Engine
☆12 · Feb 4, 2023 · Updated 3 years ago

PanZaifeng / FastTree-Artifact
☆32 · Mar 24, 2025 · Updated last year

model-similarity / lm-similarity
☆21 · Feb 10, 2025 · Updated last year

horizon-research / rtnn
☆76 · Oct 6, 2025 · Updated 9 months ago

Lin-Mao / DrGPUM
A memory profiler for NVIDIA GPUs to explore memory inefficiencies in GPU-accelerated applications.
☆36 · May 30, 2026 · Updated last month

CLab-HKUST-GZ / micro58-axcore
☆41 · Oct 21, 2025 · Updated 8 months ago

mattsinc / heterosync
HeteroSync is a benchmark suite for performing fine-grained synchronization on tightly coupled GPUs
☆32 · Sep 19, 2024 · Updated last year

Mys7erio / eBPF-Sentinel
High-throughput, in-kernel network packet classification using machine learning (Proof-of-Concept)
☆15 · Aug 11, 2025 · Updated 11 months ago

AlibabaResearch / recom
An Optimizing Compiler for Recommendation Model Inference
☆26 · Jun 5, 2025 · Updated last year

aoli-al / HFuse
Horizontal Fusion
☆24 · Jan 7, 2022 · Updated 4 years ago

UCI-CORSA / TeLLMe_FPGA_2026
TeLLMe: An Efficient End-to-End Ternary LLM Prefill and Decode Accelerator with Table-Lookup Matmul on Edge FPGAs [FPGA2026]
☆32 · Mar 11, 2026 · Updated 4 months ago

YukeWang96 / GNNAdvisor_OSDI21
Artifact for OSDI'21 GNNAdvisor: An Adaptive and Efficient Runtime System for GNN Acceleration on GPUs.
☆71 · Mar 2, 2023 · Updated 3 years ago

AnyDSL / traversal
AnyDSL traversal code
☆15 · Feb 18, 2019 · Updated 7 years ago

cmuparlay / pbbsbench
New version of pbbs benchmarks
☆97 · Nov 25, 2025 · Updated 7 months ago

RIKEN-RCCS / GEMMul8
GEMMul8 (GEMMulate): GEMM emulation and its extension to BLAS-like matrix operations using INT8/FP8 matrix engines based on the Ozaki Sch…
☆82 · Jul 12, 2026 · Updated last week

ranggihwang / Pregated_MoE
☆62 · May 4, 2024 · Updated 2 years ago

escalab / SIMD2
☆31 · Jun 15, 2022 · Updated 4 years ago

Bihaqo / tf_einsum_opt
Optimize the order of execution for tf.einsum
☆14 · May 31, 2017 · Updated 9 years ago

z-lab / flash-colreduce
Fast, memory-efficient attention column reduction (e.g., sum, mean, max)
☆49 · Feb 10, 2026 · Updated 5 months ago

CMU-SAFARI / pLUTo
pLUTo is a DRAM-based Processing-using-Memory architecture that leverages the high density of DRAM to enable the massively parallel stori…
☆20 · Jan 12, 2023 · Updated 3 years ago