readwrite112 / AGAThALinks

PPoPP24 AGAThA: Fast and Efficient GPU Acceleration of Guided Sequence Alignment for Long Read Mapping

☆20

Alternatives and similar repositories for AGAThA

Users that are interested in AGAThA are comparing it to the libraries listed below

Sorting:

AIS-SNU / PID-Comm
☆24Updated 7 months ago
rishucoding / reproduce_MICRO24_GPU_DLRM_inference
Sharing the codebase and steps for artifact evaluation/reproduction for MICRO 2024 paper
☆9Updated 10 months ago
yonsei-hpcp / pid-join
☆11Updated 2 months ago
YukeWang96 / MGG_OSDI23
Artifact for OSDI'23: MGG: Accelerating Graph Neural Networks with Fine-grained intra-kernel Communication-Computation Pipelining on Mult…
☆40Updated last year
AIS-SNU / Smart-Infinity
[HPCA'24] Smart-Infinity: Fast Large Language Model Training using Near-Storage Processing on a Real System
☆46Updated last year
arun-sub / genomicsbench
A benchmark suite to study the performance characteristics of genomics applications
☆32Updated 8 months ago
YukeWang96 / TC-GNN_ATC23
Artifact for USENIX ATC'23: TC-GNN: Bridging Sparse GNN Computation and Dense Tensor Cores on GPUs.
☆49Updated last year
tallendev / uvm-eval
This serves as a repository for reproducibility of the SC21 paper "In-Depth Analyses of Unified Virtual Memory System for GPU Accelerated…
☆33Updated last year
dovedevic / blimp
A PIM instrumentation, compilation, execution, simulation, and evaluation repository for BLIMP-style architectures.
☆18Updated 3 years ago
CMU-SAFARI / GenStore
GenStore is the first in-storage processing system designed for genome sequence analysis that greatly reduces both data movement and comp…
☆14Updated 3 years ago
nullplay / Workload-Aware-Co-Optimization
Workload-Aware Co-Optimization
☆8Updated 2 years ago
leesou / PIM-DL-ASPLOS
PIM-DL: Expanding the Applicability of Commodity DRAM-PIMs for Deep Learning via Algorithm-System Co-Optimization
☆31Updated last year
upmem / upmem_llm_framework
UPMEM LLM Framework allows profiling PyTorch layers and functions and simulate those layers/functions with a given hardware profile.
☆31Updated last week
pku-liang / MAGIS
MAGIS: Memory Optimization via Coordinated Graph Transformation and Scheduling for DNN (ASPLOS'24)
☆52Updated last year
Yufeng98 / CENT
Artifact for paper "PIM is All You Need: A CXL-Enabled GPU-Free System for LLM Inference", ASPLOS 2025
☆76Updated 2 months ago
PSAL-POSTECH / M2NDP-public
A Cycle-level simulator for M2NDP
☆28Updated 2 months ago
CMU-SAFARI / PyGim
PyGim is the first runtime framework to efficiently execute Graph Neural Networks (GNNs) on real Processing-in-Memory systems. It provide…
☆26Updated 2 months ago
YukeWang96 / GNNAdvisor_OSDI21
Artifact for OSDI'21 GNNAdvisor: An Adaptive and Efficient Runtime System for GNN Acceleration on GPUs.
☆66Updated 2 years ago
SNU-ARC / Ginex
Ginex: SSD-enabled Billion-scale Graph Neural Network Training on a Single Machine via Provably Optimal In-memory Caching
☆38Updated last year
OSU-STARLAB / UVM_benchmark
☆27Updated 4 years ago
dglai / FeatGraph
Sparse kernels for GNNs based on TVM
☆17Updated 4 years ago
PAA-NCIC / GSWITCH
A pattern-based algorithmic autotuner for graph processing on GPUs.
☆31Updated 3 weeks ago
jeongminpark417 / GIDS
☆35Updated last month
miglopst / PIM_NDP_papers
☆65Updated 4 years ago
owensgroup / ATOS
Multi-GPU dynamic scheduler using PGAS style cross-GPU communication
☆29Updated last year
YukeWang96 / QGTC_PPoPP22
Artifact for PPoPP22 QGTC: Accelerating Quantized GNN via GPU Tensor Core.
☆30Updated 3 years ago
platformxlab / G10
☆37Updated last year
ucamrl / xrlflow
☆15Updated 2 years ago
AutomataLab / Subway
Out-of-GPU-Memory Graph Processing with Minimal Data Transfer
☆55Updated 2 years ago
AIS-SNU / Optimus-CC
[ASPLOS'23] Optimus-CC: Efficient Large NLP Model Training with 3D Parallelism Aware Communication Compression
☆6Updated 11 months ago