vqd8a / DFAGE
A Deterministic Finite Automata GPU-based Engine
☆17Updated 7 years ago
Alternatives and similar repositories for DFAGE:
Users that are interested in DFAGE are comparing it to the libraries listed below
- ☆9Updated 7 years ago
- High-performance automata-processing engines are traditionally evaluated using a limited set of regular expression rulesets. While regula…☆32Updated last year
- An FPGA integration and acceleration of the popular FAISS framework for approximate similarity search☆23Updated 5 years ago
- Artifact of ASPLOS'23 paper entitled: GRACE: A Scalable Graph-Based Approach to Accelerating Recommendation Model Inference☆18Updated 2 years ago
- PyTorch-Direct code on top of PyTorch-1.8.0nightly (e152ca5) for Large Graph Convolutional Network Training with GPU-Oriented Data Commun…☆44Updated last year
- ColTraIn HBFP Training Emulator☆16Updated 2 years ago
- A Distributed Multi-GPU System for Fast Graph Processing☆65Updated 6 years ago
- A framework for pipelined computing on GPU☆29Updated 5 years ago
- Automata Benchmark Suite☆21Updated last year
- A source-to-source compiler for optimizing CUDA dynamic parallelism by aggregating launches☆15Updated 5 years ago
- An efficient concurrent graph processing system☆46Updated 3 years ago
- Inference framework for MoE layers based on TensorRT with Python binding☆41Updated 3 years ago
- OpenGraph is an open-source graph processing benchmarking suite written in pure C/OpenMP.☆12Updated last year
- Multi-way graph partitioning algorithms: FMS (Fiduccia-Mattheyses-Sanchis), PLM (Partitioning by Locked Moves), PFM (Partitioning by Free…☆38Updated 4 years ago
- ☆9Updated last year
- High performance RDMA-based distributed feature collection component for training GNN model on EXTREMELY large graph☆52Updated 2 years ago
- SMASH is a hardware-software cooperative mechanism that enables highly-efficient indexing and storage of sparse matrices. The key idea of…☆16Updated 4 years ago
- Implementation of FusedMM method for IPDPS 2021 paper titled "FusedMM: A Unified SDDMM-SpMM Kernel for Graph Embedding and Graph Neural N…☆30Updated 2 years ago
- Runtime Tracing Library for TensorFlow☆43Updated 6 years ago
- Light-weight GPU kernel interface for graph operations☆15Updated 4 years ago
- CUDA templates for tile-sparse matrix multiplication based on CUTLASS.☆51Updated 7 years ago
- Code for paper "Design Principles for Sparse Matrix Multiplication on the GPU" accepted to Euro-Par 2018☆71Updated 4 years ago
- FTPipe and related pipeline model parallelism research.☆41Updated last year
- Development repository for integrating FlexFlow (A distributed deep learning framework that supports flexible parallelization strategies)…☆28Updated 3 years ago
- pytorch ucc plugin☆21Updated 3 years ago
- LonestarGPU: Irregular algorithms parallelized for GPUs☆35Updated 5 years ago
- TLB Benchmarks☆33Updated 7 years ago
- VASim is a virtual homogeneous non-deterministic finite automata automata simulator and transformation tool. VASim can parse, transform, …☆36Updated 11 months ago
- Blaze runtime system that support efficient accelerator integration for big data.☆24Updated 8 years ago
- Scalable GPU Kernel Fission/Fusion Transformation for Memory-Bound Kernels☆13Updated 9 years ago