vqd8a / DFAGE
A Deterministic Finite Automata GPU-based Engine
☆17Updated 7 years ago
Alternatives and similar repositories for DFAGE:
Users that are interested in DFAGE are comparing it to the libraries listed below
- Artifact of ASPLOS'23 paper entitled: GRACE: A Scalable Graph-Based Approach to Accelerating Recommendation Model Inference☆17Updated last year
- High-performance automata-processing engines are traditionally evaluated using a limited set of regular expression rulesets. While regula…☆31Updated last year
- An efficient concurrent graph processing system☆46Updated 3 years ago
- ☆9Updated 7 years ago
- LonestarGPU: Irregular algorithms parallelized for GPUs☆33Updated 5 years ago
- Runtime Tracing Library for TensorFlow☆43Updated 6 years ago
- An IR for efficiently simulating distributed ML computation.☆27Updated last year
- A Distributed Multi-GPU System for Fast Graph Processing☆65Updated 6 years ago
- Machine Learning System☆14Updated 4 years ago
- Thinking is hard - automate it☆19Updated 2 years ago
- Graphiler is a compiler stack built on top of DGL and TorchScript which compiles GNNs defined using user-defined functions (UDFs) into ef…☆61Updated 2 years ago
- ☆73Updated 3 years ago
- GPU Optimization and Memory Abstraction Framework☆32Updated 5 years ago
- GPU Performance Advisor☆64Updated 2 years ago
- Light-weight GPU kernel interface for graph operations☆15Updated 4 years ago
- ☆23Updated 5 years ago
- A source-to-source compiler for optimizing CUDA dynamic parallelism by aggregating launches☆15Updated 5 years ago
- GVProf: A Value Profiler for GPU-based Clusters☆49Updated 10 months ago
- Codes of the paper "Speeding Up Set Intersections in Graph Algorithms using SIMD Instructions" that was published in SIGMOD 2018. Authors…☆26Updated 6 years ago
- A framework for pipelined computing on GPU☆29Updated 5 years ago
- ☆69Updated last year
- Artifacts for SOSP'19 paper Optimizing Deep Learning Computation with Automatic Generation of Graph Substitutions☆21Updated 2 years ago
- FTPipe and related pipeline model parallelism research.☆41Updated last year
- TLB Benchmarks☆33Updated 7 years ago
- ☆25Updated last year
- Kernel Fusion and Runtime Compilation Based on NNVM☆70Updated 8 years ago
- A library for syntactically rewriting Python programs, pronounced (sinner).☆70Updated 3 years ago
- ☆21Updated 2 years ago
- High performance RDMA-based distributed feature collection component for training GNN model on EXTREMELY large graph☆50Updated 2 years ago
- Set of datasets for the deep learning recommendation model (DLRM).☆41Updated 2 years ago