BrunoMeyer / gpu-rsfk
A GPU (CUDA) implementation, with a python interface, of the approximated KNN graph computation with Random Sample Forest algorithm KNN.
☆12Updated last year
Related projects ⓘ
Alternatives and complementary repositories for gpu-rsfk
- Near-storage compute aware file system and FPGA operator pipelines.☆29Updated 2 years ago
- ☆29Updated 2 years ago
- A simple script to plot the Roofline model for given HW platforms and applications☆9Updated 3 months ago
- ☆12Updated 4 years ago
- ☆19Updated last year
- ☆21Updated 2 years ago
- Multi-armed bandit algorithm with tensorflow and 11 policies☆13Updated last year
- Public Release of Stream-Dataflow☆14Updated 5 years ago
- Planetoid datasets. Consist of Cora, Pubmed, Citeseer, Large_Cora, nell.0.1, nell.0.01, nell.0.001.☆12Updated 5 years ago
- PyTorch compilation tutorial covering TorchScript, torch.fx, and Slapo☆19Updated last year
- Benchmark suite containing cache filtered traces for use with Ramulator. These include some of the workloads used in our SIGMETRICS 2019 …☆19Updated 4 years ago
- Convert C files into Verilog☆16Updated 5 years ago
- Fast and Scalable Method for Distributed Boolean Tensor Factorization (ICDE'17 & VLDBJ'19)☆6Updated 5 years ago
- An Architecture-level Fault Injection Tool for GPU Application Resilience Evaluations☆16Updated 4 years ago
- ☆11Updated 5 months ago
- A Language for Closed-form High-level ARchitecture Modeling☆19Updated 4 years ago
- Streaming Message Interface: High-Performance Distributed Memory Programming on Reconfigurable Hardware☆16Updated 2 years ago
- HWASim is a simulator for heterogeneous systems with CPUs and Hardware Accelerators (HWAs). It is released with the DASH memory scheduler…☆17Updated 8 years ago
- ☆13Updated 9 years ago
- Benchmarks, testbenches, and transformed codes for high-level synthesis research☆13Updated 7 years ago
- A C++ Library for Influence Maximization☆31Updated 3 months ago
- Code released to accompany the ISCA paper: "T4: Compiling Sequential Code for Effective Speculative Parallelization in Hardware"☆27Updated 2 years ago
- G3: A Programmable GNN Training System on GPU☆42Updated 4 years ago
- A Distributed Multi-GPU System for Fast Graph Processing☆63Updated 6 years ago
- A package for constructing sparse tensors from CSV-like data sources.☆10Updated 6 years ago
- 📝 "Synthesizing Benchmarks for Predictive Modeling" (🥇 CGO'17 Best Paper)☆22Updated last year
- SMASH is a hardware-software cooperative mechanism that enables highly-efficient indexing and storage of sparse matrices. The key idea of…☆15Updated 4 years ago
- ☆11Updated 4 years ago
- SForum 2020 : "A Run-time Hardware Routing Implementation for CGRA Overlays" code and data.☆11Updated 4 years ago
- Approximation-Aware Functional Reverse Engineering using Graph Neural Networks☆9Updated 2 years ago