BrunoMeyer / gpu-rsfk
A GPU (CUDA) implementation, with a Python interface, of approximate KNN graph computation using the Random Sample Forest KNN algorithm.
☆12 · Updated this week
Alternatives and similar repositories for gpu-rsfk
Users who are interested in gpu-rsfk are comparing it to the libraries listed below.
- Artifacts for SOSP'19 paper Optimizing Deep Learning Computation with Automatic Generation of Graph Substitutions ☆21 · Updated 3 years ago
- Near-storage compute aware file system and FPGA operator pipelines. ☆29 · Updated 3 years ago
- ☆14 · Updated 5 years ago
- Public Release of Stream-Dataflow ☆14 · Updated 6 years ago
- Repository holding the code base to AC-SpGEMM : "Adaptive Sparse Matrix-Matrix Multiplication on the GPU" ☆30 · Updated 5 years ago
- A neural branch predictor tested using CPU emulator, testing both supervised learning and reinforcement learning (for COS 583: Great Mome… ☆15 · Updated 8 years ago
- 📝 "End-to-end Deep Learning of Optimization Heuristics" (🥇 PACT'17 Best Paper) ☆72 · Updated 2 years ago
- Arrow Matrix Decomposition - Communication-Efficient Distributed Sparse Matrix Multiplication ☆15 · Updated last year
- Convert C files into Verilog ☆19 · Updated 6 years ago
- A Dataflow library for graph analytics acceleration ☆14 · Updated 10 years ago
- A simulator of a memory controller designed for hybrid DRAM+NVM. ☆21 · Updated 9 years ago
- Part of paper: Massively Parallel Combinational Binary Neural Networks for Edge Processing ☆12 · Updated 6 years ago
- ☆21 · Updated 3 years ago
- Chameleon: Adaptive Code Optimization for Expedited Deep Neural Network Compilation ☆27 · Updated 6 years ago
- FPGA 2025 SAT Accel: A modern SAT Solver on FPGA Repository ☆13 · Updated 9 months ago
- ☆22 · Updated 9 months ago
- SmartNIC ☆14 · Updated 7 years ago
- ☆13 · Updated 10 years ago
- This is a repo which contains some details about how to use OpenCL backend (Xilinx/Intel). ☆25 · Updated 6 years ago
- GraphZoom: A Multi-level Spectral Approach for Accurate and Scalable Graph Embedding (ICLR'20 Oral) ☆114 · Updated 2 years ago
- [CF ’20] Verified Instruction-Level Energy Consumption Measurement for NVIDIA GPUs ☆15 · Updated 5 years ago
- ☆31 · Updated 5 years ago
- Black-box Optimizer based on Bayesian Optimization ☆159 · Updated last year
- An efficient concurrent graph processing system ☆46 · Updated 4 years ago
- ☆11 · Updated 5 months ago
- BiSUNA framework specialized to compile for the Xilinx Alveo U50 ☆13 · Updated 5 years ago
- A source-to-source compiler for optimizing CUDA dynamic parallelism by aggregating launches ☆15 · Updated 6 years ago
- Code released to accompany the ISCA paper: "T4: Compiling Sequential Code for Effective Speculative Parallelization in Hardware" ☆28 · Updated 3 years ago
- ETHZ Heterogeneous Accelerated Compute Cluster. ☆38 · Updated 2 months ago
- ☆14 · Updated last month