BrunoMeyer / gpu-rsfkLinks
A GPU (CUDA) implementation, with a python interface, of the approximated KNN graph computation with Random Sample Forest algorithm KNN.
☆12Updated 8 months ago
Alternatives and similar repositories for gpu-rsfk
Users that are interested in gpu-rsfk are comparing it to the libraries listed below
Sorting:
- Public Release of Stream-Dataflow☆14Updated 6 years ago
- Artifacts for SOSP'19 paper Optimizing Deep Learning Computation with Automatic Generation of Graph Substitutions☆21Updated 3 years ago
- ColTraIn HBFP Training Emulator☆16Updated 2 years ago
- A Dataflow library for graph analytics acceleration☆14Updated 9 years ago
- This is a repo which contains some details about how to use OpenCL backend (Xilinx/Intel).☆25Updated 5 years ago
- Implementations of various parallel algorithms for matrix factorization (including DSGD++)☆15Updated 8 years ago
- HWASim is a simulator for heterogeneous systems with CPUs and Hardware Accelerators (HWAs). It is released with the DASH memory scheduler…☆19Updated 9 years ago
- BiSUNA framework specialized to compile for the Xilinx Alveo U50☆12Updated 4 years ago
- Near-storage compute aware file system and FPGA operator pipelines.☆29Updated 3 years ago
- ☆13Updated 10 years ago
- Building KNN Graph for Billion High Dimensional Vectors Efficiently☆21Updated 6 years ago
- Code released to accompany the ISCA paper: "T4: Compiling Sequential Code for Effective Speculative Parallelization in Hardware"☆29Updated 3 years ago
- ☆21Updated 2 years ago
- Part of paper: Massively Parallel Combinational Binary Neural Networks for Edge Processing☆12Updated 6 years ago
- ☆14Updated 5 years ago
- ☆22Updated 6 months ago
- ETHZ Heterogeneous Accelerated Compute Cluster.☆36Updated 5 months ago
- A fast implementation of spectral clustering on GPU-CPU Platform☆32Updated 7 years ago
- Multi-armed bandit algorithm with tensorflow and 11 policies☆15Updated 2 years ago
- Xilinx Alveo Graph Analytics Product repository☆14Updated 3 years ago
- DATuner Repository☆18Updated 6 years ago
- Chameleon: Adaptive Code Optimization for Expedited Deep Neural Network Compilation☆27Updated 5 years ago
- DAC'22 paper: "Automated Accelerator Optimization Aided by Graph Neural Networks"☆40Updated last year
- A Language for Closed-form High-level ARchitecture Modeling☆21Updated 5 years ago
- graph-based substructure pattern mining algorithm (authors: Xifeng Yan, Jiawei Han) implementation☆11Updated 8 years ago
- Modified version of PyTorch able to work with changes to GPGPU-Sim☆56Updated 2 years ago
- An implementation of a BinaryConnect network for cifar10☆11Updated 5 years ago
- Convert C files into Verilog☆17Updated 6 years ago
- Planetoid datasets. Consist of Cora, Pubmed, Citeseer, Large_Cora, nell.0.1, nell.0.01, nell.0.001.☆13Updated 5 years ago
- Benchmark for matrix multiplications between dense and block sparse (BSR) matrix in TVM, blocksparse (Gray et al.) and cuSparse.☆23Updated 5 years ago