krulis-martin / cuda-kmeansView external linksLinks
A novell, highly-optimized CUDA implementation of k-means algorithm.
☆42Mar 3, 2022Updated 3 years ago
Alternatives and similar repositories for cuda-kmeans
Users that are interested in cuda-kmeans are comparing it to the libraries listed below
Sorting:
- Locality sensitive hash functions for Tensorflow 2.0.☆12Feb 18, 2022Updated 3 years ago
- sgx-based encrypted deduplication prototype☆14May 14, 2021Updated 4 years ago
- A GPU (CUDA) implementation, with a python interface, of the approximated KNN graph computation with Random Sample Forest algorithm KNN.☆12Feb 2, 2026Updated 2 weeks ago
- Experimental plugin for scikit-learn to be able to run (some estimators) on Intel GPUs via numba-dpex.☆16Feb 28, 2024Updated last year
- My tests and experiments with some popular dl frameworks.☆17Sep 11, 2025Updated 5 months ago
- TiledLower is a Dataflow Analysis and Codegen Framework written in Rust.☆14Nov 23, 2024Updated last year
- Noisy language compiler☆17Jul 31, 2024Updated last year
- Cylindrical Shape Decomposition☆16Dec 8, 2022Updated 3 years ago
- TiledKernel is a code generation library based on macro kernels and memory hierarchy graph data structure.☆19May 12, 2024Updated last year
- Sparse-dense matrix-matrix multiplication on GPUs☆14Oct 15, 2018Updated 7 years ago
- ☆21Jun 24, 2021Updated 4 years ago
- ICML2017 MEC: Memory-efficient Convolution for Deep Neural Network C++实现(非官方)☆17Apr 9, 2019Updated 6 years ago
- A GPU FP32 computation method with Tensor Cores.☆26Dec 8, 2025Updated 2 months ago
- End to End steps for adding custom ops in PyTorch.☆24Aug 20, 2020Updated 5 years ago
- ☆24May 6, 2022Updated 3 years ago
- ☆27Mar 2, 2023Updated 2 years ago
- ☆42Nov 1, 2025Updated 3 months ago
- Matrix multiplication on GPUs for matrices stored on a CPU. Similar to cublasXt, but ported to both NVIDIA and AMD GPUs.☆32Apr 2, 2025Updated 10 months ago
- An Agile Chisel-Based SoC Design Framework☆26Dec 29, 2021Updated 4 years ago
- Repository holding the code base to AC-SpGEMM : "Adaptive Sparse Matrix-Matrix Multiplication on the GPU"☆31Jul 7, 2020Updated 5 years ago
- [Medical Image Analysis] Residual Aligner-based Network (RAN): Motion-Aware Structure for Coarse-to-fine Discontinuous Deformable Registr…☆33Updated this week
- ☆32Sep 9, 2017Updated 8 years ago
- SimplePIM is the first high-level programming framework for real-world processing-in-memory (PIM) architectures. Described in the PACT 20…☆31Oct 23, 2023Updated 2 years ago
- Analysis for the traces from byteprofile☆32Nov 21, 2023Updated 2 years ago
- Transformers components but in Triton☆34May 9, 2025Updated 9 months ago
- Hop-Wise Graph Attention for Scalable and Generalizable Learning on Circuits☆35Aug 25, 2024Updated last year
- A GPU-based LZSS compression algorithm, highly tuned for NVIDIA GPGPUs and for streaming data, leveraging the respective strengths of CPU…☆38Dec 10, 2015Updated 10 years ago
- A Python script to help solve the 0.2 BTC Puzzle by testing custom identified seed words against a target address☆12Aug 26, 2024Updated last year
- This is a game interface called the doudizhu by Qt,and I only imitated the interface simply.The object has thr function of random license…☆12Sep 6, 2018Updated 7 years ago
- ☆20Oct 14, 2025Updated 4 months ago
- PTX-EMU is a simple emulator for CUDA program.☆37Apr 25, 2025Updated 9 months ago
- Attention in SRAM on Tenstorrent Grayskull☆40Jul 18, 2024Updated last year
- An ITK module to compute 3D thickness☆42Nov 12, 2025Updated 3 months ago
- Line Follower Robot Code With PID to control error position. Only P and D actived☆10Jan 31, 2021Updated 5 years ago
- FPGA Based GPS Synchronized Clock☆10May 7, 2021Updated 4 years ago
- A simple guide and tutorial on using TensorRT for accelerating a simple Multi-Layer Perceptron (MLP). This repository includes step-by-st…☆10Jan 29, 2025Updated last year
- ☆12May 20, 2019Updated 6 years ago
- LITS: An Optimized Learned Index for Strings☆13Jun 18, 2025Updated 7 months ago
- WeChat Pay sdk in python☆12Mar 9, 2018Updated 7 years ago