A novell, highly-optimized CUDA implementation of k-means algorithm.
☆42Mar 3, 2022Updated 4 years ago
Alternatives and similar repositories for cuda-kmeans
Users that are interested in cuda-kmeans are comparing it to the libraries listed below
Sorting:
- sgx-based encrypted deduplication prototype☆14May 14, 2021Updated 4 years ago
- Locality sensitive hash functions for Tensorflow 2.0.☆12Feb 18, 2022Updated 4 years ago
- Experimental plugin for scikit-learn to be able to run (some estimators) on Intel GPUs via numba-dpex.☆16Feb 28, 2024Updated 2 years ago
- A GPU (CUDA) implementation, with a python interface, of the approximated KNN graph computation with Random Sample Forest algorithm KNN.☆12Feb 2, 2026Updated last month
- Benchmark tests supporting the TiledCUDA library.☆18Nov 19, 2024Updated last year
- My tests and experiments with some popular dl frameworks.☆17Sep 11, 2025Updated 5 months ago
- TiledLower is a Dataflow Analysis and Codegen Framework written in Rust.☆14Nov 23, 2024Updated last year
- Noisy language compiler☆17Jul 31, 2024Updated last year
- Cylindrical Shape Decomposition☆16Dec 8, 2022Updated 3 years ago
- Sparse-dense matrix-matrix multiplication on GPUs☆14Oct 15, 2018Updated 7 years ago
- TiledKernel is a code generation library based on macro kernels and memory hierarchy graph data structure.☆19May 12, 2024Updated last year
- ICML2017 MEC: Memory-efficient Convolution for Deep Neural Network C++实现(非官方)☆17Apr 9, 2019Updated 6 years ago
- ☆21Jun 24, 2021Updated 4 years ago
- A GPU FP32 computation method with Tensor Cores.☆26Dec 8, 2025Updated 3 months ago
- End to End steps for adding custom ops in PyTorch.☆24Aug 20, 2020Updated 5 years ago
- ☆24May 6, 2022Updated 3 years ago
- ☆27Mar 2, 2023Updated 3 years ago
- ☆42Nov 1, 2025Updated 4 months ago
- CUDA implementation of k-means☆23Dec 22, 2013Updated 12 years ago
- Matrix multiplication on GPUs for matrices stored on a CPU. Similar to cublasXt, but ported to both NVIDIA and AMD GPUs.☆31Apr 2, 2025Updated 11 months ago
- A machine model for line-rate programmable switches☆26Oct 8, 2016Updated 9 years ago
- An Agile Chisel-Based SoC Design Framework☆26Dec 29, 2021Updated 4 years ago
- ☆32Sep 9, 2017Updated 8 years ago
- Repository holding the code base to AC-SpGEMM : "Adaptive Sparse Matrix-Matrix Multiplication on the GPU"☆31Jul 7, 2020Updated 5 years ago
- SimplePIM is the first high-level programming framework for real-world processing-in-memory (PIM) architectures. Described in the PACT 20…☆31Oct 23, 2023Updated 2 years ago
- Hop-Wise Graph Attention for Scalable and Generalizable Learning on Circuits☆35Aug 25, 2024Updated last year
- A GPU-based LZSS compression algorithm, highly tuned for NVIDIA GPGPUs and for streaming data, leveraging the respective strengths of CPU…☆38Dec 10, 2015Updated 10 years ago
- Kinematic and dynamic models of continuum and articulated soft robots.☆15Nov 22, 2025Updated 3 months ago
- This is a game interface called the doudizhu by Qt,and I only imitated the interface simply.The object has thr function of random license…☆12Sep 6, 2018Updated 7 years ago
- ☆21Oct 14, 2025Updated 4 months ago
- ☆10Oct 24, 2021Updated 4 years ago
- Attention in SRAM on Tenstorrent Grayskull☆40Jul 18, 2024Updated last year
- PTX-EMU is a simple emulator for CUDA program.☆38Apr 25, 2025Updated 10 months ago
- An ITK module to compute 3D thickness☆42Nov 12, 2025Updated 3 months ago
- ☄☄彗星密码本,基于Taro的微信小程序☆11Aug 18, 2021Updated 4 years ago
- lab solutions of ICS course☆10Jan 20, 2013Updated 13 years ago
- Line Follower Robot Code With PID to control error position. Only P and D actived☆10Jan 31, 2021Updated 5 years ago
- FPGA Based GPS Synchronized Clock☆10May 7, 2021Updated 4 years ago
- ☆12May 20, 2019Updated 6 years ago