A novell, highly-optimized CUDA implementation of k-means algorithm.
☆41Mar 3, 2022Updated 4 years ago
Alternatives and similar repositories for cuda-kmeans
Users that are interested in cuda-kmeans are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Locality sensitive hash functions for Tensorflow 2.0.☆12Feb 18, 2022Updated 4 years ago
- Experimental plugin for scikit-learn to be able to run (some estimators) on Intel GPUs via numba-dpex.☆17Feb 28, 2024Updated 2 years ago
- CUDA implementation of k-means☆23Dec 22, 2013Updated 12 years ago
- ICML2017 MEC: Memory-efficient Convolution for Deep Neural Network C++实现(非官方)☆17Apr 9, 2019Updated 7 years ago
- Benchmark tests supporting the TiledCUDA library.☆18Nov 19, 2024Updated last year
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- sgx-based encrypted deduplication prototype☆13May 14, 2021Updated 4 years ago
- Medusa: Accelerating Serverless LLM Inference with Materialization [ASPLOS'25]☆12Nov 8, 2024Updated last year
- Playing Gather Town with Game Controller!☆11Aug 13, 2021Updated 4 years ago
- My tests and experiments with some popular dl frameworks.☆17Sep 11, 2025Updated 7 months ago
- ☆11Nov 14, 2023Updated 2 years ago
- Code in support of the paper Continuous Mixtures of Tractable Probabilistic Models☆12Oct 12, 2024Updated last year
- TiledLower is a Dataflow Analysis and Codegen Framework written in Rust.☆13Nov 23, 2024Updated last year
- A high-performance serving system for DeepRec based on TensorFlow Serving.☆20Nov 15, 2023Updated 2 years ago
- A GPU (CUDA) implementation, with a python interface, of the approximated KNN graph computation with Random Sample Forest algorithm KNN.☆12Feb 2, 2026Updated 2 months ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Not All Patches Are Equal: Hierarchical Dataset Condensation for Single Image Super-Resolution☆11May 7, 2024Updated last year
- A GPU FP32 computation method with Tensor Cores.☆26Dec 8, 2025Updated 4 months ago
- Repository for the COLM 2025 paper SpecDec++: Boosting Speculative Decoding via Adaptive Candidate Lengths☆18Jul 10, 2025Updated 9 months ago
- ☆13Sep 8, 2021Updated 4 years ago
- Noisy language compiler☆17Jul 31, 2024Updated last year
- Multimedia SoC Design with Specialization on Application Acceleration with High-Level-Synthesis [2020 Fall]☆12Jun 15, 2021Updated 4 years ago
- This is the python program which performs text summarization with pronoun replacement method. This method initially identifies pronouns i…☆10Dec 5, 2018Updated 7 years ago
- Selected problems and their solutions from the book on "Machine Intelligence in Design Automation"☆27Dec 9, 2018Updated 7 years ago
- MAC system with IEEE754 compatibility☆13Nov 22, 2023Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Nanoflann adaptor for n-dimensional search with Point Cloud Library (PCL) data types☆13Jan 22, 2024Updated 2 years ago
- Generalized enhanced suffix array construction in external memory [CPM'13, AMB 2017]☆17Aug 9, 2021Updated 4 years ago
- Induced Suffix Array and LCP construction based on the SAIS algorithm.☆11Dec 12, 2019Updated 6 years ago
- North Carolina State University: ECE 745 : Project: LC3 Microcontroller Functional Verification using SystemVerilog☆11Jun 5, 2017Updated 8 years ago
- ☆10Feb 17, 2017Updated 9 years ago
- Assignments of NTHU Course: Parallel Programming☆11Jan 14, 2019Updated 7 years ago
- ☆21Jun 24, 2021Updated 4 years ago
- Evaluating majors LLMs on the Abstraction and Reasoning Corpus☆17Nov 9, 2023Updated 2 years ago
- Collection of ROS 2 message definitions used throughout the implementation of micro-ROS, both in the agent and client endpoints.☆14Jan 19, 2026Updated 3 months ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- Machine Learning on Arduino's Repository☆11Sep 12, 2020Updated 5 years ago
- Parallel implementation of the Advanced Encryption Standard.☆10Nov 13, 2018Updated 7 years ago
- ☆15Nov 12, 2023Updated 2 years ago
- A small package for converting pcd file to octomap *.bt/*.ot file☆13Apr 23, 2020Updated 6 years ago
- A ROS package for pointcloud filtering, segmentation(cluster extraction), coarse registration and ICP registration, used in bin picking q…☆13Jul 6, 2020Updated 5 years ago
- Hybrid methods for Parallel Betweenness Centrality on the GPU☆24Dec 20, 2018Updated 7 years ago
- CANBus example using STM32 and libopencm3☆11Jul 14, 2019Updated 6 years ago