Efficient CUDA Stream Compaction Library
☆35Jun 9, 2023Updated 2 years ago
Alternatives and similar repositories for cuStreamComp
Users that are interested in cuStreamComp are comparing it to the libraries listed below.
- Experiments evaluating preemption on the NVIDIA Pascal architecture☆16Nov 10, 2016Updated 9 years ago
- MIT-licensed stand-alone CUDA utility functions.☆16Jul 3, 2020Updated 5 years ago
- An Open Source Kepler GPU Assembler☆21Jan 23, 2017Updated 9 years ago
- TLB Benchmarks☆35Sep 11, 2017Updated 8 years ago
- ☆27Oct 26, 2019Updated 6 years ago
- simd enabled column imprints☆11Feb 12, 2018Updated 8 years ago
- assembler for NVIDIA FERMI. Imported from Google Code☆74Mar 22, 2015Updated 11 years ago
- Use tensor core to calculate back-to-back HGEMM (half-precision general matrix multiplication) with MMA PTX instruction.☆13Nov 3, 2023Updated 2 years ago
- Simple CTC implementation for PyTorch☆14Oct 25, 2017Updated 8 years ago
- Python bindings for NVTX☆66Jun 9, 2023Updated 2 years ago
- Full-speed Array of Structures access☆177Apr 25, 2023Updated 2 years ago
- Third party assembler and GEMM library for NVIDIA Kepler GPU☆84Oct 8, 2019Updated 6 years ago
- A library with space-filling curve algorithms (analysis, neighbor-finding, visualization) and other utilities (math, geometry, image proc…☆25Oct 18, 2017Updated 8 years ago
- Transform geometry positions with a 4x4 transformation matrix.☆13Dec 27, 2015Updated 10 years ago
- Panther is an open-source, highly efficient text editor written from scratch in C++.☆16Jan 25, 2018Updated 8 years ago
- Full and flexible code to simulate several Markowitz Portfolios using R and free stock market data.☆13Nov 22, 2020Updated 5 years ago
- Applies one iteration of Loop's algorithm to a triangular mesh☆12Mar 12, 2015Updated 11 years ago
- Query engine synthesizer based on, our domain-specific language, VOILA☆13Mar 2, 2021Updated 5 years ago
- Researching the forward-backward algorithm☆11Aug 3, 2018Updated 7 years ago
- A Free-Software JavaScript Library made by people for the people!☆10Aug 1, 2020Updated 5 years ago
- ☆111Apr 19, 2024Updated last year
- Convert CUDA programs from float data type to half or half2 with SIMDization☆20May 28, 2019Updated 6 years ago
- a quick primer on making prettier (and more impactful) plots☆14Sep 27, 2015Updated 10 years ago
- ☆18Mar 12, 2025Updated last year
- ☆55Feb 5, 2026Updated last month
- Hackathon project for Snarky workshop.☆11Jun 21, 2019Updated 6 years ago
- A utility for mesh simplification☆15Apr 27, 2015Updated 10 years ago
- A package in C++ for character or word ngram analysis. It uses Ternary Search Tree instead of hashing table for faster ngram frequency co…☆20May 11, 2015Updated 10 years ago
- Musical Gestures Toolbox for Matlab☆10Dec 21, 2020Updated 5 years ago
- An alternative wrapper for orbit-camera that works independently of game-shell.☆21Apr 18, 2018Updated 7 years ago
- Assembler for NVIDIA Volta and Turing GPUs☆239Jan 13, 2022Updated 4 years ago
- Wrapper of the OpenSSL elliptic curve functions for easy Python manipulation☆11Apr 30, 2014Updated 11 years ago
- A code sample demonstrating how to share and rebuild a PyTorch GPU tensor via its pointer/reference between different processes.☆16Aug 27, 2024Updated last year
- CUDA Data Parallel Primitives Library☆438Nov 9, 2018Updated 7 years ago
- Implementations of compression schemes for numeric data from mass spectrometers.☆13Mar 3, 2021Updated 5 years ago
- A simple sparse bitmap implementation in java☆22Jan 28, 2016Updated 10 years ago
- [WIP] Fill an n-dimensional array by interpolating functions that define the boundaries☆11Feb 24, 2019Updated 7 years ago
- Repository holding the code base to AC-SpGEMM : "Adaptive Sparse Matrix-Matrix Multiplication on the GPU"☆31Jul 7, 2020Updated 5 years ago
- Allocation benchmarks☆31Jul 6, 2016Updated 9 years ago