b0nes164 / GPUPrefixSumsLinks
A nearly complete collection of prefix sum algorithms implemented in CUDA, D3D12, Unity and WGPU. Theoretically portable to all wave/warp/subgroup sizes.
☆262Updated 8 months ago
Alternatives and similar repositories for GPUPrefixSums
Users that are interested in GPUPrefixSums are comparing it to the libraries listed below
Sorting:
- State of the art sorting and segmented sorting, including OneSweep. Implemented in CUDA, D3D12, and Unity style compute shaders. Theoreti…☆387Updated 10 months ago
- Sample benchmark demonstrating the VK_KHR_cooperative_matrix extension☆98Updated 4 months ago
- A possible use of Slang shader compiler together with WebGPU in C++ (both in native and Web contexts), using CMake.☆75Updated 10 months ago
- Spatially Hashed Radiance Cache (SHaRC) Library☆88Updated 2 weeks ago
- FidelityFX Parallel Sort☆112Updated 4 years ago
- ☆124Updated 2 months ago
- Neural Network in Dx12/HLSL Shaders☆109Updated 5 months ago
- An implementation of NVIDIA's paper "Efficient Incoherent Ray Traversal on GPUs Through Compressed Wide BVHs"☆124Updated 10 months ago
- Source Code for Eurographics 2024 Short Paper "Real-time Seamless Object Space Shading"☆79Updated last year
- The code to accompany "Constant Time Stateless Shuffling and Grouping"☆46Updated 2 years ago
- One stop shop for getting started with SPIR-V.☆218Updated last week
- Code accompanying the blog post on bvh construction.☆432Updated last year
- A micro Vulkan compute pipeline and a collection of benchmarking compute shaders☆252Updated 6 months ago
- Demo project for the large scale game component with CBTs☆164Updated last year
- ☆77Updated 3 years ago
- continuous level of detail mesh library☆309Updated 3 weeks ago
- HLSL code for https://developer.nvidia.com/blog/optimizing-compute-shaders-for-l2-locality-using-thread-group-id-swizzling/☆66Updated last year
- GPU Ray Tracing Library☆85Updated 5 months ago
- Collection of meshlet generation algorithms☆119Updated 7 months ago
- A compute shader implementation of the OneSweep sorting algorithm.☆69Updated last year
- Samplin' Safari is a research tool to visualize and interactively inspect high-dimensional (quasi) Monte Carlo samplers.☆161Updated last month
- Vulkan sample on VK_EXT_device_generated_commands and NV extension☆44Updated 7 months ago
- SH for HLSL 2021☆234Updated 7 months ago
- LucidRaster: real-time GPU software rasterizer for exact OIT☆54Updated 7 months ago
- My master bibliography file with publications mostly in computer graphics, rendering, transport theory, and statistics.☆184Updated 3 weeks ago
- The Tauray renderer☆126Updated 6 months ago
- 💡 Experimental real-time global illumination renderer 🦀☆75Updated last year
- ☆150Updated 8 months ago
- nanothread — Minimal thread pool for task parallelism☆85Updated last month
- A DirectX12-based C++-application that allows graphics programmers to learn and experiment with the new Work Graphs feature using HLSL sh…☆115Updated 3 months ago