Cooperative Primitives for CUDA C++ Kernel Authors. This repository contains CUB PRs from Q4 2019 until Q4 2020.
☆22Sep 23, 2020Updated 5 years ago
Alternatives and similar repositories for cub_historical_2019_2020
Users that are interested in cub_historical_2019_2020 are comparing it to the libraries listed below
Sorting:
- ☆27Dec 20, 2023Updated 2 years ago
- ☆27Updated this week
- Retargetable ML compilers for the twenty-first century!☆13Apr 22, 2025Updated 10 months ago
- Goal: a website to automatically train and certify compiler researchers and developers☆10Nov 24, 2019Updated 6 years ago
- Presentations, Videos, and Sample Source from Austin LLVM Meetups☆11Jul 23, 2020Updated 5 years ago
- A retargetable and extensible synthesis-based compiler for modern hardware architectures☆17Nov 20, 2025Updated 3 months ago
- a repo for how to set up xubuntu like me☆29Aug 5, 2023Updated 2 years ago
- CUDA executors☆14Dec 4, 2020Updated 5 years ago
- A System for Differential Debugging☆23Apr 10, 2025Updated 10 months ago
- CaPI: Compiler-assisted Performance Instrumentation☆18Mar 1, 2026Updated last week
- Examples and presentation for Pacific++/MeetingC++ talk "Benchmarking C++. From video games to algorithmic trading"☆17Oct 4, 2020Updated 5 years ago
- CacheFlow is a Linux kernel module that exposes the contents of the last-level cache on *most* ARM machines.☆17Jun 19, 2024Updated last year
- ☆19Oct 14, 2018Updated 7 years ago
- outline and links for PLDI 2022 tutorial☆17Jun 13, 2022Updated 3 years ago
- Spatial layout specifications for memory management systems.☆19Sep 2, 2020Updated 5 years ago
- Code for Spatial Semantic Embedding Network:Fast 3D Instance Segmentation with Deep Metric Learning☆42Oct 3, 2023Updated 2 years ago
- Library to interface Compilers and ML models for ML-Enabled Compiler Optimizations☆20Oct 19, 2025Updated 4 months ago
- ☆60Dec 9, 2025Updated 3 months ago
- A Collection of High Performance Parallel Skeletons for Tree Search Problems☆23Dec 5, 2025Updated 3 months ago
- Terminating is exciting☆22Nov 3, 2016Updated 9 years ago
- Experimental patches to implement missing C++20 modules features for the clang/LLVM toolchain.☆23Feb 16, 2022Updated 4 years ago
- An OpenMP runtime implemented using HPX☆24Aug 4, 2022Updated 3 years ago
- Rutgers APL correctly rounded math library☆32Mar 11, 2021Updated 4 years ago
- Statistics on GPUs☆33Sep 8, 2025Updated 6 months ago
- Project planning for the C++ Library Evolution Working Group☆99Sep 15, 2020Updated 5 years ago
- Clear My Record is a project to assist people the process of expunging their criminal convictions.☆10Nov 5, 2018Updated 7 years ago
- Slides of the Italian C++ Conference 2019☆21Jun 20, 2019Updated 6 years ago
- Criticality-aware Framework for Modeling Computer Performance☆33Dec 15, 2024Updated last year
- Examples of Automatic Differentiation (AD) in many different languages and systems☆27Jun 25, 2018Updated 7 years ago
- A determinizing tracer using Ptrace☆39Sep 20, 2020Updated 5 years ago
- ☆626Feb 20, 2026Updated 2 weeks ago
- Exercises for Learning MLIR (Originally written for PPoPP 2026)☆86Feb 5, 2026Updated last month
- ☆72Jun 23, 2020Updated 5 years ago
- Documenting Wasm SIMD performance☆37Jun 8, 2020Updated 5 years ago
- Department of Energy Standard Utility Library☆33Jan 30, 2026Updated last month
- pika is a C++ tasking library built on std::execution with fibers, CUDA, HIP, and MPI support.☆82Updated this week
- An Asynchronous Distributed C++ Array Processing Toolkit☆75Apr 6, 2022Updated 3 years ago
- Measures high-level timing and memory usage metrics during compilation☆77May 11, 2021Updated 4 years ago
- ☆34Apr 3, 2023Updated 2 years ago