NVIDIA / Forma
DSL for stencils and image processing
☆13Updated 8 years ago
Alternatives and similar repositories for Forma:
Users that are interested in Forma are comparing it to the libraries listed below
- Benchmark supporting baseless libel against clang-format☆11Updated 5 years ago
- Information about AVX-512 support on recent Intel processors☆45Updated 3 years ago
- Mirror kept for legacy. Moved to https://github.com/llvm/llvm-project☆25Updated 5 years ago
- code for examining determinism of performance counters☆21Updated 4 years ago
- Python bindings for libNVVM☆37Updated 11 years ago
- Ninja-based configuration system☆11Updated 5 years ago
- Mirror kept for legacy. Moved to https://github.com/llvm/llvm-project☆34Updated 5 years ago
- Asynchronous Task and Memory Interface, or ATMI, is a runtime framework and programming model for heterogeneous CPU-GPU systems. It provi…☆66Updated last year
- Sheriff consists of two tools: Sheriff-Detect, a false-sharing detector, and Sheriff-Protect, a false-sharing eliminator that you can lin…☆32Updated 6 years ago
- Predator: Predictive False Sharing Detection☆21Updated 10 years ago
- A portable high-level API with CUDA or OpenCL back-end☆54Updated 7 years ago
- Mirror kept for legacy. Moved to https://github.com/llvm/llvm-project☆25Updated 6 years ago
- This repository contains my experiments with compression-related algorithms☆35Updated 8 years ago
- A utility to dump GPU's property☆6Updated 10 years ago
- ☆75Updated last year
- Library with JIT (Just-in-time) compilation support to optimize performance of small and medium matrix multiplication☆14Updated 3 years ago
- A small little tool for dumping a floating-point number in its native format☆55Updated 9 years ago
- reverse engineering branch predictors☆17Updated 9 years ago
- Quick experiment to see how expensive safety is in C, for research☆12Updated 6 years ago
- Support for ternary logic in SSE, XOP, AVX2 and x86 programs☆31Updated 3 months ago
- Floating-Point Scalar Evolution☆12Updated 5 years ago
- Compute applications.☆24Updated 5 years ago
- Extended Roofline Model - LLVM source tree with additional libraries for the analysis of the dynamic execution in the interpreter☆17Updated 7 years ago
- C for Media Runtime☆24Updated 2 years ago
- Library to program with streams, events, and to queue own functions into a stream.☆16Updated 9 months ago
- A C/C++ task-based programming model for shared memory and distributed parallel computing.☆71Updated 4 years ago
- Vectorized intersections (research code)☆15Updated 8 years ago
- M:N fiber implementation, with transparent IO reactor and timeouts, POSIX like APIs.☆16Updated 12 years ago
- Generic C++11 PIMPL implementation☆11Updated 11 months ago
- Mirror kept for legacy. Moved to https://github.com/llvm/llvm-project☆17Updated 8 years ago