jbarczak / HAXWellLinks
Code which loads custom ISA on Intel Haswell GPUs
☆47Updated 9 years ago
Alternatives and similar repositories for HAXWell
Users that are interested in HAXWell are comparing it to the libraries listed below
Sorting:
- Many functions in C for sorting the nibbles in an 8-byte word☆33Updated 10 years ago
- a simple SPIR-V parser☆26Updated 10 years ago
- Intriman is a documentation generator that retargets the Intel Intrinsics Guide to other documentation formats☆28Updated 3 years ago
- A header only Boolean Propagator Network framework for the omni-directional computation of Integer mathematical functions and computation…☆14Updated 7 years ago
- Unusual uses of SSE2 registers☆67Updated 5 years ago
- Sparse texture Hello World☆29Updated 10 years ago
- A tweaked version of Aha! ("A Hacker's Assistant") the superoptimiser by Henry S. Warren☆58Updated 3 years ago
- mini-LZ library☆34Updated 2 years ago
- Generate SSE expressions from prefix expressions☆20Updated 14 years ago
- Talvos is a dynamic-analysis framework and debugger for Vulkan/SPIR-V programs.☆74Updated 6 years ago
- TLSF: two-level segregated fit O(1) allocator☆80Updated 3 years ago
- SIMD macro assembler unified for ARM, MIPS, PPC and x86☆90Updated 9 months ago
- A C library for runtime-flippable feature flags on Linux/x86-64, with negligible overhead in the common case☆74Updated 2 years ago
- Very fast backtraces resolver☆33Updated 7 years ago
- Realtime raytracer using SIMD on ARM, MIPS, PPC and x86☆26Updated 7 months ago
- JIT Assembler Library for multiple ISAs☆76Updated 10 years ago
- Random Number Generator based on hardware-accelerated AES instructions☆60Updated 6 years ago
- "CF3" is a C compiler test suite targeting arithmetic optimization.☆37Updated 8 years ago
- LZ77/LZSS designed for SSE based decompression☆142Updated 6 years ago
- compile time assembly interpreter☆85Updated 7 years ago
- Public domain linear time distance field and Voronoi diagram on lattice grid☆55Updated 8 years ago
- a tool for querying Dwarf (debuginfo) graphs☆55Updated last year
- ZP7: Zach's Peppy Parallel-Prefix-Popcountin' PEXT/PDEP Polyfill☆55Updated last year
- LZSSE compression codec ported to SIMDe☆19Updated 5 years ago
- Branchless UTF-8 decoder☆35Updated 7 years ago
- A terminal viewer for x86 instruction/intrinsic information using Python 3 + curses☆128Updated 2 years ago
- SPMD in C++☆68Updated 5 years ago
- Reference manual for ForwardCom instruction set and software standards☆169Updated 7 months ago
- Marlin: high throughput entropy compressor☆30Updated 2 years ago
- Simple, single-file, nibble-based, adaptive rANS library with SSE2-accelerated modeling☆23Updated 6 years ago