jbarczak / HAXWell
Code which loads custom ISA on Intel Haswell GPUs
☆47Updated 8 years ago
Alternatives and similar repositories for HAXWell:
Users that are interested in HAXWell are comparing it to the libraries listed below
- a simple SPIR-V parser☆26Updated 9 years ago
- Tweaked version of "Aha" - "A Hacker's Assistant" superoptimiser by Henry S. Warren☆58Updated 2 years ago
- Unusual uses of SSE2 registers☆65Updated 4 years ago
- LZSSE compression codec ported to SIMDe☆19Updated 4 years ago
- Generate SSE expressions from prefix expressions☆20Updated 14 years ago
- Intriman is a documentation generator that retargets the Intel Intrinsics Guide to other documentation formats☆28Updated 2 years ago
- Sparse texture Hello World☆29Updated 9 years ago
- Talvos is a dynamic-analysis framework and debugger for Vulkan/SPIR-V programs.☆73Updated 5 years ago
- Many functions in C for sorting the nibbles in an 8-byte word☆33Updated 10 years ago
- mini-LZ library☆33Updated 2 years ago
- compact lzma decoder☆14Updated 5 years ago
- Branchless UTF-8 decoder☆33Updated 7 years ago
- Comparing linear and binary searches☆39Updated 4 years ago
- Realtime raytracer using SIMD on ARM, MIPS, PPC and x86☆26Updated last month
- JIT Assembler Library for multiple ISAs☆74Updated 10 years ago
- Random Number Generator based on hardware-accelerated AES instructions☆56Updated 5 years ago
- A collection of shader compiler bugs.☆49Updated 6 years ago
- SIMD macro assembler unified for ARM, MIPS, PPC and x86☆88Updated 3 months ago
- Documenting Wasm SIMD performance☆36Updated 4 years ago
- Pruning elements in SIMD vectors (i.e., packing left elements)☆64Updated last year
- A terminal viewer for x86 instruction/intrinsic information using Python 3 + curses☆128Updated 2 years ago
- an assembler/compiler for AMD’s GCN (Generation Core Next Architecture) Assembly Language☆39Updated 2 years ago
- Dissector GPU debugger☆25Updated 9 years ago
- Marlin: high throughput entropy compressor☆30Updated last year
- AVX optimized ray stream tracing☆19Updated 8 years ago
- TLSF: two-level segregated fit O(1) allocator☆77Updated 2 years ago
- A scoped stack allocator☆36Updated 5 years ago
- Generic SIMD intrinsic to allow for portable SIMD intrinsic programming☆42Updated 11 years ago
- LZ77/LZSS designed for SSE based decompression☆135Updated 5 years ago
- Very fast backtraces resolver☆33Updated 6 years ago