itzmeanjan / ff-gpu
Finite Field Operations on GPGPU
☆14Updated last year
Alternatives and similar repositories for ff-gpu:
Users that are interested in ff-gpu are comparing it to the libraries listed below
- A Benchmark Suite for Heterogeneous System Computation☆53Updated 3 months ago
- Loop Kernel Analysis and Performance Modeling Toolkit☆91Updated 4 months ago
- SST Macro Element Library☆35Updated 3 months ago
- ZFP Hardware Implementation☆13Updated 2 years ago
- A Method for efficiently processing SpMV using SIMD and load balancing☆16Updated 2 years ago
- XSBench: The Monte Carlo Macroscopic Cross Section Lookup Benchmark☆75Updated 10 months ago
- doppioDB - A hardware accelerated database☆48Updated 7 years ago
- Source code of the simulator used in the Mosaic paper from MICRO 2017: "Mosaic: A GPU Memory Manager with Application-Transparent Support…☆42Updated 6 years ago
- A networked FPGA key-value store written in Clash☆28Updated 10 months ago
- CUDAAdvisor: a GPU profiling tool☆48Updated 6 years ago
- Multiple 1-stencil implementations using nvidia cuda.☆13Updated 7 years ago
- The SparseX sparse kernel optimization library☆39Updated 6 years ago
- Machine Intelligence Shader Autogen. AMDGPU ML shader code generator. (previously iGEMMgen)☆34Updated 4 months ago
- Heterogeneous simulator for DECADES Project☆31Updated 8 months ago
- ☆40Updated last week
- OpenSHMEM Application Programming Interface☆52Updated 2 months ago
- ☆48Updated 5 years ago
- This package includes the implementation for four sparse linear algebra kernels: Sparse-Matrix-Vector-Multiplication (SpMV), Sparse-Trian…☆26Updated 4 years ago
- FPGA-based HyperLogLog Accelerator☆12Updated 4 years ago
- a clone of POCL that includes RISC-V newlib devices support and Vortex☆38Updated 7 months ago
- Performance Prediction Toolkit☆51Updated last month
- Benchmark suite containing cache filtered traces for use with Ramulator. These include some of the workloads used in our SIGMETRICS 2019 …☆19Updated 4 years ago
- SYCL Reference Manual☆27Updated 9 months ago
- DeepSZ: A Novel Framework to Compress Deep Neural Networks by Using Error-Bounded Lossy Compression☆11Updated 4 years ago
- The translator that supports translating NVPTX to SPIR-V. This translator is modified from LLVM-SPIR-V Translator.☆36Updated 3 years ago
- GPTPU for SC 2021☆51Updated last year
- HeteroSim is a full system simulator supporting x86 multicore processors combined with a FPGA via bus-based architecture. Flexible design…☆21Updated 8 years ago
- Meta-Repository for Bespoke Silicon Group's Manycore Architecture (A.K.A HammerBlade)☆38Updated last month
- A framework that support executing unmodified CUDA source code on non-NVIDIA devices.☆112Updated 3 weeks ago
- Decuda and cudasm, the CUDA binary utilities package. Low-level tools for NVidia G80 GPUs.☆97Updated 14 years ago