itzmeanjan / ff-gpu
Finite Field Operations on GPGPU
☆15 · Updated 2 years ago
Alternatives and similar repositories for ff-gpu
Users that are interested in ff-gpu are comparing it to the libraries listed below
- SST Macro Element Library ☆36 · Updated 3 months ago
- Home of ALP/GraphBLAS and ALP/Pregel, featuring shared- and distributed-memory auto-parallelisation of linear algebraic and vertex-centri… ☆28 · Updated this week
- DeepSZ: A Novel Framework to Compress Deep Neural Networks by Using Error-Bounded Lossy Compression ☆11 · Updated 4 years ago
- BLAS implementation for Intel FPGA ☆77 · Updated 4 years ago
- The Combinatorial BLAS (CombBLAS) is an extensible distributed-memory parallel graph library offering a small but powerful set of linear … ☆79 · Updated last month
- OpenSHMEM Application Programming Interface ☆58 · Updated 10 months ago
- Official BOLT Repository ☆31 · Updated last year
- The CLooG Code Generator in the Polyhedral Model ☆51 · Updated 2 years ago
- GPTPU for SC 2021 ☆52 · Updated 2 years ago
- ☆31 · Updated 3 years ago
- ☆13 · Updated 3 years ago
- CUDA accelerated(X) Multi-Precision library ☆92 · Updated 9 years ago
- Artifact for PPoPP 2018 paper "Making Pull-Based Graph Processing Performant" ☆23 · Updated 5 years ago
- Simplified Interface to Complex Memory ☆28 · Updated 2 years ago
- ☆40 · Updated last week
- CUDAAdvisor: a GPU profiling tool ☆50 · Updated 7 years ago
- A unified framework across multiple programming platforms ☆41 · Updated 3 months ago
- Multiple 1-stencil implementations using nvidia cuda. ☆13 · Updated 7 years ago
- Global Memory and Threading runtime system ☆25 · Updated last year
- Custom-Precision Floating-point numbers. ☆38 · Updated 8 months ago
- Sandia OpenSHMEM is an implementation of the OpenSHMEM specification over multiple Networking APIs, including Portals 4, the Open Fabric … ☆73 · Updated last month
- development repository for the open earth compiler ☆80 · Updated 4 years ago
- tools to create performance and roofline plots from measured data ☆59 · Updated 11 years ago
- CGBN: CUDA Accelerated Multiple Precision Arithmetic (Big Num) using Cooperative Groups ☆227 · Updated 7 months ago
- Intel’s HERACLES accelerator introduces a new set of fundamental instructions, the Polynomial Instructions Set Architecture (P-ISA) that … ☆45 · Updated this week
- Portals is a low-level network API for high-performance networking on high-performance computing systems developed by Sandia National Lab… ☆40 · Updated last year
- SparseP is the first open-source Sparse Matrix Vector Multiplication (SpMV) software package for real-world Processing-In-Memory (PIM) ar… ☆76 · Updated 3 years ago
- A GPU algorithm for sparse matrix-matrix multiplication ☆72 · Updated 4 years ago
- Languages, Tools, and Techniques for Accelerator Design ☆33 · Updated 3 years ago
- Performance Prediction Toolkit ☆53 · Updated 2 weeks ago