kisupov / grns
Computations in residue number system using CUDA-enabled GPUs
☆13 Updated 4 years ago
Alternatives and similar repositories for grns
Users who are interested in grns are comparing it to the libraries listed below.
- Multiple-precision GPU accelerated linear algebra routines (dense and sparse) based on residue number system ☆18 Updated 2 years ago
- ☆12 Updated 3 years ago
- TP-PARSEC: A Task Parallel PARSEC Benchmark Suite ☆10 Updated 4 years ago
- A basic implementation of the Small Primes Number-Theoretic Transform (NTT) multiplication algorithm. ☆24 Updated 7 years ago
- Haystack is an analytical cache model that given a program computes the number of cache misses. ☆46 Updated 5 years ago
- A low-level intermediate representation for hardware description languages ☆28 Updated 4 years ago
- GPTPU for SC 2021 ☆52 Updated 2 years ago
- A Parallelism Profiler with What-If analyses for Intel Threading Building Blocks (TBB) programs ☆13 Updated 7 years ago
- ☆16 Updated 6 years ago
- GPUVerify: a Verifier for GPU Kernels ☆62 Updated 2 years ago
- Polyhedral Compilation tool for High Level Synthesis. ☆10 Updated 11 years ago
- The CLooG Code Generator in the Polyhedral Model ☆47 Updated 2 years ago
- Productive and portable performance programming across spatial architectures (FPGAs, etc.) and vector architectures (GPUs, etc.) ☆31 Updated last year
- Chunky Loop Interaction ☆24 Updated 5 years ago
- Polyite: Iterative Schedule Optimization for Parallelization in the Polyhedron Model ☆12 Updated 5 years ago
- Embedded Universal DSL: a good DSL for us, by us ☆38 Updated this week
- A tracing JIT for PyTorch ☆17 Updated 2 years ago
- Torch Frontend for IREE ☆25 Updated last year
- FPGA acceleration of arbitrary precision floating point computations. ☆40 Updated 3 years ago
- CUDA accelerated(X) Multi-Precision library ☆90 Updated 8 years ago
- [CF ’20] Verified Instruction-Level Energy Consumption Measurement for NVIDIA GPUs ☆15 Updated 4 years ago
- Code templates to get started experimenting with the RISC-V LLVM toolchain ☆14 Updated 6 years ago
- Package for performing fixed-point, arbitrary-precision arithmetic in Python. ☆65 Updated last year
- A library for working with the posit number type. ☆15 Updated 4 years ago
- Comparison of leading error-correcting code implementations ☆12 Updated 2 years ago
- An 8-/16-/32-/64-bit floating point number family ☆17 Updated 3 years ago
- Optimized implementations of the Number Theoretic Transform (NTT) algorithm for the ring R/(X^N + 1) where N=2^m. ☆24 Updated 3 years ago
- Languages, Tools, and Techniques for Accelerator Design ☆33 Updated 3 years ago
- ☆21 Updated 3 years ago
- CUDAAdvisor: a GPU profiling tool ☆49 Updated 6 years ago