kisupov / grnsLinks

Computations in residue number system using CUDA-enabled GPUs

☆14

Alternatives and similar repositories for grns

Users that are interested in grns are comparing it to the libraries listed below

Sorting:

kisupov / mpres-blas
Multiple-precision GPU accelerated linear algebra routines (dense and sparse) based on residue number system
☆20Updated 2 years ago
benlwk / Tensorcrypto
☆12Updated 3 years ago
rossumai / nvprof-tools
Python tools for NVIDIA Profiler
☆21Updated 7 years ago
canalcache / canal
☆16Updated 6 years ago
Mysticial / ProtoNTT
A basic implementation of the Small Primes Number-Theoretic Transform (NTT) multiplication algorithm.
☆24Updated 7 years ago
NVlabs / xmp
CUDA accelerated(X) Multi-Precision library
☆91Updated 8 years ago
stganser / polyite
Polyite: Iterative Schedule Optimization for Parallelization in the Polyhedron Model
☆12Updated 5 years ago
opencompl / mlir-fuzz
A enumerator for MLIR, relying on the information given by IRDL.
☆19Updated this week
YashasSamaga / ConvolutionBuildingBlocks
GEMM and Winograd based convolutions using CUTLASS
☆26Updated 5 years ago
massivethreads / tp-parsec
TP-PARSEC: A Task Parallel PARSEC Benchmark Suite
☆10Updated 4 years ago
spcl / haystack
Haystack is an analytical cache model that given a program computes the number of cache misses.
☆46Updated 6 years ago
maerhart / llhd
A low-level intermediate representation for hardware description languages
☆28Updated 5 years ago
periscop / candl
Data Dependence Analyzer in the Polyhedral Model
☆20Updated last year
Bulat-Ziganshin / ECC-Benchmark
Comparison of leading error-correcting code implementations
☆12Updated 2 years ago
CoffeeBeforeArch / nvbit_tools
☆13Updated 4 years ago
arpra-project / arpra
Arpra is a C library for analyzing the propagation of numerical error in arbitrary precision IEEE-754 floating-point computations.
☆25Updated 2 years ago
michalpaszkowski / LLVM-Canon
LLVM-Canon aims to transform LLVM modules into a canonical form by reordering and renaming instructions while preserving the same semanti…
☆15Updated last year
Lichtso / riscv-llvm-templates
Code templates to get started experimenting with the RISC-V LLVM toolchain
☆14Updated 6 years ago
tobiasgrosser / islplot
Library to plot integer sets and maps
☆49Updated 8 years ago
ftynse / clint
Chunky Loop Interaction
☆24Updated 5 years ago
chunhualiao / freeCompilerCamp
Goal: a website to automatically train and certify compiler researchers and developers
☆10Updated 5 years ago
NMSU-PEARL / GPUs-Energy
[CF ’20] Verified Instruction-Level Energy Consumption Measurement for NVIDIA GPUs
☆15Updated 4 years ago
intel / neuro-vectorizer
NeuroVectorizer is a framework that uses deep reinforcement learning (RL) to predict optimal vectorization compiler pragmas for for loops…
☆94Updated 2 years ago
makslevental / mlir-wheels
☆23Updated last week
iree-org / iree-torch
Torch Frontend for IREE
☆25Updated last year
Mogball / declarative-mlir-compiler
Declarative MLIR compilers in Python!
☆35Updated 4 years ago
llvm / eudsl
Embedded Universal DSL: a good DSL for us, by us
☆40Updated this week
momalab / cryptoleq
Cryptoleq: A Heterogeneous Abstract Machine for Encrypted and Unencrypted Computation.
☆30Updated 10 months ago
rutgers-apl / TaskProf
A Parallelism Profiler with What-If analyses for Intel Threading Building Blocks (TBB) programs
☆13Updated 7 years ago
nod-ai / PI
A lightweight MLIR Python frontend with support for PyTorch
☆23Updated 10 months ago