kisupov / grnsLinks
Computations in residue number system using CUDA-enabled GPUs
☆14Updated 4 years ago
Alternatives and similar repositories for grns
Users that are interested in grns are comparing it to the libraries listed below
Sorting:
- Multiple-precision GPU accelerated linear algebra routines (dense and sparse) based on residue number system☆20Updated 2 years ago
- ☆12Updated 3 years ago
- Python tools for NVIDIA Profiler☆21Updated 7 years ago
- ☆16Updated 6 years ago
- A basic implementation of the Small Primes Number-Theoretic Transform (NTT) multiplication algorithm.☆24Updated 7 years ago
- CUDA accelerated(X) Multi-Precision library☆91Updated 8 years ago
- Polyite: Iterative Schedule Optimization for Parallelization in the Polyhedron Model☆12Updated 5 years ago
- A enumerator for MLIR, relying on the information given by IRDL.☆19Updated this week
- GEMM and Winograd based convolutions using CUTLASS☆26Updated 5 years ago
- TP-PARSEC: A Task Parallel PARSEC Benchmark Suite☆10Updated 4 years ago
- Haystack is an analytical cache model that given a program computes the number of cache misses.☆46Updated 6 years ago
- A low-level intermediate representation for hardware description languages☆28Updated 5 years ago
- Data Dependence Analyzer in the Polyhedral Model☆20Updated last year
- Comparison of leading error-correcting code implementations☆12Updated 2 years ago
- ☆13Updated 4 years ago
- Arpra is a C library for analyzing the propagation of numerical error in arbitrary precision IEEE-754 floating-point computations.☆25Updated 2 years ago
- LLVM-Canon aims to transform LLVM modules into a canonical form by reordering and renaming instructions while preserving the same semanti…☆15Updated last year
- Code templates to get started experimenting with the RISC-V LLVM toolchain☆14Updated 6 years ago
- Library to plot integer sets and maps☆49Updated 8 years ago
- Chunky Loop Interaction☆24Updated 5 years ago
- Goal: a website to automatically train and certify compiler researchers and developers☆10Updated 5 years ago
- [CF ’20] Verified Instruction-Level Energy Consumption Measurement for NVIDIA GPUs☆15Updated 4 years ago
- NeuroVectorizer is a framework that uses deep reinforcement learning (RL) to predict optimal vectorization compiler pragmas for for loops…☆94Updated 2 years ago
- ☆23Updated last week
- Torch Frontend for IREE☆25Updated last year
- Declarative MLIR compilers in Python!☆35Updated 4 years ago
- Embedded Universal DSL: a good DSL for us, by us☆40Updated this week
- Cryptoleq: A Heterogeneous Abstract Machine for Encrypted and Unencrypted Computation.☆30Updated 10 months ago
- A Parallelism Profiler with What-If analyses for Intel Threading Building Blocks (TBB) programs☆13Updated 7 years ago
- A lightweight MLIR Python frontend with support for PyTorch☆23Updated 10 months ago