An implementation of SGEMV with performance comparable to cuBLAS.
☆12May 21, 2021Updated 4 years ago
Alternatives and similar repositories for Optimizing-SGEMV-on-NVIDIA-GPUs
Users that are interested in Optimizing-SGEMV-on-NVIDIA-GPUs are comparing it to the libraries listed below
Sorting:
- Stepwise optimizations of DGEMM on CPU, reaching performance faster than Intel MKL eventually, even under multithreading.☆163Feb 3, 2022Updated 4 years ago
- A tool designed to compare energy and emission costs between computer chips☆13Dec 9, 2023Updated 2 years ago
- Bilinear Pairings Components Library for Delphi☆12Dec 19, 2018Updated 7 years ago
- Distributed Communication-Optimal Shuffle and Transpose Algorithm☆14Feb 20, 2026Updated 2 weeks ago
- Prototype of fraud proofs.☆12Feb 13, 2022Updated 4 years ago
- Benchmarks of all public available SNARK/STARK keccak circuits☆13Oct 1, 2023Updated 2 years ago
- Reconnaître la marque/modèle des véhicules dans une image☆11Mar 31, 2023Updated 2 years ago
- An open-source command line interface for linting your Ethereum 2.0 validator set up☆14May 17, 2021Updated 4 years ago
- 稀疏矩阵-向量乘的并行优化算法(OpenMP,AVX)☆11Jul 7, 2021Updated 4 years ago
- Implementation and analysis of five different GPU based SPMV algorithms in CUDA☆40Feb 5, 2019Updated 7 years ago
- Proof of concept code for VoteAgain paper☆10Jul 23, 2023Updated 2 years ago
- Formalizing Polynomial Commitment Schemes in the Interactive Theorem Prover Isabelle.☆10Jan 24, 2026Updated last month
- LLM RP TUI for Power Users.☆31Jan 13, 2026Updated last month
- Rust port of the streaming ddelta patch algorithm, based on bsdiff☆12May 2, 2024Updated last year
- A nim module to handle polynomials☆13Jun 7, 2022Updated 3 years ago
- A Python package to scrape flight data from Google Flights.☆15Apr 20, 2024Updated last year
- A test library for computing modular exponentiation in parallel using AVX-512 vector arithmetic☆12Dec 18, 2023Updated 2 years ago
- Distributed SDDMM Kernel☆12Jul 8, 2022Updated 3 years ago
- stateless model checking for thread libraries, kernels, and transactional memory☆10Dec 26, 2018Updated 7 years ago
- SGEMM and DGEMM subroutines using AVX512F instructions.☆15May 22, 2022Updated 3 years ago
- Exploration of primes, factorization and number theory through haskell☆10Oct 10, 2016Updated 9 years ago
- ☆11Nov 29, 2017Updated 8 years ago
- randomized SVD with single pass over data matrix☆10Apr 23, 2023Updated 2 years ago
- Clockwork: A Modular Arithmetic library for C++☆12Nov 18, 2025Updated 3 months ago
- Repository of papers released by Modulus Labs☆14Mar 13, 2024Updated last year
- negamax AI algorithm for turn-based games☆13Oct 6, 2019Updated 6 years ago
- ☆12Nov 23, 2020Updated 5 years ago
- Auto-built docker image with latest Nim devel version☆10Jul 21, 2023Updated 2 years ago
- ☆11Aug 11, 2024Updated last year
- Analog circuit simulation library; wrapper for ngspice☆10Aug 31, 2020Updated 5 years ago
- ☆12Apr 10, 2019Updated 6 years ago
- Prints a dot graph of a nim ast dumped using the `dumpTree` macro.☆13Sep 18, 2022Updated 3 years ago
- Dynamic analysis of multithreaded C programs☆13Feb 7, 2020Updated 6 years ago
- Comparison of leading error-correcting code implementations☆12Aug 19, 2022Updated 3 years ago
- [RFC9380] Hash to curves - Go reference implementation☆21Nov 20, 2025Updated 3 months ago
- Fast Bytecode Analysis☆15Jan 2, 2016Updated 10 years ago
- Site du programme Entrepreneurs d'Intérêt Général☆14Oct 6, 2022Updated 3 years ago
- ☆11Sep 10, 2024Updated last year
- DEPRECATED. This Scalapck repository is deprecated. The last version in this repository is 3.0. Refer to "aocl-scalapack" repository unde…☆10Mar 15, 2021Updated 4 years ago