SGEMM and DGEMM subroutines using AVX512F instructions.
☆15May 22, 2022Updated 3 years ago
Alternatives and similar repositories for GEMM_AVX512F
Users who are interested in GEMM_AVX512F are comparing it to the libraries listed below.
- Stepwise optimizations of DGEMM on CPU, reaching performance faster than Intel MKL eventually, even under multithreading.☆163Feb 3, 2022Updated 4 years ago
- Snakemake workflow for modelling-to-generate-alternatives with PyPSA-Eur☆11Oct 26, 2020Updated 5 years ago
- TSDG: An efficient index graph for graph-based nearest neighbor search☆10Jul 14, 2022Updated 3 years ago
- pstore, a high-performance, read-optimized database system.☆26Nov 14, 2013Updated 12 years ago
- AI Hedge Fund Repo integrate with DeepSeek V3 and R1 hosted on SiliconFlow.☆12Feb 3, 2025Updated last year
- Packages CodeScene in one container, Nginx proxy with self-signed cert in another, and composes them to one service. Ideas from codeclou/…☆13Aug 1, 2021Updated 4 years ago
- Sparse symmetric indefinite solver implemented with a runtime system☆13May 11, 2020Updated 5 years ago
- Computing the Committor with the Committor: an Anatomy of the Transition State Ensemble☆12Jul 12, 2024Updated last year
- Parallel optimization algorithms for sparse matrix-vector multiplication (OpenMP, AVX)☆11Jul 7, 2021Updated 4 years ago
- Example datasets and dashboards known to work well in OmniSci☆15Sep 25, 2020Updated 5 years ago
- Universal event notification broker/manager☆13May 20, 2023Updated 2 years ago
- DFT-D3 interface☆12Apr 3, 2023Updated 2 years ago
- [DEPRECATED] Non-blocking TCP or Unix connect☆14May 1, 2021Updated 4 years ago
- ☆10Apr 9, 2021Updated 4 years ago
- ☆13Mar 27, 2019Updated 6 years ago
- Go port of wyhash v3☆14Nov 1, 2022Updated 3 years ago
- Bindings to BLAS (Fortran)☆12May 28, 2025Updated 9 months ago
- fanotify cron system☆21Sep 15, 2015Updated 10 years ago
- 🗄️ Networked in-memory key-value store.☆11Jan 2, 2018Updated 8 years ago
- ☆10Jun 5, 2018Updated 7 years ago
- Eth2 data availability sampling - Testground plan☆10Nov 14, 2020Updated 5 years ago
- Legate Hello World Pedagogical Library☆10Apr 5, 2023Updated 2 years ago
- TD-DMRG and VHCI package☆11Jul 24, 2025Updated 7 months ago
- Numpy like ndarray and dataframe library for nim-lang.☆13Aug 6, 2020Updated 5 years ago
- Dependency-free and header-only C++11 XGen XPD cache I/O library.☆13Jun 23, 2019Updated 6 years ago
- Async datagram traits☆11Aug 28, 2019Updated 6 years ago
- tiny fast portable real-time deep neural network for regression and classification within 50 LOC.☆52Apr 6, 2021Updated 4 years ago
- Database plugins☆13Updated this week
- CUDA C simple application for Nvidia's GPU☆11Jun 7, 2022Updated 3 years ago
- ☆13Aug 18, 2025Updated 6 months ago
- Source code of the IPDPS '21 paper: "TileSpMV: A Tiled Algorithm for Sparse Matrix-Vector Multiplication on GPUs" by Yuyao Niu, Zhengyang…☆12Aug 12, 2022Updated 3 years ago
- CRINN - Free & Fast Framework for Approximate Nearest Neighbors Search Via Contrastive Reinforcement Learning☆77Aug 5, 2025Updated 7 months ago
- Python numerical optimization toolbox☆12Nov 20, 2018Updated 7 years ago
- Async readiness traits☆11May 15, 2019Updated 6 years ago
- A simple and efficient memory pool implemented in C++11.☆10Jun 2, 2022Updated 3 years ago
- Anatomy of High-Performance GEMM with Online Fault Tolerance on GPUs☆13Apr 3, 2025Updated 11 months ago
- A tutorial/example of the Python C-API and integration with CUDA kernels.☆14Jul 7, 2019Updated 6 years ago
- Render glyphs from any font* without baking them in advance.☆10Sep 19, 2025Updated 5 months ago
- Pub/Sub engine☆14Sep 7, 2021Updated 4 years ago