CoffeeBeforeArch/mmul

Serial and parallel implementations of matrix multiplication

☆47

Alternatives and similar repositories for mmul

Users that are interested in mmul are comparing it to the libraries listed below.

Bruce-Lee-LY / memory_pool
Simple and efficient memory pool is implemented with C++11.
☆10 · Jun 2, 2022 · Updated 4 years ago
CoffeeBeforeArch / cache_simulator
A simple trace-based cache simulator
☆16 · Jan 3, 2025 · Updated last year
spcl / SMI
Streaming Message Interface: High-Performance Distributed Memory Programming on Reconfigurable Hardware
☆15 · Mar 1, 2022 · Updated 4 years ago
AcubeSAT / ccsds-space-data-link-protocols
Implementation of the CCSDS TM and TC standards for the AcubeSAT nanosatellite
☆17 · Dec 22, 2025 · Updated 7 months ago
ctuning / ck-request-asplos18-resnet-tvm-fpga
CK workflow, portable packages and other artifacts for the ReQuEST-ASPLOS'18 submission:
☆12 · Jan 16, 2019 · Updated 7 years ago
CoffeeBeforeArch / parallel_cpp
☆132 · Feb 17, 2023 · Updated 3 years ago
david-macmahon / hashpipe
High Availability Shared Pipeline Engine
☆17 · Sep 15, 2023 · Updated 2 years ago
MWATelescope / Birli
A Rust library for preprocessing tasks in the Murchison Widefield Array (MWA) data pipeline.
☆18 · Jul 15, 2026 · Updated last week
gawasa29 / flutter_bluesky_clone
bluesky clone built with Flutter using the bluesky package running on AT protocol
☆10 · Sep 9, 2023 · Updated 2 years ago
kcherenkov / Parallel-Programming-Labs
MPI and OpenMP Examples
☆20 · Sep 1, 2017 · Updated 8 years ago
prem30488 / C2CUDATranslator
Automatic Conversion of Source Code for C to CUDA C
☆23 · Apr 1, 2014 · Updated 12 years ago
codeplaysoftware / portFFT
portFFT is a library implementing Fast Fourier Transforms using SYCL
☆19 · Mar 1, 2025 · Updated last year
olcf-tutorials / local_mpi_to_gpu
How to use node-local MPI rank IDs to manually map MPI ranks to GPUs
☆14 · Apr 22, 2020 · Updated 6 years ago
Luca-Dalmasso / matrixTransposeCUDA
CUDA C simple application for Nvidia's GPU
☆11 · Jun 7, 2022 · Updated 4 years ago
IHPCSS / software-engineering
Software engineering for the IHPCSS Laplace code
☆19 · Jul 15, 2026 · Updated last week
matheusb432 / rust-uchat
'Build a Full-Stack Twitter Clone with Rust' course code and notes
☆14 · Aug 6, 2023 · Updated 2 years ago
philipdaquin / Twitter-Clone-WASM
📖 Twitter- React TS, Apollo Federation, Async GraphQL, Actix Web framework, Postgres SQL, Docker, Docker Compose, Redis, Apache Kafka , …
☆15 · Aug 15, 2023 · Updated 2 years ago
wjc404 / GEMM_AVX512F
SGEMM and DGEMM subroutines using AVX512F instructions.
☆15 · May 22, 2022 · Updated 4 years ago
marcsous / gpuSparse
Matlab mex wrappers to cuSPARSE (NVIDIA)
☆11 · Dec 10, 2025 · Updated 7 months ago
eljost / sympleints
Molecular integrals over Gaussian basis functions using sympy.
☆16 · Oct 2, 2024 · Updated last year
jsoneaday / build-twitter-api-clone-actix
☆12 · Jul 2, 2023 · Updated 3 years ago
MoonKraken / hotblog
A simple blogging web application built with the Leptos framework
☆14 · Sep 18, 2024 · Updated last year
buzh / slop
A `top`-like utility for the Slurm HPC batch job scheduler
☆15 · Jun 9, 2026 · Updated last month
schmidtbri / task-queue-ml-model-deployment
Deploying an ML Model in a Task Queue
☆11 · Jul 9, 2024 · Updated 2 years ago
NVIDIA / PRBench
A CUDA implementation of the PageRank Pipeline Benchmark
☆32 · Jan 31, 2017 · Updated 9 years ago
XiaosongAI / Parallel-SpMV
Parallel optimization algorithms for sparse matrix-vector multiplication (OpenMP, AVX)
☆11 · Jul 7, 2021 · Updated 5 years ago
UCLA-VAST / FlexCNN
☆74 · Feb 16, 2023 · Updated 3 years ago
jacobaustin123 / Python-C-API-CUDA-Tutorial
A tutorial/example of the Python C-API and integration with CUDA kernels.
☆14 · Jul 7, 2019 · Updated 7 years ago
kykosic / actix-pytorch-example
An example of using Torch rust bindings to serve trained machine learning models via Actix Web
☆17 · Aug 15, 2021 · Updated 4 years ago
shixun404 / Fault-Tolerant-SGEMM-on-NVIDIA-GPUs
Anatomy of High-Performance GEMM with Online Fault Tolerance on GPUs
☆14 · Apr 3, 2025 · Updated last year
yzhaiustc / Optimizing-SGEMV-on-NVIDIA-GPUs
An implementation of SGEMV with performance comparable to cuBLAS.
☆12 · May 21, 2021 · Updated 5 years ago
kxtxr / rust-actix-react-web-starter
Rust (Actix & Diesel) + React (w/ Typescript) + MySQL starter pack. Currently serves my need for a nice Dev Environment.
☆16 · Apr 14, 2026 · Updated 3 months ago
rise-lang / mlir
The LLVM Project is a collection of modular and reusable compiler and toolchain technologies. Note: the repository does not accept github…
☆33 · Jul 20, 2021 · Updated 5 years ago
SuperScientificSoftwareLaboratory / TileSpMV
Source code of the IPDPS '21 paper: "TileSpMV: A Tiled Algorithm for Sparse Matrix-Vector Multiplication on GPUs" by Yuyao Niu, Zhengyang…
☆13 · Aug 12, 2022 · Updated 3 years ago
PASSIONLab / MaskedSpGEMM
☆10 · Jul 4, 2022 · Updated 4 years ago
sunqm / pbchf
An example to implement PBC SCF
☆14 · Jul 10, 2018 · Updated 8 years ago
SRI-CSL / Bliss
BLISS: Bimodal Lattice Signature Schemes
☆31 · Jul 10, 2020 · Updated 6 years ago
dominiksalvet / limen-alpha
Dual-core 16-bit RISC processor
☆12 · Jul 21, 2024 · Updated 2 years ago
tangjie1992 / MIMO
MIMO precoding and detection algorithm
☆19 · Feb 26, 2018 · Updated 8 years ago