☆14 · Dec 5, 2024 · Updated last year
Alternatives and similar repositories for autoGEMM
Users that are interested in autoGEMM are comparing it to the libraries listed below
- ☆33 · Mar 31, 2025 · Updated 11 months ago
- Acceleration codes for the Ozaki-scheme on integer matrix multiplication units. ☆21 · Dec 10, 2025 · Updated 2 months ago
- An implementation of HPL-AI Mixed-Precision Benchmark based on hpl-2.3 ☆29 · May 30, 2021 · Updated 4 years ago
- Create and deploy virtual-experiments - co-processing computational workflows ☆10 · Jan 28, 2026 · Updated last month
- ext_mpi_collectives ☆11 · Apr 1, 2025 · Updated 11 months ago
- PARADIS, a lightweight and flexible weather forecast model that tries to Keep It Simple. ☆26 · Feb 4, 2026 · Updated 3 weeks ago
- Memory Topology for GPUs ☆17 · Feb 13, 2026 · Updated 2 weeks ago
- Cross-platform, high-throughput computing utility for processing shell commands over a distributed, asynchronous queue. ☆41 · Jun 19, 2025 · Updated 8 months ago
- ☆10 · Dec 27, 2020 · Updated 5 years ago
- ☆11 · Feb 5, 2017 · Updated 9 years ago
- ☆11 · Oct 15, 2024 · Updated last year
- ☆12 · Aug 17, 2022 · Updated 3 years ago
- Chinese word segmentation with the neural seq2seq model, implemented in PyTorch ☆10 · Dec 13, 2017 · Updated 8 years ago
- How to build an ACP compliant agent that uses MCP as well! ☆11 · May 6, 2025 · Updated 9 months ago
- EPOCH Input System Version 2 ☆10 · Jun 5, 2020 · Updated 5 years ago
- ☆11 · Feb 27, 2024 · Updated 2 years ago
- OpenMP offload playground ☆10 · Nov 16, 2024 · Updated last year
- 2D time-domain isotropic (visco)elastic FD modeling and full waveform inversion (FWI) code for SH-waves ☆13 · Aug 9, 2020 · Updated 5 years ago
- Code for paper "Beyond Closure Models: Learning Chaotic Systems via Physics-Informed Neural Operators". ☆14 · Dec 24, 2025 · Updated 2 months ago
- ☆10 · Updated this week
- Statistical discontinuous constituent parsing ☆11 · Feb 15, 2018 · Updated 8 years ago
- Argonne Leadership Computing Facility OpenCL tutorial ☆10 · Aug 22, 2025 · Updated 6 months ago
- Performance Counter Reader ☆11 · Sep 14, 2022 · Updated 3 years ago
- GPU based 2D elastic FWI ☆12 · Mar 6, 2018 · Updated 7 years ago
- CPU and GPU tutorial examples ☆13 · Apr 4, 2025 · Updated 10 months ago
- OpenVINO LLM Benchmark ☆11 · Dec 7, 2023 · Updated 2 years ago
- Global Address SPace toolbox -- Julia wrapper ☆10 · Nov 17, 2017 · Updated 8 years ago
- PCB libraries and templates for rocket-chip based FPGA/ASIC designs ☆15 · Feb 24, 2026 · Updated last week
- Continuum Dynamics Evaluation and Test Suite ☆15 · Aug 29, 2017 · Updated 8 years ago
- FMS Model Optimizer is a framework for developing reduced precision neural network models. ☆21 · Feb 23, 2026 · Updated last week
- The goal of this design is to use the PYNQ-Z2 development board to design a general convolution neural network accelerator. And through r… ☆11 · Sep 30, 2020 · Updated 5 years ago
- Enhancing the convergence speed by 2x and improving the training success of Physics-Informed Neural Networks (PINNs). ☆13 · Oct 14, 2024 · Updated last year
- ☆11 · Dec 9, 2022 · Updated 3 years ago
- SODECL is a library of ordinary differential equation (ODE) and stochastic differential equation (SDE) solvers in OpenCL. ☆11 · Jul 4, 2020 · Updated 5 years ago
- Scripts for viewing Slurm batch job resource usages ☆11 · Jan 3, 2022 · Updated 4 years ago
- Sequential Parameter Optimization in Python ☆14 · Jan 12, 2026 · Updated last month
- ☆12 · Aug 4, 2025 · Updated 6 months ago
- Open source code for AlphaFold. ☆12 · Nov 15, 2025 · Updated 3 months ago
- Reference implementation for the climate segmentation benchmark, based on the Exascale Deep Learning for Climate Analytics work ☆10 · May 6, 2020 · Updated 5 years ago