GEMMul8 (GEMMulate): GEMM emulation using INT8/FP8 matrix engines based on the Ozaki Scheme II
☆59 · Apr 1, 2026 · Updated last week
Alternatives and similar repositories for GEMMul8
Users that are interested in GEMMul8 are comparing it to the libraries listed below.
- Acceleration codes for the Ozaki-scheme on integer matrix multiplication units. ☆24 · Dec 10, 2025 · Updated 4 months ago
- FP64 equivalent GEMM via Int8 Tensor Cores using the Ozaki scheme ☆116 · Dec 2, 2025 · Updated 4 months ago
- Fast SGEMM emulation on Tensor Cores ☆17 · Feb 16, 2025 · Updated last year
- Concurrent hash tries for C++ 14 with no memory management whatsoever. ☆10 · Aug 30, 2016 · Updated 9 years ago
- Exchange correlation (XC) library for density functional theory (DFT) calculations in modern C++ ☆28 · Jan 20, 2026 · Updated 2 months ago
- Generating contraction orders and performing numerical contractions for arbitrary tensor networks ☆18 · May 20, 2024 · Updated last year
- OpenMP offload playground ☆10 · Nov 16, 2024 · Updated last year
- FFVC - Frontflow/violet Cartesian ☆14 · Apr 5, 2020 · Updated 6 years ago
- Training v0.7 results ☆12 · Nov 18, 2025 · Updated 4 months ago
- ☆17 · Mar 27, 2026 · Updated 2 weeks ago
- ☆18 · Jan 2, 2026 · Updated 3 months ago
- Selene is an iCalendar parser for Ruby ☆15 · Feb 16, 2019 · Updated 7 years ago
- High Availability Shared Pipeline Engine ☆17 · Sep 15, 2023 · Updated 2 years ago
- CUDA Finite Difference Library ☆16 · Aug 21, 2020 · Updated 5 years ago
- A restaurant reservation web application. Using Ruby on Rails, PostgreSQL, JavaScript/React/Redux, JSX, SCSS. ☆14 · Jul 18, 2018 · Updated 7 years ago
- Digital paint mixing program based on the Kubelka-Munk equations. Implementation of: T. Lindemeier, J. M. Gülzow, and O. Deussen. 2018… ☆14 · Sep 10, 2020 · Updated 5 years ago
- A Nim library for making graphs with GraphViz and DOT (based on PyGraphviz) ☆11 · Sep 7, 2021 · Updated 4 years ago
- Semi-Lagrangian Library ☆17 · Oct 23, 2023 · Updated 2 years ago
- Takum arithmetic C99 reference implementation ☆22 · Nov 24, 2025 · Updated 4 months ago
- ☆18 · Jun 12, 2023 · Updated 2 years ago
- MaRe leverages the power of Docker and Spark to run and scale your serial tools in MapReduce fashion. ☆14 · Apr 12, 2022 · Updated 3 years ago
- Itoyori: A distributed multi-threading runtime system for global-view fork-join task parallelism ☆22 · Feb 9, 2024 · Updated 2 years ago
- Stable, numerical Navier-Stokes solver for use in real-time simulation ☆16 · Apr 6, 2021 · Updated 5 years ago
- variPEPS -- Versatile tensor network library for variational ground state simulations in two spatial dimensions ☆18 · Mar 19, 2026 · Updated 3 weeks ago
- General, Hybrid and Optimized Sparse Toolkit (Bitbucket mirror) ☆12 · Apr 8, 2021 · Updated 5 years ago
- ☆38 · May 23, 2025 · Updated 10 months ago
- ☆14 · Dec 3, 2024 · Updated last year
- Distributed number crunching with Nim ☆12 · Dec 19, 2023 · Updated 2 years ago
- ☆17 · Nov 3, 2025 · Updated 5 months ago
- A flexible, templated GPU library of neighbor search algorithms. ☆12 · Jul 22, 2021 · Updated 4 years ago
- This repo contains the code of the paper "RayJoin: Fast and Precise Spatial Join", ICS'24 ☆11 · Updated this week
- A little library for using SIMD instructions for x86 and ARM, wrapping Agner Fog's vectorclass for x86 and filling some of its functional… ☆17 · Dec 10, 2021 · Updated 4 years ago
- A library for code transformations with guaranteed legality ☆18 · Updated this week
- EigenKernel - a package of hybrid parallel solvers for eigenvalue problems ☆15 · Jul 11, 2021 · Updated 4 years ago
- Distributions is a Nim library for distributions and their functions. ☆18 · Jul 16, 2022 · Updated 3 years ago
- Sort 1..25 values with conditional swaps ☆17 · Aug 6, 2024 · Updated last year
- Fortran language support for Atom-IDE ☆22 · Mar 25, 2019 · Updated 7 years ago
- Website to compare Python package downloads ☆45 · Mar 3, 2026 · Updated last month
- Code release for "FlashBias: Fast Computation of Attention with Bias" (NeurIPS 2025), https://arxiv.org/abs/2505.12044 ☆26 · Nov 17, 2025 · Updated 4 months ago