GEMMul8 (GEMMulate): GEMM emulation using INT8/FP8 matrix engines based on the Ozaki Scheme II
☆72Jun 8, 2026Updated this week
Alternatives and similar repositories for GEMMul8
Users that are interested in GEMMul8 are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Acceleration codes for the Ozaki-scheme on integer matrix multiplication units.☆25Dec 10, 2025Updated 6 months ago
- FP64 equivalent GEMM by the Ozaki scheme with Int8 Tensor Cores☆120Dec 2, 2025Updated 6 months ago
- Fast SGEMM emulation on Tensor Cores☆17Feb 16, 2025Updated last year
- Concurrent hash tries for C++ 14 with no memory management whatsoever.☆10Aug 30, 2016Updated 9 years ago
- OpenMP offload playground☆10Nov 16, 2024Updated last year
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- FFVC - Frontflow/violet Cartesian☆14Apr 5, 2020Updated 6 years ago
- Training v0.7 results☆12Nov 18, 2025Updated 6 months ago
- This is an old archived repository that we keep for our records. Please use recent GENESIS repository and do not use this one.☆11Sep 15, 2022Updated 3 years ago
- ☆19Jan 2, 2026Updated 5 months ago
- An extension library of WMMA API (Tensor Core API)☆113Jul 12, 2024Updated last year
- Selene is an iCalendar parser for Ruby☆15Feb 16, 2019Updated 7 years ago
- High Availability Shared Pipeline Engine☆17Sep 15, 2023Updated 2 years ago
- Plane-Wave density-functional theory (DFT) development for NWChemEx electronic structure software☆13Updated this week
- ☆16May 17, 2018Updated 8 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- A restaurant reservation web application. Using Ruby on Rails, PostgreSQL, JavaScript/React/Redux, JSX, SCSS.☆14Jul 18, 2018Updated 7 years ago
- Digital paint mixing program based on the Kubelka-Munk equations. Implementation of : T. Lindemeier, J. M. Gülzow, and O. Deussen. 2018…☆14Sep 10, 2020Updated 5 years ago
- A nim library for making graphs with GraphViz and DOT (based on PyGraphviz)☆11Apr 25, 2026Updated last month
- Takum arithmetic C99 reference implementation☆25Nov 24, 2025Updated 6 months ago
- MaRe leverages the power of Docker and Spark to run and scale your serial tools in MapReduce fashion.☆14Apr 12, 2022Updated 4 years ago
- SALMON 2.0.0 Development Repository☆15Updated this week
- Itoyori: A distributed multi-threading runtime system for global-view fork-join task parallelism☆23Feb 9, 2024Updated 2 years ago
- variPEPS -- Versatile tensor network library for variational ground state simulations in two spatial dimensions☆18May 29, 2026Updated last week
- A high-performance implementation of Empirical Dynamic Modeling (EDM)☆20Feb 25, 2026Updated 3 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- General, Hybrid and Optimized Sparse Toolkit (Bitbucket mirror)☆12Apr 8, 2021Updated 5 years ago
- ☆17May 28, 2026Updated last week
- A flexible, templated GPU library of neighbor search algorithms.☆11Jul 22, 2021Updated 4 years ago
- This repo contains the code of the paper "RayJoin: Fast and Precise Spatial Join", ICS'24☆12Updated this week
- EigenKernel - a package of hybrid parallel solvers for eigenvalue problems☆15Jul 11, 2021Updated 4 years ago
- Distributions is a Nim library for distributions and their functions.☆18Jul 16, 2022Updated 3 years ago
- Sort 1..25 values with conditional swaps☆17Aug 6, 2024Updated last year
- Fortran language support for Atom-IDE☆22Mar 25, 2019Updated 7 years ago
- Website to compare Python package downloads☆46Updated this week
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- ☆43Apr 27, 2026Updated last month
- About Code release for "FlashBias: Fast Computation of Attention with Bias" (NeurIPS 2025), https://arxiv.org/abs/2505.12044☆29Nov 17, 2025Updated 6 months ago
- This is an example of a boolean expression editor made in Dear ImGui☆15Dec 3, 2022Updated 3 years ago
- TTG: Template Task Graph C++ API☆26May 9, 2026Updated last month
- GEMV implementation with CUTLASS☆21Aug 21, 2025Updated 9 months ago
- Medusa: Accelerating Serverless LLM Inference with Materialization [ASPLOS'25]☆12Nov 8, 2024Updated last year
- A copy of the DirectX Headers from MinGW-64.☆14Sep 7, 2023Updated 2 years ago