A lightweight Triton-based General Matrix Multiplication (GEMM) library.
☆57 Apr 8, 2026 Updated this week
Alternatives and similar repositories for tritonBLAS
Users that are interested in tritonBLAS are comparing it to the libraries listed below.
- ☆10 May 15, 2024 Updated last year
- A Triton JIT runtime and ffi provider in C++ ☆32 Updated this week
- LLVM/MLIR based compiler instrumentation of AMD GPU kernels ☆20 Jul 13, 2025 Updated 9 months ago
- ☆16 Nov 10, 2025 Updated 5 months ago
- ☆20 Sep 28, 2024 Updated last year
- HIP backend patch for Numba, the NumPy aware dynamic Python compiler using LLVM. ☆19 Feb 16, 2026 Updated last month
- Tensor library for machine learning ☆27 Apr 4, 2026 Updated last week
- A dynamic GPU memory allocator, suitable for warp synchronized scenarios. ☆11 Aug 20, 2019 Updated 6 years ago
- ☆12 Mar 14, 2024 Updated 2 years ago
- Taichi Course 01 Final Project Template ☆13 Dec 13, 2021 Updated 4 years ago
- Open-source library for Graph Streaming. Solves the connected components problem using sub-linear space. Published in SIGMOD'22. ☆10 Apr 6, 2026 Updated last week
- back up of my clothes project ☆14 Dec 18, 2016 Updated 9 years ago
- ☆13 Nov 4, 2020 Updated 5 years ago
- A reference implementation of std::simd, providing data parallel types in the C++ standard ☆14 Mar 9, 2020 Updated 6 years ago
- ☆13 Nov 25, 2021 Updated 4 years ago
- Source code supporting the High Performance Graphics 2022 paper: Supporting Unified Shader Specialization by Co-opting C++ Features ☆14 Jul 9, 2022 Updated 3 years ago
- Implementation of various equivariant models in JAX ☆19 Apr 12, 2024 Updated 2 years ago
- ☆16 Aug 16, 2020 Updated 5 years ago
- ☆24 May 18, 2025 Updated 10 months ago
- PaiNN in jax ☆11 Jan 14, 2025 Updated last year
- Hierarchical Loss function ☆13 May 6, 2019 Updated 6 years ago
- Docker image for ☆11 Dec 25, 2017 Updated 8 years ago
- ☆50 Apr 7, 2026 Updated last week
- Cloth Simulation with WebGPU Compute Shader ☆20 Jun 5, 2025 Updated 10 months ago
- Matching algorithms for LightGraphs.jl ☆13 Oct 21, 2021 Updated 4 years ago
- Distributed machine learning platform ☆13 Aug 20, 2015 Updated 10 years ago
- Scale-out system monitoring ☆21 Apr 8, 2026 Updated last week
- A WebGL-based real-time fluid solver. ☆20 Jul 12, 2017 Updated 8 years ago
- Collection of open source OpenGL demos, graphics prototypes and physics sims. ☆15 May 29, 2021 Updated 4 years ago
- Benchmarking scripts for Gaia ☆14 Apr 10, 2025 Updated last year
- ☆13 Jun 2, 2024 Updated last year
- ☆29 Sep 6, 2022 Updated 3 years ago
- SapienIPC experimental release. This release is temporary and will not be maintained. We will release a stable version soon. ☆21 Dec 8, 2025 Updated 4 months ago
- ☆17 Mar 26, 2025 Updated last year
- Tensor Parallelism with JAX + Shard Map ☆11 Sep 29, 2023 Updated 2 years ago
- A 3D mass-spring real world simulator with more types of forces (gravity, electricity, spring, collision, ...) ☆19 Sep 4, 2020 Updated 5 years ago
- Incomplete-Cholesky preconditioned conjugate gradient algorithm implemented with cuBLAS/cuSPARSE ☆12 Jun 24, 2022 Updated 3 years ago
- In this folder my all Python codes are stored. ☆17 May 2, 2021 Updated 4 years ago
- Automated bottleneck detection and solution orchestration ☆20 Feb 24, 2026 Updated last month