Fast and efficient attention method exploration and implementation.
☆25 · Mar 25, 2025 · Updated last year
Alternatives and similar repositories for FlashMLA
Users that are interested in FlashMLA are comparing it to the libraries listed below.
- An NVIDIA AI Workbench Example Project for Finetuning Llama 2 ☆35 · Aug 29, 2024 · Updated last year
- Emulating DMA Engines on GPUs for Performance and Portability ☆42 · May 17, 2015 · Updated 10 years ago
- DLBlas: clean and efficient kernels ☆39 · Apr 24, 2026 · Updated last week
- ☆120 · May 16, 2025 · Updated 11 months ago
- SC 2021, "LogECMem: Coupling Erasure-Coded In-Memory Key-Value Stores with Parity Logging" ☆12 · Jul 12, 2021 · Updated 4 years ago
- Export Blender (2.4x) curves to TikZ format for use with TeX ☆13 · Apr 18, 2014 · Updated 12 years ago
- A tool to detect infrastructure issues on cloud native AI systems ☆53 · Sep 18, 2025 · Updated 7 months ago
- create concept map from textbook data ☆11 · May 4, 2018 · Updated 7 years ago
- ONCache: A Cache-Based Low-Overhead Container Overlay Network ☆21 · Jun 7, 2025 · Updated 10 months ago
- ☆13 · Aug 1, 2025 · Updated 9 months ago
- a cloud-native workflow engine, also known as KubeAdaptor, a docking framework able to implement workflow containerization on Kubernetes… ☆12 · Apr 21, 2022 · Updated 4 years ago
- Artifacts of EVT ASPLOS'24 ☆30 · Mar 6, 2024 · Updated 2 years ago
- EPOCH Input System Version 2 ☆10 · Jun 5, 2020 · Updated 5 years ago
- ☆74 · Updated this week
- ☆17 · Dec 9, 2024 · Updated last year
- Home Page ☆18 · Oct 4, 2025 · Updated 6 months ago
- High-performance LLM operator library built on TileLang. ☆111 · Updated this week
- ☆16 · Oct 13, 2023 · Updated 2 years ago
- Global Address SPace toolbox -- Julia wrapper ☆10 · Nov 17, 2017 · Updated 8 years ago
- G'MIC-Qt is a versatile front-end to the image processing framework G'MIC. ☆17 · Mar 18, 2026 · Updated last month
- ☆20 · Nov 7, 2023 · Updated 2 years ago
- 🐝 Tiny CLI to post simultaneously to Mastodon and Bluesky ☆17 · Apr 3, 2026 · Updated 3 weeks ago
- Prototype for a SPIR-V assembler and disassembler. It provides a composable Java interface for generating SPIR-V code at runtime. ☆14 · Oct 31, 2025 · Updated 6 months ago
- ☆19 · Nov 23, 2021 · Updated 4 years ago
- ☆16 · Sep 27, 2018 · Updated 7 years ago
- AMD’s C++ library for accelerating tensor primitives ☆49 · Apr 22, 2026 · Updated last week
- Fast Approximate Membership Filters (C++) ☆24 · Apr 27, 2021 · Updated 5 years ago
- Simple, lightweight transformers in Fortran ☆17 · Nov 17, 2023 · Updated 2 years ago
- Voronoi Diagram implementation ☆10 · Aug 3, 2019 · Updated 6 years ago
- Phi-2 Colab Notebook ☆14 · Dec 14, 2023 · Updated 2 years ago
- Reconstruction of distorted underwater images using robust registration ☆15 · Apr 16, 2019 · Updated 7 years ago
- [DEPRECATED] Moved to ROCm/rocm-libraries repo ☆12 · Apr 23, 2026 · Updated last week
- SODECL is a library of ordinary differential equation (ODE) and stochastic differential equation (SDE) solvers in OpenCL. ☆11 · Jul 4, 2020 · Updated 5 years ago
- ext_mpi_collectives ☆11 · Mar 27, 2026 · Updated last month
- Rename files in the same way you edit text ☆16 · Oct 1, 2025 · Updated 7 months ago
- Argonne Leadership Computing Facility OpenCL tutorial ☆10 · Aug 22, 2025 · Updated 8 months ago
- Parallel SpMV using CSR representation, built in CUDA ☆14 · Jun 27, 2020 · Updated 5 years ago
- The GNU MathProg implementation of OSeMOSYS ☆12 · Nov 7, 2024 · Updated last year
- ☆10 · Dec 8, 2022 · Updated 3 years ago