openxla / triton
Fork of Triton repository for OpenXLA uses of the Triton language and compiler
☆11Updated this week
Alternatives and similar repositories for triton:
Users who are interested in triton often compare it to the libraries listed below
- LLVM-Canon aims to transform LLVM modules into a canonical form by reordering and renaming instructions while preserving the same semanti…☆15Updated last year
- Main Repo for the OpenHW Group Software Task Group☆17Updated 2 months ago
- ☆14Updated 2 years ago
- ☆17Updated this week
- asynchronous/distributed speculative evaluation for llama3☆39Updated 9 months ago
- ☆59Updated this week
- Explore training for quantized models☆18Updated 4 months ago
- ☆13Updated 3 years ago
- General-purpose GPU compute framework built on Vulkan to support 1000s of cross-vendor graphics cards (AMD, Qualcomm, NVIDIA & friends). …☆46Updated 2 months ago
- a clone of POCL that includes RISC-V newlib devices support and Vortex☆41Updated last month
- JAX implementations of RWKV☆19Updated last year
- minimal C implementation of speculative decoding based on llama2.c☆22Updated 9 months ago
- ☆19Updated this week
- ☆51Updated 9 months ago
- Embedded Universal DSL: a good DSL for us, by us☆36Updated this week
- Course Project for COMP4471 on RWKV☆17Updated last year
- Web browser version of StarCoder.cpp☆45Updated last year
- Generate Python ctypes classes from C headers. Requires LLVM clang☆13Updated 8 months ago
- A tracing JIT compiler for PyTorch☆13Updated 3 years ago
- int8_t and int16_t matrix multiply based on https://arxiv.org/abs/1705.01991☆71Updated last year
- Advanced Operating Systems project☆20Updated 8 months ago
- CMake modules used within the ROCm libraries☆66Updated this week
- Attention in SRAM on Tenstorrent Grayskull☆35Updated 9 months ago
- The AMD rocAL is designed to efficiently decode and process images and videos from a variety of storage formats and modify them through a…☆17Updated this week
- A server powering LAION's effort to filter CommonCrawl with CLIP, building a large scale image-text dataset.☆13Updated 2 years ago
- SynapseAI Core is a reference implementation of the SynapseAI API running on Habana Gaudi☆40Updated 3 months ago
- ☆18Updated 10 months ago
- An experimental CPU backend for Triton (https://github.com/openai/triton)☆40Updated last month
- A fork of OpenBLAS with Armv8-A SVE (Scalable Vector Extension) support☆17Updated 5 years ago
- TinyFive is a lightweight RISC-V emulator and assembler written in Python with neural network examples☆61Updated last year