Triton with an OpenCL backend; uses mlir-translate to generate OpenCL source code
☆24 · Aug 27, 2025 · Updated 6 months ago
Alternatives and similar repositories for triton-ocl
Users who are interested in triton-ocl are comparing it to the libraries listed below
- Shor's algorithm simulation using CUDA ☆19 · Nov 10, 2019 · Updated 6 years ago
- ☆23 · Jun 11, 2025 · Updated 8 months ago
- ☆26 · Aug 28, 2024 · Updated last year
- cuJSON: A Highly Parallel JSON Parser for GPUs ☆40 · Dec 12, 2025 · Updated 2 months ago
- ☆65 · Apr 26, 2025 · Updated 10 months ago
- An experimental communicating attention kernel based on DeepEP. ☆35 · Jul 29, 2025 · Updated 7 months ago
- ARCHIE is a QEMU-based architecture-independent fault evaluation tool, that is able to simulate transient and permanent instruction and d… ☆33 · Jan 3, 2026 · Updated 2 months ago
- Translating human input as kubectl commands using LLMs powered by Yacana ☆12 · Mar 2, 2026 · Updated last week
- Writing a CUDA software ray tracing renderer with Analysis-Driven Optimization from scratch: a python-importable, distributed parallel re… ☆37 · Oct 5, 2025 · Updated 5 months ago
- [ICML 2025] RocketKV: Accelerating Long-Context LLM Inference via Two-Stage KV Cache Compression ☆34 · Aug 7, 2025 · Updated 7 months ago
- ☆10 · Sep 10, 2025 · Updated 6 months ago
- Luthier, a GPU binary instrumentation tool for AMD GPUs ☆27 · Updated this week
- instant cross-platform jit engine inspired by Xbyak ☆11 · Feb 22, 2026 · Updated 2 weeks ago
- Examples for the HEBI Robotics Python API ☆14 · Jan 9, 2026 · Updated 2 months ago
- PTX-EMU is a simple emulator for CUDA program. ☆38 · Apr 25, 2025 · Updated 10 months ago
- An experimental CPU backend for Triton ☆181 · Feb 25, 2026 · Updated last week
- FlagTree is a unified compiler supporting multiple AI chip backends for custom Deep Learning operations, which is forked from triton-lang… ☆214 · Mar 3, 2026 · Updated last week
- Official repository for the paper Local Linear Attention: An Optimal Interpolation of Linear and Softmax Attention For Test-Time Regressi… ☆23 · Oct 1, 2025 · Updated 5 months ago
- Parallel computing implementation of the Word2Vec task ☆11 · Sep 11, 2017 · Updated 8 years ago
- FELICS Framework ☆11 · Dec 5, 2019 · Updated 6 years ago
- ☆13 · Jan 16, 2026 · Updated last month
- gcnano-binaries ☆11 · Feb 9, 2026 · Updated last month
- Python library for Alphanov's PDM laser sources control ☆13 · Feb 24, 2026 · Updated last week
- A learning project for getting newcomers started with a WASM JIT compiler ☆14 · Feb 28, 2026 · Updated last week
- Converts Python3 .py files into .exe and makes it so the file can run on any environment without installing python3. ☆11 · Jun 7, 2018 · Updated 7 years ago
- ECED440 Computer Security ☆11 · Nov 6, 2024 · Updated last year
- Tutorial on building a gpu compiler backend in LLVM ☆55 · Jan 11, 2025 · Updated last year
- Experiments on Multi-Head Latent Attention ☆100 · Aug 19, 2024 · Updated last year
- Intruder.py - A powerful tool to customize attacks on websites. Has 4 different options of attacks ☆12 · Dec 1, 2020 · Updated 5 years ago
- A docker image for One Student One Chip's debug exam ☆10 · Sep 22, 2023 · Updated 2 years ago
- USB library for ChipWhisperer devices ☆15 · Jan 22, 2026 · Updated last month
- Ghidra Loader for ESP32 Flash Dumps ☆15 · Feb 10, 2025 · Updated last year
- ☆12 · Aug 17, 2020 · Updated 5 years ago
- ☆15 · Dec 9, 2025 · Updated 3 months ago
- An automated toolkit to analyze and detect changes in secure hardware and cryptographic libraries. SCRUTINY provides high-level framework… ☆16 · Feb 12, 2026 · Updated 3 weeks ago
- GOST-34.11-2012 (Stribog) hash-function ☆11 · May 12, 2015 · Updated 10 years ago
- Higher-order Masking of AES-128 based on the Rivain and Prouff method, CPRR method and Common Shares with Random Reduction method. ☆14 · May 13, 2017 · Updated 8 years ago
- RISC-V vector and tensor compute extensions for Vortex GPGPU acceleration for ML workloads. Optimized for transformer models, CNNs, and g… ☆21 · Apr 25, 2025 · Updated 10 months ago
- ☆13 · Jul 2, 2025 · Updated 8 months ago