geohot / ctypeslib
Generate python ctypes classes from C headers. Requires LLVM clang
☆13Updated 8 months ago
Alternatives and similar repositories for ctypeslib:
Users that are interested in ctypeslib are comparing it to the libraries listed below
- ctypes wrappers for HIP, CUDA, and OpenCL☆129Updated 9 months ago
- FP4 MAC Array☆17Updated last year
- The Finite Field Assembly Programming Language☆36Updated last week
- LLVM-Canon aims to transform LLVM modules into a canonical form by reordering and renaming instructions while preserving the same semanti…☆14Updated 11 months ago
- pytorch from scratch in pure C/CUDA and python☆40Updated 6 months ago
- tenstorrent kernel from twitch☆27Updated last year
- Custom PTX Instruction Benchmark☆123Updated last month
- asynchronous/distributed speculative evaluation for llama3☆39Updated 8 months ago
- A tiny deep learning library written in Java☆25Updated 2 years ago
- LLM training in simple, raw C/CUDA☆92Updated 11 months ago
- Learning about CUDA by writing PTX code.☆128Updated last year
- Nvidia Instruction Set Specification Generator☆255Updated 9 months ago
- RDNA3 emulator☆54Updated this week
- Fork of Triton repository for OpenXLA uses of the Triton language and compiler☆11Updated this week
- ☆21Updated last month
- FlexAttention w/ FlashAttention3 Support☆26Updated 6 months ago
- Experiments with BitNet inference on CPU☆53Updated last year
- Standalone commandline CLI tool for compiling Triton kernels☆17Updated 7 months ago
- Attention in SRAM on Tenstorrent Grayskull☆33Updated 9 months ago
- A tracing JIT compiler for PyTorch☆13Updated 3 years ago
- Tenstorrent's MLIR Based Compiler. We aim to enable developers to run AI on all configurations of Tenstorrent hardware, through an open-s…☆39Updated this week
- ☆27Updated 9 months ago
- ☆47Updated last year
- ☆13Updated 10 months ago
- General purpose GPU compute framework built on Vulkan to support 1000s of cross vendor graphics cards (AMD, Qualcomm, NVIDIA & friends). …☆46Updated 2 months ago
- User-Mode Driver for Tenstorrent hardware☆20Updated this week
- KAN (Kolmogorov–Arnold Networks) in the MLX framework for Apple Silicon☆16Updated last month
- Thin wrapper around GGML to make life easier☆23Updated 2 weeks ago
- Reverse Engineering Micro-architectural Features☆10Updated 4 years ago
- ☆112Updated last year