geohot / ctypeslib
Generate python ctypes classes from C headers. Requires LLVM clang
☆13Updated 8 months ago
Alternatives and similar repositories for ctypeslib:
Users that are interested in ctypeslib are comparing it to the libraries listed below
- ctypes wrappers for HIP, CUDA, and OpenCL☆129Updated 10 months ago
- FP4 MAC Array☆17Updated last year
- RDNA3 emulator☆54Updated 3 weeks ago
- The Finite Field Assembly Programming Language☆36Updated last month
- Custom PTX Instruction Benchmark☆123Updated 2 months ago
- Fork of Triton repository for OpenXLA uses of the Triton language and compiler☆11Updated this week
- Learning about CUDA by writing PTX code.☆129Updated last year
- Write a fast kernel and run it on Discord. See how you compare against the best!☆44Updated this week
- LLM training in simple, raw C/CUDA☆94Updated last year
- ☆13Updated 10 months ago
- High-Performance SGEMM on CUDA devices☆90Updated 3 months ago
- pytorch from scratch in pure C/CUDA and python☆40Updated 7 months ago
- Standalone commandline CLI tool for compiling Triton kernels☆18Updated 7 months ago
- asynchronous/distributed speculative evaluation for llama3☆39Updated 9 months ago
- ☆47Updated last year
- Python bindings for ggml☆140Updated 8 months ago
- Because it's there.☆16Updated 7 months ago
- Explore training for quantized models☆18Updated 4 months ago
- Solving floating point SMT constraints on a GPU☆48Updated 4 years ago
- LLVM-Canon aims to transform LLVM modules into a canonical form by reordering and renaming instructions while preserving the same semanti…☆15Updated last year
- Experiments with BitNet inference on CPU☆54Updated last year
- Loop Nest - Linear algebra compiler and code generator.☆22Updated 2 years ago
- FlexAttention w/ FlashAttention3 Support☆26Updated 7 months ago
- Reverse Engineering Micro-architectural Features☆10Updated 4 years ago
- Gpu benchmark☆60Updated 3 months ago
- Jax like function transformation engine but micro, microjax☆31Updated 6 months ago
- A user-friendly tool chain that enables the seamless execution of ONNX models using JAX as the backend.☆111Updated this week
- ☆16Updated last year
- ☆24Updated 8 months ago
- Architecture mapping proofs written in Agda for the paper "Lasagne: A Static Binary Translator for Weak Memory Model Architectures"☆13Updated 3 years ago