LeetArxiv / Finite-Field-AssemblyLinks
The Finite Field Assembly Programming Language
☆36Updated 2 months ago
Alternatives and similar repositories for Finite-Field-Assembly
Users that are interested in Finite-Field-Assembly are comparing it to the libraries listed below
Sorting:
- tiny code to access tenstorrent blackhole☆57Updated 2 months ago
- Tensor library & inference framework for machine learning☆106Updated 3 weeks ago
- A massively parallel, optimal functional runtime in Rust☆31Updated last year
- High-Performance SGEMM on CUDA devices☆98Updated 6 months ago
- asynchronous/distributed speculative evaluation for llama3☆39Updated last year
- Pytorch script hot swap: Change code without unloading your LLM from VRAM☆126Updated 3 months ago
- Learning about CUDA by writing PTX code.☆133Updated last year
- A tiny autograd engine with a Jax-like API☆74Updated last month
- Framework for specifying and proving properties—such as robustness, fairness, and interpretability—of machine learning models using Lean …☆64Updated last week
- Train neural networks that distill into logic circuits, using JAX☆63Updated 2 months ago
- RDNA3 emulator☆54Updated 3 months ago
- Rust bindings to GAP (Groups, Algorithms, Programming)☆28Updated 2 years ago
- a categorical deep learning compiler☆203Updated 5 months ago
- Custom PTX Instruction Benchmark☆126Updated 5 months ago
- Can I make an *optimizing* compiler under 1k lines of code?☆60Updated 5 months ago
- ctypes wrappers for HIP, CUDA, and OpenCL☆130Updated last year
- Samples of good AI generated CUDA kernels☆86Updated 2 months ago
- parallelized hyperdimensional tictactoe☆118Updated 11 months ago
- Editor with LLM generation tree exploration☆73Updated 5 months ago
- Ultra low overhead NVIDIA GPU telemetry plugin for telegraf with memory temperature readings.☆62Updated last year
- DiscoGrad - automatically differentiate across conditional branches in C++ programs☆204Updated 10 months ago
- Hashed Lookup Table based Matrix Multiplication (halutmatmul) - Stella Nera accelerator☆211Updated last year
- Training GPTs to solve interaction nets☆17Updated 11 months ago
- Multi-Threaded FP32 Matrix Multiplication on x86 CPUs☆350Updated 3 months ago
- Lightweight Llama 3 8B Inference Engine in CUDA C☆47Updated 4 months ago
- A package for defining deep learning models using categorical algebraic expressions.☆61Updated last year
- Heirarchical Navigable Small Worlds☆98Updated 4 months ago
- Attempt at Neuralink's Compression Challenge☆86Updated last year
- ☆18Updated last year
- Inference RWKV v7 in pure C.☆37Updated 2 weeks ago