tinygrad / tinyos
☆35 Updated 2 weeks ago
Alternatives and similar repositories for tinyos
Users who are interested in tinyos are comparing it to the libraries listed below
- Ultra low overhead NVIDIA GPU telemetry plugin for telegraf with memory temperature readings. ☆62 Updated last year
- tiny code to access tenstorrent blackhole ☆57 Updated 2 months ago
- An implementation of delta-iris in tinygrad ☆72 Updated 11 months ago
- PCCL (Prime Collective Communications Library) implements fault tolerant collective communications over IP ☆99 Updated 3 weeks ago
- Modded vLLM to run pipeline parallelism over public networks ☆37 Updated 2 months ago
- SIMD quantization kernels ☆78 Updated this week
- Prepare for DeepSeek R1 inference: Benchmark CPU, DRAM, SSD, iGPU, GPU, ... with efficient code. ☆72 Updated 6 months ago
- Can RL solve simple problems? ☆54 Updated last year
- ☆87 Updated last week
- DeMo: Decoupled Momentum Optimization ☆190 Updated 8 months ago
- ctypes wrappers for HIP, CUDA, and OpenCL ☆130 Updated last year
- Solve puzzles to improve your tinygrad skills! ☆141 Updated 4 months ago
- ☆24 Updated last year
- ☆19 Updated 7 months ago
- PyTorch script hot swap: Change code without unloading your LLM from VRAM ☆126 Updated 3 months ago
- Transformer GPU VRAM estimator ☆66 Updated last year
- The Finite Field Assembly Programming Language ☆36 Updated 2 months ago
- noise_step: Training in 1.58b With No Gradient Memory ☆220 Updated 7 months ago
- an open source reproduction of NVIDIA's nGPT (Normalized Transformer with Representation Learning on the Hypersphere) ☆103 Updated 5 months ago
- Solidity contracts for the decentralized Prime Network protocol ☆24 Updated last month
- Simple Transformer in JAX ☆138 Updated last year
- Inference of Mamba models in pure C ☆189 Updated last year
- PTX-Tutorial Written Purely By AIs (Deep Research of OpenAI and Claude 3.7) ☆66 Updated 4 months ago
- Learning about CUDA by writing PTX code. ☆133 Updated last year
- peer-to-peer compute and intelligence network that enables decentralized AI development at scale ☆96 Updated 2 weeks ago
- Train neural networks that distill into logic circuits, using JAX ☆63 Updated 2 months ago
- Custom PTX Instruction Benchmark ☆126 Updated 5 months ago
- Long context evaluation for large language models ☆220 Updated 5 months ago
- ☆30 Updated 7 months ago
- Tenstorrent's MLIR Based Compiler. We aim to enable developers to run AI on all configurations of Tenstorrent hardware, through an open-s… ☆96 Updated this week