tinygrad / tinyosLinks
☆39Updated last week
Alternatives and similar repositories for tinyos
Users that are interested in tinyos are comparing it to the libraries listed below
Sorting:
- An implementation of delta-iris in tinygrad☆72Updated last year
- SIMD quantization kernels☆87Updated last month
- Ultra low overhead NVIDIA GPU telemetry plugin for telegraf with memory temperature readings.☆63Updated last year
- Modded vLLM to run pipeline parallelism over public networks☆39Updated 4 months ago
- Can RL solve simple problems?☆54Updated last year
- PCCL (Prime Collective Communications Library) implements fault tolerant collective communications over IP☆126Updated 3 weeks ago
- tiny code to access tenstorrent blackhole☆59Updated 4 months ago
- an open source reproduction of NVIDIA's nGPT (Normalized Transformer with Representation Learning on the Hypersphere)☆105Updated 7 months ago
- Simple Transformer in Jax☆139Updated last year
- ☆24Updated last year
- ctypes wrappers for HIP, CUDA, and OpenCL☆130Updated last year
- A really tiny autograd engine☆95Updated 4 months ago
- Learning about CUDA by writing PTX code.☆138Updated last year
- noise_step: Training in 1.58b With No Gradient Memory☆221Updated 9 months ago
- Quantized LLM training in pure CUDA/C++.☆180Updated this week
- 👷 Build compute kernels☆155Updated this week
- This repository contain the simple llama3 implementation in pure jax.☆70Updated 7 months ago
- ☆21Updated 9 months ago
- NanoGPT-speedrunning for the poor T4 enjoyers☆72Updated 5 months ago
- Prepare for DeekSeek R1 inference: Benchmark CPU, DRAM, SSD, iGPU, GPU, ... with efficient code.☆73Updated 8 months ago
- PTX-Tutorial Written Purely By AIs (Deep Research of Openai and Claude 3.7)☆66Updated 6 months ago
- Standalone commandline CLI tool for compiling Triton kernels☆18Updated last year
- Solve puzzles to improve your tinygrad skills!☆144Updated 6 months ago
- RDNA3 emulator☆54Updated 5 months ago
- DeMo: Decoupled Momentum Optimization☆193Updated 10 months ago
- look how they massacred my boy☆63Updated 11 months ago
- ☆93Updated last week
- Write a fast kernel and run it on Discord. See how you compare against the best!☆58Updated last week
- The Prime Intellect CLI provides a powerful command-line interface for managing GPU resources across various providers☆97Updated this week
- in this repository, i'm going to implement increasingly complex llm inference optimizations☆68Updated 4 months ago