tinygrad / 7900xtx
☆382 · Updated this week
Related projects
Alternatives and complementary repositories for 7900xtx
- Nvidia Instruction Set Specification Generator · ☆215 · Updated 4 months ago
- Reverse engineered Linux driver for the Apple Neural Engine (ANE). · ☆368 · Updated 8 months ago
- ctypes wrappers for HIP, CUDA, and OpenCL · ☆126 · Updated 4 months ago
- Because tinygrad got out of hand with line count · ☆146 · Updated last month
- It's a core. Made on Twitch. · ☆251 · Updated 3 years ago
- Run 64-bit Linux on LiteX + RocketChip · ☆188 · Updated 3 months ago
- NVIDIA Linux open GPU with P2P support · ☆916 · Updated 5 months ago
- parallelized hyperdimensional tictactoe · ☆110 · Updated 2 months ago
- If tinygrad wasn't small enough for you... · ☆654 · Updated 8 months ago
- A minimal Tensor Processing Unit (TPU) inspired by Google's TPUv1. · ☆117 · Updated 3 months ago
- Enabling tinygrad compatibility with the Google Edge TPU · ☆75 · Updated 2 months ago
- Sniff CUDA ioctls · ☆179 · Updated last year
- Apple GPU microarchitecture · ☆474 · Updated 2 months ago
- Solve puzzles to improve your tinygrad skills! · ☆87 · Updated 2 months ago
- Apple G13 GPU architecture docs and tools · ☆551 · Updated 6 months ago
- An implementation of delta-iris in tinygrad · ☆71 · Updated 3 months ago
- ☆1,003 · Updated last month
- TT-NN operator library, and TT-Metalium low level kernel programming model. · ☆478 · Updated this week
- Deep learning accelerator architectures requiring half the multipliers · ☆263 · Updated 7 months ago
- Scripts and environment for the tinybox · ☆92 · Updated 7 months ago
- ☆52 · Updated 5 months ago
- Apple AMX Instruction Set · ☆995 · Updated 5 months ago
- pytorch from scratch in pure C/CUDA and python · ☆37 · Updated last month
- Richard is gaining power · ☆176 · Updated 3 months ago
- ☆234 · Updated 8 months ago
- Exploring the scalable matrix extension of the Apple M4 processor · ☆135 · Updated 2 weeks ago
- High-Performance FP32 Matrix Multiplication on CPU · ☆301 · Updated last week
- RDNA3 emulator · ☆46 · Updated last week
- could we make an ml stack in 100,000 lines of code? · ☆26 · Updated 4 months ago
- ☆224 · Updated last month