tenstorrent / tt-forge
Tenstorrent's MLIR-based compiler. We aim to enable developers to run AI on all configurations of Tenstorrent hardware, through an open-source, general, and performant compiler.
☆12 Updated this week
Alternatives and similar repositories for tt-forge:
Users who are interested in tt-forge are comparing it to the libraries listed below.
- Attention in SRAM on Tenstorrent Grayskull ☆32 Updated 8 months ago
- High-Performance SGEMM on CUDA devices ☆87 Updated 2 months ago
- LLM training in simple, raw C/CUDA ☆18 Updated 10 months ago
- asynchronous/distributed speculative evaluation for llama3 ☆39 Updated 7 months ago
- Tensor library with autograd using only Rust's standard library ☆67 Updated 9 months ago
- Learning about CUDA by writing PTX code. ☆125 Updated last year
- A minimalistic C++ Jinja templating engine for LLM chat templates ☆128 Updated this week
- Super fast FP32 matrix multiplication on RDNA3 ☆30 Updated 2 months ago
- LLM training in simple, raw C/CUDA ☆92 Updated 11 months ago
- ☆12 Updated 4 months ago
- Graph model execution API for Candle ☆13 Updated 4 months ago
- GGML implementation of BERT model with Python bindings and quantization. ☆25 Updated last year
- TT-Studio: An all-in-one platform to deploy and manage AI models optimized for Tenstorrent hardware with dedicated front-end demo applic… ☆14 Updated this week
- Implement Neural Networks in Cuda from Scratch ☆22 Updated 10 months ago
- An implementation of the transformer architecture onto an Nvidia CUDA kernel ☆177 Updated last year
- ☆47 Updated this week
- Tensor library for Zig ☆11 Updated 4 months ago
- Schola is a plugin for enabling Reinforcement Learning (RL) in Unreal Engine. It provides tools to help developers create environments, d… ☆33 Updated last month
- Profile your CoreML models directly from Python 🐍 ☆27 Updated 5 months ago
- PTX-Tutorial Written Purely By AIs (Deep Research of OpenAI and Claude 3.7) ☆62 Updated last week
- ☆54 Updated 9 months ago
- An implementation of delta-iris in tinygrad ☆72 Updated 7 months ago
- pytorch from scratch in pure C/CUDA and python ☆40 Updated 5 months ago
- This repository is a read-only mirror of https://gitlab.arm.com/kleidi/kleidiai ☆26 Updated this week
- ☆28 Updated 2 months ago
- ☆20 Updated last month
- Lightweight Llama 3 8B Inference Engine in CUDA C ☆47 Updated last week
- Experimental GPU language with meta-programming ☆22 Updated 6 months ago
- ☆17 Updated 2 weeks ago
- ☆12 Updated 9 months ago