☆47Mar 27, 2026Updated this week
Alternatives and similar repositories for tt-inference-server
Users that are interested in tt-inference-server are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- The TT-Forge ONNX is a graph compiler designed to optimize and transform computational graphs for deep learning models, enhancing their p…☆54Updated this week
- TT-Studio : An all-in-one platform to deploy and manage AI models optimized for Tenstorrent hardware with dedicated front-end demo applic…☆40Updated this week
- Repository for AI model benchmarking on TT-Buda☆16Feb 9, 2026Updated last month
- A high-throughput and memory-efficient inference and serving engine for LLMs☆27Updated this week
- Tenstorrent Firmware repository☆24Feb 25, 2026Updated last month
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- Tenstorrent console based hardware information program☆60Mar 17, 2026Updated last week
- Tenstorrent Topology (TT-Topology) is a command line utility used to flash multiple NB cards on a system to use specific eth routing conf…☆16Feb 26, 2026Updated last month
- Boltz-2 implementation for inference on Tenstorrent hardware☆78Updated this week
- ☆37Updated this week
- Frontend integration for PyTorch with tt-mlir☆23Mar 2, 2026Updated 3 weeks ago
- TT-NN operator library, and TT-Metalium low level kernel programming model.☆1,388Updated this week
- ☆92Mar 16, 2026Updated last week
- 가짜연구소 9/10기 '스페셜한 Spatial AI' 스터디 레포입니다.☆32Jun 24, 2025Updated 9 months ago
- Repo for AI Compiler team. The intended purpose of this repo is for implementation of a PJRT device.☆56Updated this week
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- Tenstorrent Kernel Module☆61Updated this week
- Tenstorrent MLIR compiler☆254Updated this week
- System firmware for Tenstorrent hardware☆33Updated this week
- A comprehensive tool for visualizing and analyzing model execution, offering interactive graphs, memory plots, tensor details, buffer ove…☆48Updated this week
- Tenstorrent's MLIR Based Compiler. We aim to enable developers to run AI on all configurations of Tenstorrent hardware, through an open-s…☆193Updated this week
- TVM for Tenstorrent ASICs☆28Sep 8, 2025Updated 6 months ago
- Extracts addresses from .pbf files to .csv files☆12Apr 13, 2015Updated 10 years ago
- Perf monitoring CLI tool for Apple Silicon☆16Jan 1, 2024Updated 2 years ago
- Awesome Resources about MegEngine☆16Mar 2, 2023Updated 3 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Attention in SRAM on Tenstorrent Grayskull☆39Jul 18, 2024Updated last year
- PolyLib official git.☆11Jan 27, 2026Updated 2 months ago
- MLIR-based partitioning system☆174Updated this week
- Musings in GEMM (General Matrix Multiplication)☆14Dec 14, 2025Updated 3 months ago
- A python package to create flask project with svelte frontend.☆17Jan 20, 2024Updated 2 years ago
- Full text indexing of syslog messages with solr☆22Sep 5, 2011Updated 14 years ago
- A fast full-system simulator of Tenstorrent hardware☆44Mar 20, 2026Updated last week
- 在PyTorch上重构multi-agent deep deterministic policy gradient(MADDPG),将https://github.com/xuemei-ye/maddpg-mpe 修改到自己电脑上可运行。因为本人笔记本没有CUDA,实验速度…☆14May 10, 2019Updated 6 years ago
- A light wrapper around tmux to manage sessions☆30Mar 26, 2013Updated 13 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- ☆14Mar 17, 2026Updated last week
- This repository contains the models and training scripts used in the papers: "Quantizing Spiking Neural Networks with Integers" (ICONS 20…☆13Oct 20, 2020Updated 5 years ago
- API for parse.com in node.js☆44Aug 16, 2016Updated 9 years ago
- [NeurIPS 2020] "FracTrain: Fractionally Squeezing Bit Savings Both Temporally and Spatially for Efficient DNN Training" by Yonggan Fu, Ha…☆10Feb 13, 2022Updated 4 years ago
- TK8 provisioner for using Terraform Provider Rancher2 with TK8☆12Feb 21, 2020Updated 6 years ago
- Retargetable ML compilers for the twenty-first century!☆13Apr 22, 2025Updated 11 months ago
- RISC-V Directed Test Framework and Compliance Suite, RiESCUE☆60Mar 20, 2026Updated last week