Triton backend for managing the model state tensors automatically in sequence batcher
☆16Feb 12, 2024Updated 2 years ago
Alternatives and similar repositories for stateful_backend
Users that are interested in stateful_backend are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆33Jul 7, 2022Updated 3 years ago
- The core library and APIs implementing the Triton Inference Server.☆170Mar 18, 2026Updated last week
- Community Eventing and Scripting examples☆19Aug 11, 2025Updated 7 months ago
- Script for converting kaldi GMM/HMM models to HTK format☆11Jul 18, 2024Updated last year
- Homebrew formulas for installing LLM and related tools☆15Sep 6, 2023Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- API and CLI tool to fetch and query Chome DevTools heap snapshots (Python & Playwright)☆16May 16, 2024Updated last year
- ☆11Dec 11, 2024Updated last year
- PyTriton is a Flask/FastAPI-like interface that simplifies Triton's deployment in Python environments.☆838Aug 13, 2025Updated 7 months ago
- [ICML 2023] SmoothQuant: Accurate and Efficient Post-Training Quantization for Large Language Models☆11Dec 13, 2023Updated 2 years ago
- ☆18May 6, 2023Updated 2 years ago
- A jupyter client for your terminal☆24Jan 3, 2026Updated 2 months ago
- Nsight Compute In Docker☆13Dec 21, 2023Updated 2 years ago
- Fast and memory-efficient exact attention☆21Mar 13, 2026Updated 2 weeks ago
- ☆36Feb 9, 2024Updated 2 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- An agent that can run everywhere - even in your watch!☆30Mar 5, 2026Updated 3 weeks ago
- ☆16Nov 26, 2024Updated last year
- ☆33Jan 30, 2026Updated last month
- Book code for Testing in Scala on O'Reilly☆14May 29, 2014Updated 11 years ago
- Beep the PC speaker☆11Nov 9, 2022Updated 3 years ago
- Student version of Mini-SLAM.☆10Mar 16, 2024Updated 2 years ago
- Provides for deploying custom ETL containers on AIStore, with subsequent user-defined extraction-transformation-loading in parallel, on t…☆19Mar 19, 2026Updated last week
- GenericInjector for win32 programs☆12Jun 19, 2017Updated 8 years ago
- The Triton backend for the ONNX Runtime.☆172Mar 18, 2026Updated last week
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- An in-memory compressed cache for gigabytes of data written in Go.☆19Feb 6, 2023Updated 3 years ago
- Compare Savant and PyTorch performance☆13Feb 9, 2024Updated 2 years ago
- Indirect Supervision for Relation Extraction Using Question-Answer Pairs (WSDM'18)☆24Nov 2, 2017Updated 8 years ago
- ☆18Apr 15, 2024Updated last year
- ☆16Nov 24, 2025Updated 4 months ago
- Simple Event Driven Network Library☆33Nov 7, 2017Updated 8 years ago
- ☆37May 5, 2013Updated 12 years ago
- A simple example of how to bind C++ code in Python☆14Nov 13, 2020Updated 5 years ago
- a performant, embedded database☆20Mar 4, 2025Updated last year
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Typesafe Activator template for advanced play-slick project☆20Jan 16, 2017Updated 9 years ago
- A set of tools that make working with the Scala ecosystem even better.☆12Mar 16, 2026Updated last week
- 🎙️ Automatically transcribe audio/video into high-quality, speaker-specific Text-To-Speech datasets ✨☆17May 20, 2025Updated 10 months ago
- A lightweight reactive RPC-like system built on Akka IO☆45Apr 23, 2015Updated 10 years ago
- This is a fork of SGLang for hip-attention integration. Please refer to hip-attention for detail.☆18Dec 23, 2025Updated 3 months ago
- ☆11Nov 15, 2016Updated 9 years ago
- Docker images for GStreamer☆15May 29, 2018Updated 7 years ago