Triton backend for managing the model state tensors automatically in sequence batcher
☆17Feb 12, 2024Updated 2 years ago
Alternatives and similar repositories for stateful_backend
Users that are interested in stateful_backend are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- TRITONCACHE implementation of a Redis cache☆17Updated this week
- Triton Model Navigator is an inference toolkit designed for optimizing and deploying Deep Learning models with a focus on NVIDIA GPUs.☆222May 27, 2026Updated 2 weeks ago
- Homebrew formulas for installing LLM and related tools☆14Sep 6, 2023Updated 2 years ago
- A Kubernetes operator for managing Prefect servers and work pools☆17Jun 8, 2026Updated last week
- ☆11Dec 11, 2024Updated last year
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- lshash for python3☆10Mar 21, 2018Updated 8 years ago
- This repository corresponds to the PICCO compiler for secure multi-party computation published in 2013 with more recent efficiency improv…☆12Apr 30, 2026Updated last month
- PyTriton is a Flask/FastAPI-like interface that simplifies Triton's deployment in Python environments.☆844Aug 13, 2025Updated 10 months ago
- An HTTP proxy that naively injects NTLM data for the current user into outgoing requests☆14Nov 14, 2018Updated 7 years ago
- [ICML 2023] SmoothQuant: Accurate and Efficient Post-Training Quantization for Large Language Models☆11Dec 13, 2023Updated 2 years ago
- A lock-free, thread-safe, multi-producer/multi-consumer queue based on the LMAX Disruptor.☆16Apr 20, 2021Updated 5 years ago
- Arxiv + Notion Sync☆20May 12, 2025Updated last year
- Nsight Compute In Docker☆13Dec 21, 2023Updated 2 years ago
- ☆36Feb 9, 2024Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Fast and memory-efficient exact attention☆21Apr 10, 2026Updated 2 months ago
- ☆17Nov 26, 2024Updated last year
- Beep the PC speaker☆11Nov 9, 2022Updated 3 years ago
- Noncanonical (but only existing) repo for the pijnu PEG parser☆25Jul 30, 2011Updated 14 years ago
- ☆10Mar 16, 2023Updated 3 years ago
- convert PubLayNet data into METS/PAGE-XML☆10Mar 17, 2020Updated 6 years ago
- ☆12May 22, 2022Updated 4 years ago
- Official Implementation of "Transferring Inductive Biases Through Knowledge Distillation"☆15Jun 3, 2020Updated 6 years ago
- The Triton backend for the ONNX Runtime.☆177Updated this week
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Compare Savant and PyTorch performance☆13Feb 9, 2024Updated 2 years ago
- ☆18Apr 15, 2024Updated 2 years ago
- ICP implementation in Rust☆17Jun 27, 2024Updated last year
- ☆16Nov 24, 2025Updated 6 months ago
- Simple Event Driven Network Library☆33Nov 7, 2017Updated 8 years ago
- Beep, as an ALSA MIDI device☆14Mar 29, 2021Updated 5 years ago
- ☆37May 5, 2013Updated 13 years ago
- a performant, embedded database☆20Mar 4, 2025Updated last year
- Typesafe Activator template for advanced play-slick project☆20Jan 16, 2017Updated 9 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- A set of tools that make working with the Scala ecosystem even better.☆13Updated this week
- A lightweight reactive RPC-like system built on Akka IO☆45Apr 23, 2015Updated 11 years ago
- This is a fork of SGLang for hip-attention integration. Please refer to hip-attention for detail.☆18Mar 31, 2026Updated 2 months ago
- Android app to be used with the 5x5 Stronglifts strength training program.☆10Mar 21, 2018Updated 8 years ago
- A fork of the PEFT library, supporting Robust Adaptation (RoSA)☆15Aug 16, 2024Updated last year
- Universal Character Recognizer (UCR): Simple, Intuitive, Extensible, Multi-Lingual OCR engine☆15Apr 23, 2021Updated 5 years ago
- A simple wrapper of an IO computation to show the used CPU time.☆17Mar 14, 2025Updated last year