A high-throughput and memory-efficient inference and serving engine for LLMs
☆30Jun 12, 2026Updated this week
Alternatives and similar repositories for vllm
Users that are interested in vllm are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Tenstorrent Firmware repository☆24Feb 25, 2026Updated 3 months ago
- The Tenstorrent Studio (TT-Studio) is an easy to use web interface for running AI models on Tenstorrent hardware. It handles all the tech…☆48Updated this week
- Tenstorrent Firmware Update Utility☆13Jun 4, 2026Updated 2 weeks ago
- ☆58Updated this week
- RISC-V Directed Test Framework and Compliance Suite, RiESCUE☆66Jun 11, 2026Updated last week
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- ☆14Jun 12, 2026Updated last week
- Repository for AI model benchmarking on TT-Buda☆16Feb 9, 2026Updated 4 months ago
- Open source design for centrifugal air pump for open source ventilator☆11Apr 14, 2020Updated 6 years ago
- Jetson Nano control and vision with ROS2 RealSense2, RPlidar, BNO055, Python, Pygame and ModBus☆12Feb 13, 2021Updated 5 years ago
- ☆16Jul 24, 2023Updated 2 years ago
- The TT-Forge ONNX is a graph compiler designed to optimize and transform computational graphs for deep learning models, enhancing their p…☆64Updated this week
- ROS 2 Package to Publish Camera Image as sensor_msgs/Image message. Compatible with Raspberry Pi 64 Bit OS. ROS cv_bridge package is not …☆15Apr 1, 2021Updated 5 years ago
- ☆14May 6, 2019Updated 7 years ago
- ☆11Mar 13, 2022Updated 4 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Tenstorrent Topology (TT-Topology) is a command line utility used to flash multiple NB cards on a system to use specific eth routing conf…☆16Jun 11, 2026Updated last week
- Visual Computing Library☆20Jan 14, 2026Updated 5 months ago
- Extracts addresses from .pbf files to .csv files☆12Apr 13, 2015Updated 11 years ago
- Predicted a stock price close of a day based on the last 7 day’s time series data using Neural Network, LSTM and CNN. Found the best numb…☆11Feb 1, 2020Updated 6 years ago
- ☆16Oct 8, 2019Updated 6 years ago
- Pure Python webserver to serve firmware binary files for Arduino (or ESP, ...) OTA projects.☆14Dec 16, 2018Updated 7 years ago
- Anduril's Lattice Rust SDK☆22Jul 24, 2025Updated 10 months ago
- torchtrail: trace the graph of torch functions and modules for visualization, reports, etc☆25May 25, 2025Updated last year
- Tenstorrent Kernel Module☆65Jun 11, 2026Updated last week
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- TensorFlow implementation of LSTNet model for multivariate time series forecasting.☆14Jun 18, 2024Updated 2 years ago
- Perf monitoring CLI tool for Apple Silicon☆16Jan 1, 2024Updated 2 years ago
- Tenstorrent console based hardware information program☆61Updated this week
- A repository for examples of all kinds of OCL (Object Constraint Language) expressions☆25Dec 29, 2016Updated 9 years ago
- Python code implementing the piecewise segmentation of a signal given in input. Three main algorithms (sliding windows, top down and bott…☆10Mar 15, 2016Updated 10 years ago
- GPS & IMU data to predict Lat, Long using Kalman Prediction.☆16Apr 26, 2019Updated 7 years ago
- 简单搜索引擎,实现了拼写检查、倒排索引 、文档排序☆19May 7, 2019Updated 7 years ago
- MEDCoupling is a versatile data manipulation library for handling meshes and fields in numerical simulation codes using med files☆22May 19, 2026Updated 3 weeks ago
- Frontend integration for PyTorch with tt-mlir☆25Mar 2, 2026Updated 3 months ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Welcome to BuildingControlLib, a Modelica library for modelling and simulation of standardized and non-standardized control functions fro…☆13Jan 19, 2018Updated 8 years ago
- Xsemantics is a DSL (implemented in Xtext itself) for writing type systems, reduction rules, interpreters (and in general relation rules)…☆35Feb 5, 2026Updated 4 months ago
- Repository of model demos using TT-Buda☆64Apr 2, 2025Updated last year
- This project is an implementation of the paper Modeling Long- and Short-Term Temporal Patterns with Deep Neural Networks. The model LSTNe…☆17May 1, 2019Updated 7 years ago
- A basic sensor fusion performed on GPS and Inertial measurement data☆22Nov 21, 2018Updated 7 years ago
- ☆40Mar 14, 2024Updated 2 years ago
- The Agentuity Cloud Platform Tooling 🤖☆21Jan 9, 2026Updated 5 months ago