Community maintained hardware plugin for vLLM on AWS Neuron
☆31May 28, 2026Updated 2 weeks ago
Alternatives and similar repositories for vllm-neuron
Users that are interested in vllm-neuron are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A Python-based tool, trained on the state-of-the-art Google Pegasus model, specializing in generating abstracts from given YouTube video …☆10Aug 6, 2023Updated 2 years ago
- ☆62Jun 2, 2026Updated last week
- ☆24Nov 18, 2025Updated 6 months ago
- Auto-tuning for vllm. Getting the best performance out of your LLM deployment (vllm+guidellm+optuna)☆57Mar 17, 2026Updated 2 months ago
- Atamai Image Registration and Segmentation☆22Apr 1, 2026Updated 2 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆13Jul 13, 2025Updated 11 months ago
- nnvm&tvm example of cross compilation and deployment in Nvidia Jetson TX2 platform☆11Apr 17, 2018Updated 8 years ago
- the symbol description of mobilenet v2☆11Sep 7, 2018Updated 7 years ago
- This is an example implementation of a Jive add-on server built and written in Java with SpringBoot + JPA + ThymeLeaf. For more details …☆11May 2, 2018Updated 8 years ago
- Cancellable sagas for the popular mobx library☆12Apr 19, 2018Updated 8 years ago
- ☆13Updated this week
- The (open-source part of) code to reproduce "BPPSA: Scaling Back-propagation by Parallel Scan Algorithm".☆13Jun 7, 2021Updated 5 years ago
- The Amazon ECR Transfer Plugin for Data Transfer Hub(https://github.com/awslabs/data-transfer-hub). Transfer container images from Amazon…☆13Jan 29, 2025Updated last year
- ☆12May 7, 2017Updated 9 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- My code and notes for "From Day Zero to Zero Day", a book on vulnerability research by Eugene Lim.☆32Nov 10, 2025Updated 7 months ago
- Experimenting with Apache Kafka using kafka-node☆12Dec 10, 2022Updated 3 years ago
- Direct volume rendering with WebGL 2.☆12Jun 8, 2026Updated last week
- Use rethinkDB and Instagram API☆15Nov 30, 2015Updated 10 years ago
- Deep Learning inference with AWS Lambda and Amazon EFS☆14Aug 24, 2020Updated 5 years ago
- A DMA Controller for RISCV CPUs☆13Aug 10, 2015Updated 10 years ago
- AFGH Proxy re-encryption for ZeroDB☆21Mar 8, 2016Updated 10 years ago
- 使用预训练语言模型ALBERT做中文NER☆12Jul 14, 2021Updated 4 years ago
- Row-wise block scaling for fp8 quantization matrix multiplication. Solution to GPU mode AMD challenge.☆19Feb 9, 2026Updated 4 months ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- EVEVALB is a python version of Evalb which is used to score the bracket tree banks.☆16Apr 22, 2019Updated 7 years ago
- An Example of MXNet Models Comilation and Deployment with NNVM in C++☆16Apr 25, 2018Updated 8 years ago
- Example code and helper modules for CS109☆14May 29, 2015Updated 11 years ago
- On-chain generative utilities☆11Sep 30, 2024Updated last year
- Enjoy our curated collection of examples and solutions using unlock protocol.☆15May 31, 2024Updated 2 years ago
- A git mirror of zlib releases☆21Feb 27, 2015Updated 11 years ago
- RSpace running on Docker☆12Updated this week
- Complete transformer from scratch, using only numpy☆47Aug 27, 2024Updated last year
- ☆21Dec 16, 2011Updated 14 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- A light-weight deep learning framework implemented in C++.☆13Apr 20, 2018Updated 8 years ago
- ☆19Oct 3, 2023Updated 2 years ago
- Deployment infrastructure for the Image Data Resource☆14Feb 21, 2026Updated 3 months ago
- pydnn: deep neural network library in Python☆15Apr 2, 2015Updated 11 years ago
- A Contour Tree library☆14Jun 16, 2015Updated 10 years ago
- The RECON project creates library for Nios II Microcontroller System and Tool chain. The library includes a collection of hardware config…☆21Dec 31, 2018Updated 7 years ago
- Script for plotting / visualizing three dimensional arrays.☆25Mar 15, 2015Updated 11 years ago