Community maintained hardware plugin for vLLM on AWS Neuron
☆28Mar 20, 2026Updated 3 weeks ago
Alternatives and similar repositories for vllm-neuron
Users that are interested in vllm-neuron are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A Python-based tool, trained on the state-of-the-art Google Pegasus model, specializing in generating abstracts from given YouTube video …☆10Aug 6, 2023Updated 2 years ago
- ☆24Nov 18, 2025Updated 4 months ago
- Auto-tuning for vllm. Getting the best performance out of your LLM deployment (vllm+guidellm+optuna)☆51Mar 17, 2026Updated 3 weeks ago
- Atamai Image Registration and Segmentation☆21Apr 1, 2026Updated 2 weeks ago
- nnvm&tvm example of cross compilation and deployment in Nvidia Jetson TX2 platform☆11Apr 17, 2018Updated 7 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- the symbol description of mobilenet v2☆11Sep 7, 2018Updated 7 years ago
- This is an example implementation of a Jive add-on server built and written in Java with SpringBoot + JPA + ThymeLeaf. For more details …☆11May 2, 2018Updated 7 years ago
- Cancellable sagas for the popular mobx library☆12Apr 19, 2018Updated 7 years ago
- ☆13Updated this week
- The (open-source part of) code to reproduce "BPPSA: Scaling Back-propagation by Parallel Scan Algorithm".☆13Jun 7, 2021Updated 4 years ago
- The Amazon ECR Transfer Plugin for Data Transfer Hub(https://github.com/awslabs/data-transfer-hub). Transfer container images from Amazon…☆13Jan 29, 2025Updated last year
- ☆12May 7, 2017Updated 8 years ago
- My code and notes for "From Day Zero to Zero Day", a book on vulnerability research by Eugene Lim.☆32Nov 10, 2025Updated 5 months ago
- Experimenting with Apache Kafka using kafka-node☆13Dec 10, 2022Updated 3 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Direct volume rendering with WebGL 2.☆12Jan 18, 2024Updated 2 years ago
- Use rethinkDB and Instagram API☆15Nov 30, 2015Updated 10 years ago
- AFGH Proxy re-encryption for ZeroDB☆21Mar 8, 2016Updated 10 years ago
- Deep Learning inference with AWS Lambda and Amazon EFS☆14Aug 24, 2020Updated 5 years ago
- A DMA Controller for RISCV CPUs☆13Aug 10, 2015Updated 10 years ago
- 使用预训练语言模型ALBERT做中文NER☆12Jul 14, 2021Updated 4 years ago
- EVEVALB is a python version of Evalb which is used to score the bracket tree banks.☆16Apr 22, 2019Updated 6 years ago
- Row-wise block scaling for fp8 quantization matrix multiplication. Solution to GPU mode AMD challenge.☆19Feb 9, 2026Updated 2 months ago
- An Example of MXNet Models Comilation and Deployment with NNVM in C++☆16Apr 25, 2018Updated 7 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Example code and helper modules for CS109☆14May 29, 2015Updated 10 years ago
- On-chain generative utilities☆11Sep 30, 2024Updated last year
- Enjoy our curated collection of examples and solutions using unlock protocol.☆15May 31, 2024Updated last year
- A git mirror of zlib releases☆21Feb 27, 2015Updated 11 years ago
- RSpace running on Docker☆12Mar 6, 2026Updated last month
- A light-weight deep learning framework implemented in C++.☆13Apr 20, 2018Updated 7 years ago
- ☆21Dec 16, 2011Updated 14 years ago
- Deployment infrastructure for the Image Data Resource☆14Feb 21, 2026Updated last month
- pydnn: deep neural network library in Python☆15Apr 2, 2015Updated 11 years ago
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Implementation of the Paillier homomorphic cryptosystem using GMP☆29May 18, 2022Updated 3 years ago
- A Contour Tree library☆14Jun 16, 2015Updated 10 years ago
- The RECON project creates library for Nios II Microcontroller System and Tool chain. The library includes a collection of hardware config…☆21Dec 31, 2018Updated 7 years ago
- Script for plotting / visualizing three dimensional arrays.☆25Mar 15, 2015Updated 11 years ago
- ☆23Apr 7, 2026Updated last week
- This, that and the other - things i would like to share☆21Feb 25, 2026Updated last month
- ☆17Apr 28, 2019Updated 6 years ago