aws-neuron/upstreaming-to-vllm

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/aws-neuron/upstreaming-to-vllm)

aws-neuron / upstreaming-to-vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

☆25

Alternatives and similar repositories for upstreaming-to-vllm

Users that are interested in upstreaming-to-vllm are comparing it to the libraries listed below

Sorting:

aws-neuron / nki-llama
View on GitHub
Project showing how to develop NKI kernels for Llama 3.2 1B inference
☆21May 29, 2025Updated 9 months ago
aws-neuron / transformers-neuronx
View on GitHub
☆110Jan 16, 2025Updated last year
NLPInBLCU / Tutorials-for-Freshmen
View on GitHub
北语 246 实验室新生简明指南
☆10May 30, 2022Updated 3 years ago
aws-samples / sample-tech-for-trading
View on GitHub
☆22Feb 11, 2026Updated 3 weeks ago
vllm-project / dashboard
View on GitHub
vLLM performance dashboard
☆42Apr 26, 2024Updated last year
mariekemeelen / actib
View on GitHub
This repository will soon contain all scripts and links to the annotated corpora of Tibetan.
☆13Feb 4, 2025Updated last year
rpanchyk / mt5-fvg-ind
View on GitHub
Forex Fair Value Gap Indicator for MT5
☆13Dec 11, 2024Updated last year
huggingface / optimum-neuron
View on GitHub
Training and inference on AWS Trainium and Inferentia chips.
☆261Updated this week
Tencent-Hunyuan / Hunyuan-4B
View on GitHub
☆17Aug 5, 2025Updated 6 months ago
Minju-nimm / MIT_PJT
View on GitHub
어린이를 위한 동화 제작 서비스, My AI Fairy-Tale
☆11Apr 7, 2023Updated 2 years ago
Cindyalifia / face-mask-detection
View on GitHub
Face Mask Detection using OpenCV and caffee to face detection
☆12Jun 28, 2020Updated 5 years ago
fridgelock-lkm / fridgelock
View on GitHub
A proof-of-concept implementation of suspend time memory encryption.
☆10Feb 26, 2020Updated 6 years ago
flathub / com.axosoft.GitKraken
View on GitHub
☆11Feb 19, 2026Updated last week
allenai / decon
View on GitHub
decontamination
☆26Dec 3, 2025Updated 3 months ago
zenn-dev / google-cloud-workshop-202311-xenn
View on GitHub
Google Cloud の Cloud Run で架空のWebアプリ Xenn を構築するハンズオン資料です
☆12Dec 6, 2024Updated last year
openSUSE / fde-tools
View on GitHub
Tools for controlling full disk encryption
☆14Jan 30, 2026Updated last month
aws-solutions-library-samples / guidance-for-scalable-model-inference-and-agentic-ai-on-amazon-eks
View on GitHub
Comprehensive, scalable ML inference architecture using Amazon EKS, leveraging Graviton processors for cost-effective CPU-based inference…
☆21Feb 14, 2026Updated 2 weeks ago
foundation-model-stack / fms-acceleration
View on GitHub
🚀 Collection of libraries used with fms-hf-tuning to accelerate fine-tuning and training of large models.
☆13Jan 30, 2026Updated last month
Wojtab / minigpt-4-pipeline
View on GitHub
☆16Jun 6, 2023Updated 2 years ago
metterian / korean_bert_score
View on GitHub
BERT score for text generation
☆12Jan 15, 2025Updated last year
twjackysu / TWSEMCPServer
View on GitHub
台灣證交所OpenAPI 的 MCP Server
☆40Feb 8, 2026Updated 3 weeks ago
zhangir-azerbayev / MetaMath
View on GitHub
☆11Oct 11, 2023Updated 2 years ago
aws-samples / end-2-end-3d-ml
View on GitHub
This repository features Amazon SageMaker Ground Truth and explains how to ingest raw 3D point cloud data, label it, train a 3D object de…
☆13Jun 23, 2022Updated 3 years ago
Gopi-Durgaprasad / ZINDI-GIZ-NLP-Agricultural-Keyword-Spotter-3rd-place-solution
View on GitHub
ZINDI GIZ NLP Agricultural Keyword Spotter 3rd place solution, Audio Classification
☆11Sep 8, 2021Updated 4 years ago
openSUSE / paste-o-o
View on GitHub
The new software behind openSUSE Paste
☆22Oct 2, 2025Updated 5 months ago
kubvernor / kubvernor
View on GitHub
Kubernetes Gateway API implementation in Rust
☆23Updated this week
camenduru / LLaVA-OneVision-jupyter
View on GitHub
☆13Aug 12, 2024Updated last year
alphacep / sherpa-onnx
View on GitHub
Real-time speech recognition using next-gen Kaldi with onnxruntime without Internet connection. Support embedded systems, Android, iOS, R…
☆10Jan 29, 2026Updated last month
huggingface / llm-course
View on GitHub
A course on building Large Language Models
☆11Mar 24, 2025Updated 11 months ago
stefan-woerner / cvar_quantum_optimization
View on GitHub
Supporting material for https://arxiv.org/abs/1907.04769
☆12Sep 20, 2021Updated 4 years ago
sambanova / tutorials
View on GitHub
☆13Apr 30, 2024Updated last year
bigcode-project / opt-out-v2
View on GitHub
Repository for opt-out requests.
☆10Mar 25, 2024Updated last year
causalNLP / amr_llm
View on GitHub
This repo explores how AMR to address tasks difficult for LLMs
☆13Jan 15, 2024Updated 2 years ago
DataXujing / Bert_TensorRT
View on GitHub
Bert TensorRT模型加速部署
☆10Apr 1, 2022Updated 3 years ago
r-ccs-cms / sbd
View on GitHub
☆20Feb 5, 2026Updated 3 weeks ago
secureblue / secureblue.dev
View on GitHub
secureblue's static website
☆18Updated this week
vtalpaert / pytorch-feudal-network
View on GitHub
Pytorch implementation of [Feudal Net](https://arxiv.org/abs/1703.01161). ([Tensorflow version](https://github.com/dmakian/feudal_networ…
☆17Jun 25, 2019Updated 6 years ago
tomo835g / Deep-Learning-to-find-Superconductors
View on GitHub
Deep-Learning-to-find-Superconductors
☆12Jan 13, 2021Updated 5 years ago
liouvill / PgDMM
View on GitHub
Physics-guided Deep Markov Models
☆13May 24, 2022Updated 3 years ago