asprenger / ray_vllm_inferenceView external linksLinks
A simple service that integrates vLLM with Ray Serve for fast and scalable LLM serving.
☆78Apr 6, 2024Updated last year
Alternatives and similar repositories for ray_vllm_inference
Users that are interested in ray_vllm_inference are comparing it to the libraries listed below
Sorting:
- ☆13May 25, 2023Updated 2 years ago
- Pretrain, finetune and serve LLMs on Intel platforms with Ray☆130Sep 23, 2025Updated 4 months ago
- MLFlow Deployment Plugin for Ray Serve☆46Apr 12, 2022Updated 3 years ago
- A testbed for agents and environments that can automatically improve models through data generation.☆28Mar 4, 2025Updated 11 months ago
- RayLLM - LLMs on Ray (Archived). Read README for more info.☆1,263Mar 13, 2025Updated 11 months ago
- Build Neo4J Knowledge Graphs from Excel files☆22Nov 18, 2024Updated last year
- kapi provides a simplified interface to the controller-runtime library.☆26Aug 20, 2025Updated 5 months ago
- Example scripts and configuration files to install and configure IBM Storage Scale in a Vagrant environment☆26Jan 6, 2026Updated last month
- A user-friendly Command & Control (C&C) web platform for remote monitoring, management, and task automation across multiple devices.☆14Dec 15, 2024Updated last year
- PAL: Predictive Analysis & Laws of Large Language Models☆38Jan 9, 2025Updated last year
- A high-throughput and memory-efficient inference and serving engine for LLMs☆85Updated this week
- Jupyter notebooks of the simulations ran as part of a semester project on "Quantum Reinforcement Learning and Projective Simulation" at T…☆32Feb 22, 2019Updated 6 years ago
- PARADIS, a lightweight and flexible weather forecast model that tries to Keep It Simple.☆25Feb 4, 2026Updated last week
- automated insights for tabular data☆10Feb 10, 2025Updated last year
- ext_mpi_collectives☆11Apr 1, 2025Updated 10 months ago
- COMS 4111 Project 1☆12Jul 21, 2022Updated 3 years ago
- Implementation of QRL☆32Jun 22, 2019Updated 6 years ago
- HPCG benchmark based on ROCm platform☆39Feb 3, 2026Updated last week
- ☆36Apr 30, 2025Updated 9 months ago
- A converter and basic tester for rwkv onnx☆43Jan 29, 2024Updated 2 years ago
- Jest + TS + Yarn with enabled monorepos☆10Jan 8, 2023Updated 3 years ago
- 🤗 LeRobot: Making AI for Robotics more accessible with end-to-end learning☆12Jun 23, 2025Updated 7 months ago
- Magento 2 Tokenized Payment Gateway Module☆11Dec 18, 2025Updated last month
- ☆11Feb 27, 2024Updated last year
- How to build an ACP compliant agent that uses MCP as well!☆11May 6, 2025Updated 9 months ago
- The meat and potatoes behind farosctl☆13Feb 28, 2023Updated 2 years ago
- User Management Application build with Spring Boot, Thymeleaf & MySQL Database☆12Dec 20, 2024Updated last year
- Список open-source проектов для изучения кода☆10Nov 7, 2021Updated 4 years ago
- ☆11Jan 11, 2022Updated 4 years ago
- Visualize machine learning models with Netron in VSCode☆15Nov 23, 2025Updated 2 months ago
- ☆14Jun 10, 2025Updated 8 months ago
- Sequential Parameter Optimization in Python☆14Jan 12, 2026Updated last month
- Performance Counter Reader☆11Sep 14, 2022Updated 3 years ago
- Scrapy抓取豆瓣图书☆10Aug 19, 2016Updated 9 years ago
- DigitalOcean Gradient AI Platform SDK☆16Feb 7, 2026Updated last week
- A web interface for SleekDB written in PHP☆11Jan 22, 2022Updated 4 years ago
- Argonne Leadership Computing Facility OpenCL tutorial☆10Aug 22, 2025Updated 5 months ago
- An Offline and Secure Retrieval-Augmented Generation (RAG) system designed for efficient processing of diverse content types with minimal…☆19Dec 29, 2024Updated last year
- Keyword extraction using Scake, KeyBERT, Fine-tuning Transformer BERT-like models and ChatGPT.☆12May 22, 2023Updated 2 years ago