☆68Mar 28, 2025Updated last year
Alternatives and similar repositories for vllm-docker
Users that are interested in vllm-docker are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆18Aug 19, 2024Updated last year
- Experimenting text-embeddings-inference server on both CPU and GPU☆18Oct 25, 2023Updated 2 years ago
- vLLM client with minimal dependencies☆15Feb 28, 2024Updated 2 years ago
- OpenAI compatible API for open source LLMs☆17Oct 30, 2023Updated 2 years ago
- LLMs as Collaboratively Edited Knowledge Bases☆50Feb 8, 2026Updated 4 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- vLLM adapter for a TGIS-compatible gRPC server.☆55Updated this week
- The Open-Source Implementation of Cognition AI's Automated Software Engineer, Devin.☆16Mar 13, 2024Updated 2 years ago
- 🚀 LLM inference optimization simulator, modeling compute-bound prefill and memory-bound decode phases.☆14Jul 12, 2025Updated 11 months ago
- 基于 CUDA Driver API 的 cuda 运行时环境☆16Jul 30, 2025Updated 10 months ago
- ☆12Dec 8, 2020Updated 5 years ago
- ☆28May 3, 2023Updated 3 years ago
- A high performance batching router optimises max throughput for text inference workload☆16Sep 6, 2023Updated 2 years ago
- Applying Evaluation Driven Development (EDD) to aid in the design decision of RAG pipelines☆31Oct 20, 2023Updated 2 years ago
- AI_Powered_Dev_Search_Engine☆12Mar 10, 2024Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- experiments with inference on llama☆103Jun 6, 2024Updated 2 years ago
- Prediction of the activity of molecules/ligands that have been tested to bind or not bind to Beta-Lactamases using machine learning cl…☆10Mar 5, 2026Updated 3 months ago
- This repo is the central repo for all the RAG Evaluation reference material and partner workshop☆88Apr 25, 2025Updated last year
- ☆10Jun 30, 2022Updated 3 years ago
- A high-performance Rust implementation of the Hermes-Agent orchestration loop for LLM-driven tool execution.☆44May 25, 2026Updated 3 weeks ago
- ☆12Dec 6, 2021Updated 4 years ago
- A Python wrapper around HuggingFace's TGI (text-generation-inference) and TEI (text-embedding-inference) servers.☆32Sep 19, 2025Updated 9 months ago
- Create embeddings for LLM using the Nomic API☆23Nov 21, 2024Updated last year
- bundled swagger-ui pip package☆21Sep 4, 2025Updated 9 months ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Fullstack Reddit Clone Made With React+Redux & Django☆12Sep 26, 2023Updated 2 years ago
- OpenAI compatible API for TensorRT LLM triton backend☆221Aug 1, 2024Updated last year
- ☆24Jun 18, 2025Updated last year
- The driver for LMCache core to run in vLLM☆67Feb 4, 2025Updated last year
- ☆12Jun 17, 2025Updated last year
- 修改 pyTranscriber 得到 cli 版的 mypyTranscriber☆11Jun 9, 2021Updated 5 years ago
- ☆14May 21, 2021Updated 5 years ago
- Python Binding for Rust WhatLang, a language detection library☆14Jan 5, 2024Updated 2 years ago
- The API Gateway & Microservice Management Layer, built on NGINX☆11Jul 5, 2018Updated 7 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Implementations of transformer models in pytorch☆14Jun 2, 2020Updated 6 years ago
- auto-rust is an experimental project that automatically generate Rust code with LLM (Large Language Models) during compilation, utilizing…☆51Nov 12, 2024Updated last year
- Learn Agentic AI using CrewAI, LangChain, LangGraph, and Knowledge Graphs.☆12Feb 19, 2025Updated last year
- Examples of demo deployment using Gradio. Image Classification, Live Webcam Segmentation, APIs , Tunneling etc.☆17Oct 17, 2022Updated 3 years ago
- A2AMCP is a Agent2Agent MCP communication Server taking the concept from Google's Agent2Agent Protocol (A2A)☆19Jun 9, 2025Updated last year
- A library for web scraping, inspired by MetaInspector☆11May 29, 2017Updated 9 years ago
- A set of methods for finding an appropriate number of topics in a text collection☆15Apr 13, 2026Updated 2 months ago