☆67Mar 28, 2025Updated last year
Alternatives and similar repositories for vllm-docker
Users that are interested in vllm-docker are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆18Aug 19, 2024Updated last year
- Sample solution to automate tedious regulatory compliance processes using multi-agent systems☆24Apr 15, 2025Updated 11 months ago
- Experimenting text-embeddings-inference server on both CPU and GPU☆18Oct 25, 2023Updated 2 years ago
- OpenAI compatible API for open source LLMs☆17Oct 30, 2023Updated 2 years ago
- Graph model execution API for Candle☆17Jul 27, 2025Updated 8 months ago
- NordVPN Special Discount Offer • AdSave on top-rated NordVPN 1 or 2-year plans with secure browsing, privacy protection, and support for for all major platforms.
- vLLM adapter for a TGIS-compatible gRPC server.☆55Mar 23, 2026Updated last week
- The Open-Source Implementation of Cognition AI's Automated Software Engineer, Devin.☆16Mar 13, 2024Updated 2 years ago
- 🚀 LLM inference optimization simulator, modeling compute-bound prefill and memory-bound decode phases.☆14Jul 12, 2025Updated 8 months ago
- 基于 CUDA Driver API 的 cuda 运行时环境☆16Jul 30, 2025Updated 8 months ago
- ☆12Dec 8, 2020Updated 5 years ago
- A high performance batching router optimises max throughput for text inference workload☆16Sep 6, 2023Updated 2 years ago
- Applying Evaluation Driven Development (EDD) to aid in the design decision of RAG pipelines☆31Oct 20, 2023Updated 2 years ago
- AI_Powered_Dev_Search_Engine☆12Mar 10, 2024Updated 2 years ago
- experiments with inference on llama☆103Jun 6, 2024Updated last year
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ITIX's Custom CoreOS build☆10Nov 24, 2020Updated 5 years ago
- Homeworks, Midterm, & Capstone from ML BookCamp☆16Jan 28, 2022Updated 4 years ago
- Get deterministic output in any format like json from any LLM.☆19Apr 25, 2023Updated 2 years ago
- This repo is the central repo for all the RAG Evaluation reference material and partner workshop☆83Apr 25, 2025Updated 11 months ago
- 🎙️ P³: Lightning-fast podcast processing with Apple Silicon optimization and local LLMs. Parakeet MLX transcription + Ollama analysis = …☆28Aug 25, 2025Updated 7 months ago
- A nats micro service interacting with Ollama☆18Jun 30, 2024Updated last year
- ☆14Jul 26, 2019Updated 6 years ago
- ☆12Dec 24, 2024Updated last year
- ☆27Aug 31, 2023Updated 2 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Authenticating HTTP(S) proxy with TCP/IP tunneling and acceleration—mirror of http://svn.awk.cz/cntlm☆15Jan 21, 2013Updated 13 years ago
- kernel spec, config for vanilla kernel rpms from kernel.org☆10Jan 24, 2022Updated 4 years ago
- ☆12Dec 6, 2021Updated 4 years ago
- Can we predict how much health insurance will cost using regression?☆11Dec 21, 2021Updated 4 years ago
- практические занятия, реальные проекты и техники разработки☆16Mar 23, 2025Updated last year
- DHCP Snooping app - great for finding rogue DHCP servers☆35Jan 25, 2018Updated 8 years ago
- Create embeddings for LLM using the Nomic API☆23Nov 21, 2024Updated last year
- A Python wrapper around HuggingFace's TGI (text-generation-inference) and TEI (text-embedding-inference) servers.☆32Sep 19, 2025Updated 6 months ago
- LLM Skirmish☆45Feb 3, 2026Updated last month
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- ZED LiveLink Plugin for Unreal☆39Jan 22, 2026Updated 2 months ago
- how to build a sentence embedding application using BentoML☆14Mar 31, 2025Updated 11 months ago
- ☆14Sep 6, 2023Updated 2 years ago
- ☆13Oct 16, 2020Updated 5 years ago
- bundled swagger-ui pip package☆21Sep 4, 2025Updated 6 months ago
- OpenAI compatible API for TensorRT LLM triton backend☆219Aug 1, 2024Updated last year
- Procedurally generated starfield inside a WebGL shader☆10Sep 4, 2016Updated 9 years ago