β67Mar 28, 2025Updated 11 months ago
Alternatives and similar repositories for vllm-docker
Users that are interested in vllm-docker are comparing it to the libraries listed below
Sorting:
- β18Aug 19, 2024Updated last year
- π LLM inference optimization simulator, modeling compute-bound prefill and memory-bound decode phases.β13Jul 12, 2025Updated 7 months ago
- OpenAI compatible API for open source LLMsβ16Oct 30, 2023Updated 2 years ago
- vLLM adapter for a TGIS-compatible gRPC server.β55Updated this week
- vLLM client with minimal dependenciesβ15Feb 28, 2024Updated 2 years ago
- Experimenting text-embeddings-inference server on both CPU andΒ GPUβ18Oct 25, 2023Updated 2 years ago
- A high performance batching router optimises max throughput for text inference workloadβ16Sep 6, 2023Updated 2 years ago
- A curated list of awesome papers about utilizing large language models for ranking.β31Oct 30, 2024Updated last year
- IBM development fork of https://github.com/huggingface/text-generation-inferenceβ63Sep 18, 2025Updated 5 months ago
- Applying Evaluation Driven Development (EDD) to aid in the design decision of RAG pipelinesβ31Oct 20, 2023Updated 2 years ago
- This repo is the central repo for all the RAG Evaluation reference material and partner workshopβ82Apr 25, 2025Updated 10 months ago
- A Python wrapper around HuggingFace's TGI (text-generation-inference) and TEI (text-embedding-inference) servers.β32Sep 19, 2025Updated 5 months ago
- π A deep-dive into HyDE for Advanced LLM RAG + π‘ Introducing AutoHyDE, a semi-supervised framework to improve the effectiveness, coveraβ¦β34Mar 26, 2024Updated last year
- Informative Conversational Query Rewritingβ37Jan 29, 2024Updated 2 years ago
- Compile-time string encryption and import obfuscation for Windows PE32(+) binariesβ16Jan 18, 2026Updated last month
- COMS 4111 Project 1β12Jul 21, 2022Updated 3 years ago
- fine-tuning tutorialβ18Feb 20, 2026Updated 2 weeks ago
- User-friendly viewer for Parquet filesβ10Updated this week
- DOS Program Developmentβ13Nov 9, 2022Updated 3 years ago
- A domain-specific language (DSL) based on Triton but providing higher-level abstractions.β41Updated this week
- The MessageBroker is a typescript library for providing asynchronous communication throughout your appβ20Updated this week
- The official github repo for the open online courses: "Dive into LLMs".β10Mar 15, 2024Updated last year
- Protocol buffers and other common resources.β13Mar 2, 2026Updated last week
- LightGBM for handling label-imbalanced data with focal and weighted loss functions in binary and multiclass classificationβ21Jan 29, 2026Updated last month
- This project is based on the [LTX-Video](https://github.com/Lightricks/LTX-Video) algorithm of the diffusers and optimized and accelerateβ¦β13Dec 31, 2024Updated last year
- Official Java Client for Ipregistry, a Fast, Reliable IP Geolocation and Threat Data API.β16Dec 7, 2025Updated 3 months ago
- Blackbird OSINT tool FrontEnd React Projectβ13Mar 6, 2024Updated 2 years ago
- Different bangla datasets for sentiment analysis on bangla textβ10Nov 26, 2022Updated 3 years ago
- Machine learning algorithms implements with jax for machine learning in production in large scale dataset.β14Mar 2, 2026Updated last week
- β11Dec 6, 2023Updated 2 years ago
- ε©η¨ docker εΏ«ιε»Ίη« pgadmin4γLinux install pgadmin4β12Apr 7, 2024Updated last year
- Curate a list of digital marketing tools, categorized by purpose, with links to documentation, tutorials, and reviews.β10Oct 30, 2023Updated 2 years ago
- Redis distributed lock implementation for Python based on Pub/Sub messagingβ11Feb 14, 2026Updated 3 weeks ago
- β10Jan 9, 2024Updated 2 years ago
- A Streamlit-based chatbot application using Gemini models for NLP. Features include light/dark mode toggle, model selection (Gemini 1.5 Fβ¦β10May 23, 2024Updated last year
- ποΈ PΒ³: Lightning-fast podcast processing with Apple Silicon optimization and local LLMs. Parakeet MLX transcription + Ollama analysis = β¦β28Aug 25, 2025Updated 6 months ago
- β14Dec 12, 2022Updated 3 years ago
- Open Source Project for Defi Crypto Portfolio Managementβ10Feb 4, 2025Updated last year
- β12Jun 17, 2025Updated 8 months ago