Evaluate how vLLM and SGLang perform when running a small LLM model on a mid-range NVIDIA GPU
☆19Feb 4, 2026Updated last month
Alternatives and similar repositories for vllm-sglang-perf
Users that are interested in vllm-sglang-perf are comparing it to the libraries listed below
Sorting:
- Examples of App of Apps Pattern☆10Jan 17, 2023Updated 3 years ago
- ☆52Jul 10, 2025Updated 8 months ago
- llm-d helm charts and deployment examples☆50Updated this week
- vTPM with SGX protection☆11May 30, 2019Updated 6 years ago
- !!!!(DEMO)!!!! !!! CHECK OUT THE NEW VERSİON !!! Counting Close People with Yolov7☆13Sep 14, 2022Updated 3 years ago
- The repository for the paper "Predicting in-hospital mortality by combining clinical notes with time-series data"☆12May 23, 2021Updated 4 years ago
- libtpms / swtpm software emulation of a Trusted Platform Module (TPM 1.2 and TPM 2.0) compile script☆13Sep 16, 2020Updated 5 years ago
- Spark interface to the TileDB storage manager [please see README]☆17Dec 23, 2024Updated last year
- Deep Learning library for Python. Convnets, recurrent neural networks, and more. Runs on Theano or TensorFlow.☆12Dec 24, 2016Updated 9 years ago
- ☆14Feb 12, 2024Updated 2 years ago
- ☆19Aug 23, 2025Updated 6 months ago
- Recursive Self-Aggregation evals on ARC-AGI☆28Jan 26, 2026Updated last month
- StartupHeroes Checkstyle project with additional Checkstyle checks and Sonar Checkstyle plugin☆11Jan 25, 2024Updated 2 years ago
- Aerospike monitoring for Graphite - a community driven open source project☆14Feb 18, 2026Updated 2 weeks ago
- Implementation for Decision-focused Summarization (EMNLP2021)☆12Mar 14, 2022Updated 3 years ago
- k8s CSI driver for FastCFS☆13Mar 17, 2024Updated last year
- Semantic code search that scales from private local processing to cloud-scale GPU acceleration☆39Feb 20, 2026Updated 2 weeks ago
- Metadata Editor user and practice guide☆17Mar 2, 2026Updated last week
- A lightweight, self-hosted infrastructure layer for deploying and managing LLM agents as resilient microservices. Features automatic r…☆17Aug 4, 2025Updated 7 months ago
- A fork of Jurgen Vangael's Infinite HMM matlab code☆12Mar 14, 2014Updated 11 years ago
- Functional image processing☆14May 31, 2024Updated last year
- Deep recommendation system☆13Dec 28, 2016Updated 9 years ago
- ☆15Oct 4, 2024Updated last year
- Microsoft Azure PaaS 인 WebApp 에 Django 배포☆10Jul 26, 2016Updated 9 years ago
- Customized Claude Code system prompts for use with tweakcc — ~48k bytes smaller, 30% faster, same accuracy☆34Nov 23, 2025Updated 3 months ago
- Fixes the rotation of the images based on EXIF data☆15Feb 16, 2026Updated 3 weeks ago
- ODQA Baseline 팀프로젝트 이슈/정보 저장용 레포입니다.☆12May 22, 2021Updated 4 years ago
- A scikit-learn compliant implementation of Monroe et al.'s Fightin' Words analysis method.☆11Mar 10, 2019Updated 7 years ago
- MULTIOPED: A Corpus of Multi-Perspective News Editorials.☆12Aug 25, 2021Updated 4 years ago
- 一个支持多轮问数的ChatBI示例代码,可直接使用。A demo of ChatBI,support by FocusGPT☆12Apr 23, 2025Updated 10 months ago
- A git repo showcasing RAG Techniques for building Naive to Advance RAG solutions☆13Feb 16, 2025Updated last year
- ☆15Sep 22, 2024Updated last year
- Tool to perform paired evaluation of automatic systems☆13Oct 20, 2021Updated 4 years ago
- ☆13Dec 18, 2024Updated last year
- A custom Huggingface trainer which supports logging auxiliary losses returned by your model☆15Jul 27, 2025Updated 7 months ago
- Central repository for all public AIDA resources☆13Mar 1, 2021Updated 5 years ago
- ☆12Jul 31, 2025Updated 7 months ago
- Better decision-making in large groups, by encouraging development of proposals by forking and merging.☆16Jun 1, 2023Updated 2 years ago
- Library for action model acquisition from state trace data.☆24Jan 7, 2025Updated last year