A high-throughput and memory-efficient inference and serving engine for LLMs
☆35Mar 21, 2024Updated 2 years ago
Alternatives and similar repositories for vllm
Users that are interested in vllm are comparing it to the libraries listed below.
- Official repository of Evolutionary Optimization of Model Merging Recipes☆1,414Nov 29, 2024Updated last year
- Code for Discovering Preference Optimization Algorithms with and for Large Language Models☆193Jun 13, 2024Updated last year
- CycleQD is a framework for parameter space model merging.☆49Feb 1, 2025Updated last year
- A quick Crew AI tutorial☆23May 9, 2024Updated last year
- my personal mcp server☆13Apr 23, 2025Updated 11 months ago
- Memory experiments with LLMs☆10Mar 31, 2023Updated 3 years ago
- Azure Command-Line Interface☆14Mar 26, 2026Updated 2 weeks ago
- Hypercorn is an ASGI and WSGI Server based on Hyper libraries and inspired by Gunicorn.☆15Jan 12, 2026Updated 3 months ago
- Chef cookbooks for managing a Ceph cluster☆12Apr 2, 2023Updated 3 years ago
- Unofficial entropix impl for Gemma2 and Llama and Qwen2 and Mistral☆17Jan 12, 2025Updated last year
- ☆14Aug 7, 2025Updated 8 months ago
- ☆11Jan 10, 2024Updated 2 years ago
- A Mac OS X application for recording the screen and converting to .webm (for now) -- written in Swift☆10Dec 19, 2014Updated 11 years ago
- An implementation of the paper "Solving the Rubik's Cube without Human Knowledge"☆14Dec 9, 2018Updated 7 years ago
- to analyze martial arts motion with CMU OpenPose. License depends on OpenPose.☆11Mar 28, 2019Updated 7 years ago
- Artifacts for the "SurgeProtector: Mitigating Temporal Algorithmic Complexity Attacks using Adversarial Scheduling" paper that appears in…☆13Jun 24, 2022Updated 3 years ago
- llmon-py is a multimodal webui for Llama 3-8B.☆16Jul 1, 2024Updated last year
- Explains how to compute derivatives in PyTorch, as a way to understand the step just before working with neural networks.☆18Mar 14, 2023Updated 3 years ago
- ☆138May 15, 2024Updated last year
- Online Chat App using React JS☆10Mar 29, 2026Updated 2 weeks ago
- Open Co Scientist aims to democratize scientific research by providing an open-source implementation of an AI co-scientist system.☆15Mar 1, 2025Updated last year
- RabbitMQ on Render☆16Feb 18, 2026Updated last month
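- 🍏 Repo prepared specifically for the 2024 InternLM (书生·浦语) Large Model Challenge (Spring Competition) 🍎 Contains fine-tuning source code related to Holo (赫萝)☆12Sep 20, 2024Updated last year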
- This very simple Python script takes inputs from your business and outputs articles written by Claude.☆13Apr 3, 2024Updated 2 years ago
- 🌟 SwarmAgent: A framework for simulating social group dynamics using multi-agent collaboration, aiding insights into collective behavior…☆13Dec 5, 2023Updated 2 years ago
- Code release for "MORE: Multi-mOdal REtrieval Augmented Generative Commonsense Reasoning"☆11Oct 11, 2024Updated last year
- ☆17Sep 13, 2024Updated last year
- Template CrewAI allowing for selection of multiple agents including GPT-3, GPT-4, Mixtral, Llama 3, and Gemma☆11May 11, 2024Updated last year
- ☆40Apr 6, 2026Updated last week
- this is a trained yolov8n network that only detects people, at "eye-height", trained in a super basic way on COCO☆13Dec 18, 2023Updated 2 years ago
- The Shopify Automation Toolkit☆15Apr 21, 2024Updated last year
- Llama 3 ORPO Fine Tuning on A100 in Colab Pro.☆12Apr 21, 2024Updated last year
- ☆10Feb 2, 2024Updated 2 years ago
- Depot - MacOS app for managing 3D model content and resources.☆11May 18, 2023Updated 2 years ago
- Unity, UE4, Python, OSVR: all plugins will be in this repo!☆11Jul 18, 2018Updated 7 years ago
- Handshake Decentralized SLDs☆29Aug 29, 2023Updated 2 years ago
- ☆21Oct 4, 2025Updated 6 months ago
- ☆12Jul 24, 2018Updated 7 years ago
- MCP Server to make searching openrouter easy☆19Feb 28, 2026Updated last month