A high-throughput and memory-efficient inference and serving engine for LLMs
☆35Mar 21, 2024Updated 2 years ago
Alternatives and similar repositories for vllm
Users that are interested in vllm are comparing it to the libraries listed below.
- Official repository of Evolutionary Optimization of Model Merging Recipes☆1,420Nov 29, 2024Updated last year
- Code for Discovering Preference Optimization Algorithms with and for Large Language Models☆196Jun 13, 2024Updated last year
- CycleQD is a framework for parameter space model merging.☆49Feb 1, 2025Updated last year
- A quick Crew AI tutorial☆23May 9, 2024Updated 2 years ago
- my personal mcp server☆13Apr 23, 2025Updated last year
- [RSS 2023] RoboNinja: Learning an Adaptive Cutting Policy for Multi-Material Objects☆40May 1, 2023Updated 3 years ago
- Checkpoint/rewind extension for the Pi coding agent. 1 checkpoint per turn, /rewind command, diff preview, safe restore, redo stack.☆64Mar 31, 2026Updated last month
- Azure Command-Line Interface☆15Mar 26, 2026Updated last month
- Memory experiments with LLMs☆10Mar 31, 2023Updated 3 years ago
- ☆14Jan 21, 2025Updated last year
- Chef cookbooks for managing a Ceph cluster☆12Apr 2, 2023Updated 3 years ago
- ☆15Aug 7, 2025Updated 9 months ago
- Unofficial entropix impl for Gemma2 and Llama and Qwen2 and Mistral☆17Jan 12, 2025Updated last year
- ☆12May 23, 2024Updated 2 years ago
- ☆11Jan 10, 2024Updated 2 years ago
- Hypercorn is an ASGI and WSGI Server based on Hyper libraries and inspired by Gunicorn.☆18Jan 12, 2026Updated 4 months ago
- An implementation of the paper "Solving the Rubik's Cube without Human Knowledge"☆14Dec 9, 2018Updated 7 years ago
- to analyze martial arts motion with CMU OpenPose. License depends on OpenPose.☆11Mar 28, 2019Updated 7 years ago
- This very simple Python script takes inputs from your business and outputs articles written by Claude.☆13Apr 3, 2024Updated 2 years ago
- I'm bored☆12Nov 30, 2022Updated 3 years ago
- RUN LLAMA-3 70B llm with NVIDIA endpoints☆14Apr 20, 2024Updated 2 years ago
- Explains how to compute derivatives with PyTorch, as a step toward understanding how neural networks operate.☆18Mar 14, 2023Updated 3 years ago
- Open Co Scientist aims to democratize scientific research by providing an open-source implementation of an AI co-scientist system.☆15Mar 1, 2025Updated last year
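- 🍏 Repo prepared specifically for the 2024 书生·浦语 (InternLM) Large Model Challenge (Spring Season) 🍎 Contains fine-tuning source code related to 赫萝 (Holo)☆12Sep 20, 2024Updated last year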
- ☆18Jan 17, 2021Updated 5 years ago
- 🌟 SwarmAgent: A framework for simulating social group dynamics using multi-agent collaboration, aiding insights into collective behavior…☆13Dec 5, 2023Updated 2 years ago
- An open-source AI agent that lives in your terminal.☆45Updated this week
- Template CrewAI allowing for selection of multiple agents including GPT-3, GPT-4, Mixtral, Llama 3, and Gemma☆11May 11, 2024Updated 2 years ago
- ☆16May 16, 2026Updated last week
- ☆12Nov 11, 2025Updated 6 months ago
- Knowledge Based Authentication Performance Metrics Project☆12Nov 20, 2014Updated 11 years ago
- The official Kamen Rider Craft The 4th Git!!☆22Sep 23, 2024Updated last year
- The Shopify Automation Toolkit☆15Apr 21, 2024Updated 2 years ago
- ☆105Mar 4, 2024Updated 2 years ago
- Llama 3 ORPO Fine Tuning on A100 in Colab Pro.☆12Apr 21, 2024Updated 2 years ago
- Naive Bayes Tweet Sentiment Classifier in Kotlin☆14Sep 21, 2020Updated 5 years ago
- ☆41May 17, 2026Updated last week
- MLJAR Agent taking part in AutoGPT hackathon. Testing micro-agents and scenario prompts.☆15Oct 25, 2023Updated 2 years ago
- ☆16May 1, 2016Updated 10 years ago