A high-throughput and memory-efficient inference and serving engine for LLMs
☆35Mar 21, 2024Updated 2 years ago
Alternatives and similar repositories for vllm
Users that are interested in vllm are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Official repository of Evolutionary Optimization of Model Merging Recipes☆1,425Nov 29, 2024Updated last year
- Code for Discovering Preference Optimization Algorithms with and for Large Language Models☆197Jun 13, 2024Updated 2 years ago
- CycleQD is a framework for parameter space model merging.☆49Feb 1, 2025Updated last year
- A quick Crew AI tutorial☆23May 9, 2024Updated 2 years ago
- Memory experiments with LLMs☆10Mar 31, 2023Updated 3 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Azure Command-Line Interface☆15Mar 26, 2026Updated 2 months ago
- ☆15Jan 21, 2025Updated last year
- SAM3 ROS1/ROS2 wrapper☆49Mar 9, 2026Updated 3 months ago
- STM32 Project | Personal Experiment☆11Updated this week
- Math library for JavaScript 2D/3D graphics rendering.☆11Aug 30, 2025Updated 9 months ago
- ☆13May 23, 2024Updated 2 years ago
- A Mac OS X application for recording the screen and converting to .webm (for now) -- written in Swift☆10Dec 19, 2014Updated 11 years ago
- an open-source differential drive ROS2-based educational robot (similar to the popular Turtlebot, Andino bot, limo robot, etc.) for learn…☆21Mar 4, 2026Updated 3 months ago
- Remote robot using STM32, ESP8266 and Android☆15Jul 12, 2025Updated 11 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆15Jul 5, 2024Updated last year
- RUN LLAMA-3 70B llm with NVIDIA endpoints☆13Apr 20, 2024Updated 2 years ago
- llmon-py is a multimodal webui for Llama 3-8B.☆15Jul 1, 2024Updated last year
- Open Co Scientist aims to democratize scientific research by providing an open-source implementation of an AI co-scientist system.☆15Mar 1, 2025Updated last year
- ☆17Sep 13, 2024Updated last year
- Code release for "MORE: Multi-mOdal REtrieval Augmented Generative Commonsense Reasoning"☆11Oct 11, 2024Updated last year
- Template CrewAI allowing for selection of multiple agents including GPT-3, GPT-4, Mixtral, Llama 3, and Gemma☆11May 11, 2024Updated 2 years ago
- Knowledge Based Authentication Performance Metrics Projec☆12Nov 20, 2014Updated 11 years ago
- C++ Rapidly-exploring Random Tree (RRT) and RRT* implementation for ROS Melodic Morenia. Includes a visualizer and custom map drawer buil…☆16Jun 3, 2021Updated 5 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- ☆24Nov 19, 2024Updated last year
- ☆10Aug 15, 2024Updated last year
- ☆107Mar 4, 2024Updated 2 years ago
- Naive Bayes Tweet Sentiment Classifier in Kotlin☆14Sep 21, 2020Updated 5 years ago
- Robotics Book - Parallel Robots: Mechanics and Control Book☆21May 3, 2025Updated last year
- map elites python reference implementation☆98Sep 20, 2023Updated 2 years ago
- Unity, UE4, Python, OSVR, all plugins will be in this repo !☆11Jul 18, 2018Updated 7 years ago
- ☆13Jul 24, 2018Updated 7 years ago
- ☆13May 8, 2025Updated last year
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- An iOS / macOS Swift Package for Rendering Markdown and LaTeX in a WebView☆43Mar 14, 2026Updated 3 months ago
- Notes and to-dos organizer☆19Jun 8, 2026Updated last week
- PyTorch implementation for the Neural Logic Machines (NLM).☆12May 7, 2019Updated 7 years ago
- Notebooks for CS4305TU Regression Lectures☆11Oct 14, 2022Updated 3 years ago
- Learning TensorFlow☆10Aug 4, 2017Updated 8 years ago
- Get started using Deepgram's Text-to-Speech with this Flask demo app☆16Apr 11, 2026Updated 2 months ago
- An advanced research assistant that utilizes AI agents to generate novel research directions and analyze scientific literature. This plat…☆16Feb 26, 2025Updated last year