A high-throughput and memory-efficient inference and serving engine for LLMs
☆55 · Dec 11, 2023 · Updated 2 years ago
Alternatives and similar repositories for vllm-release
Users who are interested in vllm-release are comparing it to the libraries listed below.
- Prompt + regex lab ☆10 · Nov 22, 2023 · Updated 2 years ago
- Detecting Drift in a Diabetes Dataset using Taipy ☆12 · May 19, 2025 · Updated 11 months ago
- Reverse engineer patterns for use with SpaCy's DependencyMatcher ☆36 · Feb 8, 2020 · Updated 6 years ago
- Flexible, extensible and scalable web-based speech annotation tool ☆14 · Apr 4, 2025 · Updated last year
- A quick way to get started with Transformer Lens ☆14 · Dec 13, 2023 · Updated 2 years ago
- ☆13 · Aug 7, 2021 · Updated 4 years ago
- BH hackathon ☆14 · Apr 4, 2024 · Updated 2 years ago
- meta_llama_2finetuned_text_generation_summarization ☆21 · Jul 21, 2023 · Updated 2 years ago
- Tools for formatting large language model prompts. ☆13 · Dec 19, 2023 · Updated 2 years ago
- Tiny evaluation of leading LLMs on competitive programming problems ☆14 · Apr 10, 2026 · Updated 3 weeks ago
- OpenSource deployment made easy ☆10 · Jun 13, 2015 · Updated 10 years ago
- Examples for using the SiLLM framework for training and running Large Language Models (LLMs) on Apple Silicon ☆16 · May 8, 2025 · Updated 11 months ago
- Ongoing research training transformer language models at scale, including: BERT & GPT-2 ☆20 · Oct 23, 2023 · Updated 2 years ago
- Sentence Embedding as a Service ☆15 · Jun 30, 2025 · Updated 10 months ago
- A common protocol for AI agent tools ☆10 · Oct 21, 2024 · Updated last year
- ☆19 · Dec 31, 2025 · Updated 4 months ago
- ☆868 · Dec 8, 2023 · Updated 2 years ago
- Official inference library for Mistral models ☆10,786 · Apr 20, 2026 · Updated 2 weeks ago
- Python 3 compatible softphone with support for audio streaming. ☆14 · Apr 18, 2024 · Updated 2 years ago
- A minimal yet unstoppable blueprint for multi-agent AI—anchored by the rare, far-reaching “Multi-Agent AI DAO” (2017 Prior Art)—empowerin… ☆35 · Jan 11, 2025 · Updated last year
- Tensor library for machine learning ☆17 · Jul 13, 2023 · Updated 2 years ago
- These are the files used to create the evezor mass production coaster demo http://evezor.com/coasters ☆11 · May 16, 2017 · Updated 8 years ago
- Source and documentation for development of autopilot for a surface vessel ☆15 · Jun 3, 2015 · Updated 10 years ago
- Bayesian Inverse Graphics for Few-Shot Concept Learning ☆12 · Mar 16, 2025 · Updated last year
- ☆15 · Feb 23, 2026 · Updated 2 months ago
- Russian-English GAN-based vocoder ☆17 · Jun 15, 2021 · Updated 4 years ago
- ☆17 · Feb 6, 2018 · Updated 8 years ago
- Implementation of ModernBERT in MLX ☆21 · Jan 7, 2026 · Updated 4 months ago
- ☆12 · Apr 19, 2024 · Updated 2 years ago
- fatt tries to find any purl in your project by looking at predefined fields in the supported packages. These fields describe using a purl… ☆11 · Apr 14, 2026 · Updated 3 weeks ago
- A high performance batching router optimises max throughput for text inference workload ☆16 · Sep 6, 2023 · Updated 2 years ago
- ☆12 · Dec 8, 2020 · Updated 5 years ago
- [CVPR'23 Highlight] Heterogeneous Continual Learning. ☆15 · Dec 5, 2023 · Updated 2 years ago
- ☆24 · Nov 19, 2024 · Updated last year
- Proof of concept about the privilege escalation flaw identified in Google's Osconfig ☆10 · Sep 20, 2020 · Updated 5 years ago
- Fast model deployment on AWS Lambda ☆14 · Feb 25, 2024 · Updated 2 years ago
- ☆12 · Feb 23, 2023 · Updated 3 years ago
- My second Go program ☆12 · Apr 30, 2020 · Updated 6 years ago
- Gemini Live API + function calling for patient intake ☆24 · Nov 8, 2025 · Updated 5 months ago