mistralai / vllm-releaseView external linksLinks
A high-throughput and memory-efficient inference and serving engine for LLMs
☆53Dec 11, 2023Updated 2 years ago
Alternatives and similar repositories for vllm-release
Users that are interested in vllm-release are comparing it to the libraries listed below
Sorting:
- Repository for go shared libraries (for now).☆11Dec 1, 2025Updated 2 months ago
- TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain…☆15Feb 12, 2024Updated 2 years ago
- PyTorch implementation of a self-attentive speaker embedding☆17Sep 24, 2019Updated 6 years ago
- Ongoing research training transformer language models at scale, including: BERT & GPT-2☆20Oct 23, 2023Updated 2 years ago
- meta_llama_2finetuned_text_generation_summarization☆21Jul 21, 2023Updated 2 years ago
- ☆24Jul 24, 2023Updated 2 years ago
- torch_quantizer is a out-of-box quantization tool for PyTorch models on CUDA backend, specially optimized for Diffusion Models.☆23Mar 29, 2024Updated last year
- A minimal yet unstoppable blueprint for multi-agent AI—anchored by the rare, far-reaching “Multi-Agent AI DAO” (2017 Prior Art)—empowerin…☆32Jan 11, 2025Updated last year
- Databutton MCP Server☆27Apr 7, 2025Updated 10 months ago
- Platform API Project seed☆12Nov 8, 2023Updated 2 years ago
- ☆32Jun 6, 2024Updated last year
- Red Hat 3scale Istio Mixer Adapter - Add 3scale's API Management to the Service Mesh☆32Jul 30, 2021Updated 4 years ago
- ☆12Sep 21, 2023Updated 2 years ago
- Autonomous Traversal and Object Detection for Rovers☆15Updated this week
- ☆53Feb 7, 2026Updated last week
- ☆47Apr 29, 2025Updated 9 months ago
- This repo contains documentation related to the operation of the OpenBytes project.☆13Oct 29, 2021Updated 4 years ago
- Code and data for the paper "Steering Conversational Large Language Models for Long Emotional Support Conversations" along with a UI to v…☆14Apr 14, 2025Updated 10 months ago
- Terraform code and Sentinel policies for HashiConf-2019 talk/demo☆10Sep 23, 2019Updated 6 years ago
- rddapp: Regression Discontinuity Design Application☆11Sep 2, 2025Updated 5 months ago
- Calculate token/s & GPU memory requirement for any LLM. Supports llama.cpp/ggml/bnb/QLoRA quantization☆12Dec 3, 2024Updated last year
- Open-source Human Feedback Library☆11Oct 25, 2023Updated 2 years ago
- Voila! A smart automatic pet feeder using Arduino Uno + RTC time module for scheduling + multiple sensors.☆10Jun 4, 2024Updated last year
- Evaluation of Oasis Platform - simple install, UI and API☆14Nov 7, 2025Updated 3 months ago
- Prompt + regex lab☆10Nov 22, 2023Updated 2 years ago
- A relatively simple, unified method for reporting on Kubernetes resource issues.☆12Mar 5, 2020Updated 5 years ago
- Automatic Thief Detection via CCTV with Alarm System and Perpetrator Image Capture using YOLOv5 + ROI. This project utilizes computer vis…☆14Oct 21, 2024Updated last year
- Python script demonstrating the process of recovering text from embeddings, highlighting the associated privacy risks and mitigation stra…☆18Nov 19, 2024Updated last year
- WebAppSec Confinement Origin Web Labels☆11Feb 16, 2021Updated 4 years ago
- Repositório do Curso de Ciência de dados para acompanhar as aulas do ignorância zero☆11Jun 27, 2022Updated 3 years ago
- node.js Lutron RadioRa 2 control module - to control lighting, shades, etc.☆12Nov 14, 2017Updated 8 years ago
- Python phase-vocoder implementation with pitch shifting and formant correction☆14Feb 17, 2022Updated 3 years ago
- ☆20Aug 8, 2025Updated 6 months ago
- ☆14Mar 26, 2025Updated 10 months ago
- ☆16Feb 2, 2026Updated last week
- Source and documentation for development of autopilot for a surface vessel☆15Jun 3, 2015Updated 10 years ago
- Indexing framework designed for the automated creation of structured knowledge bases in Azure AI Search☆14Jun 18, 2025Updated 7 months ago
- Automated Quality Control for Dialogflow CX Agents☆14May 3, 2024Updated last year
- Python client library for Mistral AI platform☆694Updated this week