A high-throughput and memory-efficient inference and serving engine for LLMs
☆29May 12, 2025Updated 11 months ago
Alternatives and similar repositories for vllm
Users that are interested in vllm are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆16Aug 4, 2024Updated last year
- ☆13May 9, 2023Updated 2 years ago
- PyTorch implementation of our CVPR2023 paper "OpenMix: Exploring Out-of-Distribution samples for Misclassification Detection"☆27Oct 16, 2023Updated 2 years ago
- Awesome Resources about MegEngine☆16Mar 2, 2023Updated 3 years ago
- Fully open reproduction of DeepSeek-R1☆11Mar 24, 2025Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Real-time AI video segmentation of USB camera and streaming over HTTP☆12Apr 23, 2025Updated 11 months ago
- Unofficial docker wrapper for Qualcomm SNPE(Snapdragon Neural Processing Engine) SDK☆11Mar 3, 2022Updated 4 years ago
- ☆10Mar 24, 2024Updated 2 years ago
- Tengine 管子是用来快速生产 demo 的辅助工具☆12Jul 15, 2021Updated 4 years ago
- A Benchmark for Failure Detection under Distribution Shifts in Image Classification☆35Oct 19, 2024Updated last year
- Easy to download and parse version of the Smartdoc 2015 - Challenge 1 dataset.☆15Mar 5, 2018Updated 8 years ago
- 微信(逆向)信息获取DLL☆13Sep 17, 2019Updated 6 years ago
- 一个PyTorch实现的五子棋AI项目☆38Mar 16, 2026Updated last month
- ☆15Apr 15, 2022Updated 4 years ago
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- ncnn is a high-performance neural network inference framework optimized for the mobile platform☆14May 20, 2022Updated 3 years ago
- The predecessor of CiteLab.☆18Feb 3, 2026Updated 2 months ago
- 《万界道友》是一款以 AIGC 驱动、高自由度文字体验、修仙世界观为核心的开源游戏。在这里,你将以普通修士之身,借功法、灵根、神通、法宝与奇遇,一步步推演自己的修行之路。☆47Mar 20, 2026Updated 3 weeks ago
- Algorithms for URL Classification☆19Apr 13, 2015Updated 11 years ago
- Using ncnn to test the reasoning performance of neural network☆38Jan 18, 2026Updated 2 months ago
- ☆14Apr 16, 2019Updated 7 years ago
- Code and models for the paper Shape-Texture Debiased Neural Network Training (ICLR 2021)☆111Aug 4, 2023Updated 2 years ago
- ☆18Nov 30, 2022Updated 3 years ago
- 🐱 ncnn int8 模型量化评估☆14Oct 10, 2022Updated 3 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- A repository of Python & PyTorch scripts which (currently) converts .safetensors models into scaled FP8 variants, utilizing gradient desc…☆27Aug 8, 2025Updated 8 months ago
- Call ncnn from Fortran☆19Dec 18, 2022Updated 3 years ago
- Megvii Electric Moped Detector (ONNX based inference)☆13Jul 4, 2021Updated 4 years ago
- ☆29Feb 6, 2018Updated 8 years ago
- useful dotfiles included vim, zsh, tmux and vscode☆19Apr 3, 2026Updated last week
- Code for our paper "Informative Dropout for Robust Representation Learning: A Shape-bias Perspective" (ICML 2020)☆126Dec 8, 2022Updated 3 years ago
- An object detection codebase based on MegEngine.☆28Dec 14, 2022Updated 3 years ago
- MegEngine build with cu11x☆17Mar 13, 2023Updated 3 years ago
- ☆22Apr 21, 2023Updated 2 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Approximate the product between infinite functional objects on a manifold -- i.e. belief products☆12Apr 7, 2026Updated last week
- Official PyTorch implementation of "Towards Deeper Graph Neural Networks" [KDD2020]☆155Oct 11, 2022Updated 3 years ago
- Towards Economical Inference: Enabling DeepSeek's Multi-Head Latent Attention in Any Transformer-based LLMs☆210Dec 4, 2025Updated 4 months ago
- ☆21Apr 27, 2022Updated 3 years ago
- 有关末日三问及其衍生作品的AIGC项目目录 AIGC Projects related to Sukasuka series☆20Sep 25, 2023Updated 2 years ago
- [AAAI-2025] Towards Efficient and Intelligent Laser Weeding: Method and Dataset for Weed Stem Detection☆34May 15, 2025Updated 11 months ago
- This repository contains the official code for "Flexible Biometrics Recognition: Bridging the Multimodality Gap through Attention, Alignm…☆11Oct 9, 2024Updated last year