A high-throughput and memory-efficient inference and serving engine for LLMs
☆31May 12, 2025Updated 11 months ago
Alternatives and similar repositories for vllm
Users who are interested in vllm are comparing it to the libraries listed below.
- ☆21Apr 13, 2022Updated 4 years ago
- MiMo: Unlocking the Reasoning Potential of Language Model – From Pretraining to Posttraining☆2,097Jun 5, 2025Updated 11 months ago
- ☆16Aug 4, 2024Updated last year
- ☆77Apr 29, 2026Updated last week
- ☆13May 9, 2023Updated 2 years ago
- PyTorch implementation of our CVPR2023 paper "OpenMix: Exploring Out-of-Distribution samples for Misclassification Detection"☆27Oct 16, 2023Updated 2 years ago
- On the effectiveness of adversarial training against common corruptions [UAI 2022]☆30May 16, 2022Updated 3 years ago
- PyTorch implementation of our ECCV 2022 paper "Rethinking Confidence Calibration for Failure Prediction"☆26Jun 10, 2023Updated 2 years ago
- ncnn export & infer mobileclip☆21Aug 18, 2025Updated 8 months ago
- Awesome Resources about MegEngine☆16Mar 2, 2023Updated 3 years ago
- Fully open reproduction of DeepSeek-R1☆11Mar 24, 2025Updated last year
- Real-time AI video segmentation of USB camera and streaming over HTTP☆12Apr 23, 2025Updated last year
- A Benchmark for Failure Detection under Distribution Shifts in Image Classification☆35Oct 19, 2024Updated last year
- WeChat (reverse-engineered) message-retrieval DLL☆13Sep 17, 2019Updated 6 years ago
- ☆12Apr 19, 2022Updated 4 years ago
- ☆15Apr 15, 2022Updated 4 years ago
- ENet segmentation on Android, based on ncnn☆17Mar 29, 2020Updated 6 years ago
- ncnn is a high-performance neural network inference framework optimized for the mobile platform☆14May 20, 2022Updated 3 years ago
- ☆43Jan 25, 2024Updated 2 years ago
- ☆20Sep 28, 2024Updated last year
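- 《万界道友》 is an open-source game built around AIGC-driven, highly free-form text gameplay and a xianxia (immortal cultivation) setting. You start as an ordinary cultivator and, through cultivation techniques, spiritual roots, divine abilities, magical treasures, and chance encounters, work out your own path of cultivation step by step.☆52Apr 20, 2026Updated 2 weeks ago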
- Code and models for the paper Shape-Texture Debiased Neural Network Training (ICLR 2021)☆111Aug 4, 2023Updated 2 years ago
- ☆18Nov 30, 2022Updated 3 years ago
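- HayLM is a large language model trained specifically for children. Built by training and fine-tuning InternLM on data covering child psychology, pedagogy, and conversational style, it interacts intelligently with children, continually learns and adapts to each user's traits during conversation, and serves as a virtual friend that accompanies children as they grow.☆16Feb 5, 2025Updated last year
- 🐱 ncnn int8 model quantization evaluation☆14Oct 10, 2022Updated 3 years ago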
- A repository of Python & PyTorch scripts which (currently) converts .safetensors models into scaled FP8 variants, utilizing gradient desc…☆27Aug 8, 2025Updated 8 months ago
- Call ncnn from Fortran☆19Dec 18, 2022Updated 3 years ago
- Megvii Electric Moped Detector (ONNX based inference)☆13Jul 4, 2021Updated 4 years ago
- Jax implementation of VIT-VQGAN☆10Jan 25, 2024Updated 2 years ago
- ☆17Mar 10, 2023Updated 3 years ago
- Code for our paper "Informative Dropout for Robust Representation Learning: A Shape-bias Perspective" (ICML 2020)☆126Dec 8, 2022Updated 3 years ago
- ☆28Jun 30, 2025Updated 10 months ago
- An object detection codebase based on MegEngine.☆28Dec 14, 2022Updated 3 years ago
- GPU methods for alpha matting, including cutting edge research algorithms by Philip G. Lee.☆12Jan 8, 2014Updated 12 years ago
- MegEngine build with cu11x☆17Mar 13, 2023Updated 3 years ago
- TVMScript kernel for deformable attention☆25Dec 15, 2021Updated 4 years ago
- ☆22Apr 21, 2023Updated 3 years ago
- Approximate the product between infinite functional objects on a manifold -- i.e. belief products☆12Apr 28, 2026Updated last week
- A waifu2x-ncnn-vulkan Rust binding.☆23Aug 29, 2025Updated 8 months ago