Support mixed-precision inference with vLLM
☆84Jul 17, 2025Updated 8 months ago
Alternatives and similar repositories for vllm-mixed-precision
Users that are interested in vllm-mixed-precision are comparing it to the libraries listed below.
- Official Implementation of AttentionShift: Iteratively Estimated Part-based Attention Map for Pointly Supervised Instance Segmentation☆155Oct 18, 2024Updated last year
- Welcome to the 'Open-Alteryx-Macro' project. This project is aimed at providing an open-source solution for managing and updating Alteryx…☆156May 25, 2024Updated last year
- Mixed precision inference by TensorRT-LLM☆79Oct 23, 2024Updated last year
- kight is a static analysis tool for C/C++ programs.☆213Dec 27, 2024Updated last year
- Analysis and visualization of multi-omics data. In ongoing development: multi-modal fusion, sparse learning, and spatio-temporal effects.…☆205Jan 15, 2026Updated 2 months ago
- A workspace for HMI tools☆163Jul 11, 2024Updated last year
- It is an Android-based application that enables managing hotspot properties through a web interface, providing mobile routing functionali…☆156Dec 19, 2024Updated last year
- Imagine building a whole operating system around just your notes.☆80Feb 5, 2025Updated last year
- ☆141May 8, 2024Updated last year
- A code repository designed to show the best GitHub has to offer.☆165Jun 30, 2024Updated last year
- ☆105Jan 24, 2025Updated last year
- ☆246Nov 24, 2024Updated last year
- [ICLR'24] Democratizing Fine-grained Visual Recognition with Large Language Models☆189Jul 15, 2024Updated last year
- Morgana questionnaire and form editor: low-code rapid form building, AI form generation, and form data collection and statistics☆147Aug 9, 2024Updated last year
- ☆175Feb 21, 2025Updated last year
- Evaluation of Text-to-Video Generation Models: A Dynamics Perspective [NeurIPS 2024].