Qcompiler/vllm-mixed-precision

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/Qcompiler/vllm-mixed-precision)

Qcompiler / vllm-mixed-precision

Support mixed-precsion inference with vllm

☆85

Alternatives and similar repositories for vllm-mixed-precision

Users that are interested in vllm-mixed-precision are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

MingXiangL / AttentionShift
View on GitHub
Official Implementation of AttentionShift: Iteratively Estimated Part-based Attention Map for Pointly Supervised Instance Segmentation
☆155Oct 18, 2024Updated last year
SiyangLi99 / open-alteryx-macro
View on GitHub
Welcome to the 'Open-Alteryx-Macro' project. This project is aimed at providing an open-source solution for managing and updating Alteryx…
☆156May 25, 2024Updated 2 years ago
Qcompiler / MixQ_Tensorrt_LLM
View on GitHub
Mixed precision inference by Tensorrt-LLM
☆79Oct 23, 2024Updated last year
SSSYDYSSS / TransProR
View on GitHub
Analysis and visualization of multi-omics data. In ongoing development: multi-modal fusion, sparse learning, and spatio-temporal effects.…
☆206Jan 15, 2026Updated 6 months ago
shenjunjiekoda / knight
View on GitHub
kight is a static analysis tool for c/c++ programs.
☆213Dec 27, 2024Updated last year
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
ZivJia / hmi-workspace
View on GitHub
An Workspace for HMI tools
☆163Jul 11, 2024Updated 2 years ago
jtun-coder / JtunRouter
View on GitHub
It is an Android-based application that enables managing hotspot properties through a web interface, providing mobile routing functionali…
☆156Jul 14, 2026Updated last week
x-tropy / noteOS
View on GitHub
Imagine building a whole operating system around just your notes.
☆79Feb 5, 2025Updated last year
BiuYeaf / A-general-framework-to-Prompt-tuning-LLM-model
View on GitHub
☆141May 8, 2024Updated 2 years ago
Credit-card-monitoring-and-fraud-check / Credit_card_monitoring_and_check
View on GitHub
A code repository designed to show the best GitHub has to offer.
☆165Jun 30, 2024Updated 2 years ago
Rhythm-Byte / SchemaDiff
View on GitHub
☆246Nov 24, 2024Updated last year
wenlongliaoEE / loadforecast
View on GitHub
☆105Jan 24, 2025Updated last year
OatmealLiu / FineR
View on GitHub
[ICLR'24] Democratizing Fine-grained Visual Recognition with Large Language Models
☆189Jul 15, 2024Updated 2 years ago
Davion-Liu / Awesome-Robustness-in-Information-Retrieval
View on GitHub
A curated list of awesome papers related to adversarial attacks and defenses for information retrieval. If I missed any papers, feel free…
☆220Jul 11, 2024Updated 2 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
banggx / morgana-form
View on GitHub
莫甘娜问卷表单编辑器，低代码快速搭建表单，AI表单生成，表单数据搜集统计
☆147Jun 21, 2026Updated last month
wenlongliaoEE / ETDToolbox
View on GitHub
☆175Feb 21, 2025Updated last year
MingXiangL / DEVIL
View on GitHub
Evaluation of Text-to-Video Generation Models: A Dynamics Perspective[NeurIPS 2024].
☆274Dec 3, 2024Updated last year
tsaol / Web3-serverless-analytics-on-aws
View on GitHub
🔗 Serverless blockchain analytics pipeline on AWS - Extract, process and visualize Ethereum data using Kinesis, Lambda, Redshift Serverl…
☆102Oct 5, 2023Updated 2 years ago
corescriptions / indexer
View on GitHub
Inscriptions on CoreDao, powered by Insdexer.
☆147Mar 20, 2024Updated 2 years ago
SSSYDYSSS / TransProPy
View on GitHub
A python package that integrate algorithms and various machine learning approaches to extract features (genes) effective for classificati…
☆251Jan 15, 2026Updated 6 months ago
johngai19 / TextDistiller
View on GitHub
AI-powered document summarization engine that transforms lengthy texts into crystallized insights
☆146Nov 5, 2024Updated last year
arktrail / Dorothy-Ymir
View on GitHub
AI solution for Patent Classification
☆142Jun 29, 2020Updated 6 years ago
Nonac / DDOPaI
View on GitHub
☆120Sep 30, 2024Updated last year
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
witcherofresearch / Forgedit
View on GitHub
☆284Jul 6, 2024Updated 2 years ago
BugBearer / GPT-INT
View on GitHub
An extension for Visual Studio Code that integrates the power of OpenAI's GPT models into VSCode.
☆159Mar 24, 2024Updated 2 years ago
Nonac / LXD_Build
View on GitHub
This script allows the server to isolate computational resources through LXD and pre-install PyTorch in order to share GPUs among differe…
☆91Apr 13, 2024Updated 2 years ago
ireneli961111 / data-aggregation-federated-learning
View on GitHub
☆142Nov 13, 2024Updated last year
Falling-dow / Unsupervised-Image-Enhancement-with-CNN-and-GAN
View on GitHub
Advanced Unsupervised Image Enhancement with GAN
☆247Nov 11, 2024Updated last year
YPAndrew0907 / Animal-Simulation-game
View on GitHub
Dive into Nature Simulation v1, a dynamic ecosystem game. Experience life's balance with interactive controls and stunning visuals of flo…
☆248Dec 23, 2024Updated last year
yileijin / Bootstrap-GS
View on GitHub
☆251Feb 11, 2025Updated last year
pentilm / FDTDMetamaterial
View on GitHub
C++ codes for FDTD Maxwell's equation.
☆164Jun 11, 2023Updated 3 years ago
EduKgs / entity_linking
View on GitHub
☆142Apr 26, 2024Updated 2 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
530051970 / auth-hub-demo
View on GitHub
User Identity Scaffolding for Multiple OIDC Authentications for User
☆95Jun 14, 2025Updated last year
CGCL-codes / YiTu
View on GitHub
YiTu is an easy-to-use runtime to fully exploit the hybrid parallelism of different hardwares (e.g., GPU) to efficiently support the exec…
☆254Jan 7, 2026Updated 6 months ago
liyao-l-y / ICEDroid
View on GitHub
check
☆101Dec 12, 2025Updated 7 months ago
gersteinlab / ML-Bench
View on GitHub
ML-Bench: Evaluating Large Language Models and Agents for Machine Learning Tasks on Repository-Level Code (https://arxiv.org/abs/2311.098…
☆315Jul 31, 2025Updated 11 months ago
Kaida-Amethyst / ffxiv_notes
View on GitHub
最终幻想14英文笔记
☆96May 25, 2024Updated 2 years ago
sql-agi / DB-GPT-X
View on GitHub
☆242Jun 16, 2026Updated last month
MangoKiller / MolTC
View on GitHub
☆168Jul 14, 2024Updated 2 years ago