Support mixed-precision inference with vLLM
☆85Jul 17, 2025Updated 8 months ago
Alternatives and similar repositories for vllm-mixed-precision
Users that are interested in vllm-mixed-precision are comparing it to the libraries listed below.
- Official Implementation of AttentionShift: Iteratively Estimated Part-based Attention Map for Pointly Supervised Instance Segmentation☆155Oct 18, 2024Updated last year
- Welcome to the 'Open-Alteryx-Macro' project. This project is aimed at providing an open-source solution for managing and updating Alteryx…☆156May 25, 2024Updated last year
- Mixed precision inference by TensorRT-LLM☆80Oct 23, 2024Updated last year
- kight is a static analysis tool for C/C++ programs.☆214Dec 27, 2024Updated last year
- Analysis and visualization of multi-omics data. In ongoing development: multi-modal fusion, sparse learning, and spatio-temporal effects.…☆206Jan 15, 2026Updated 2 months ago
- A Workspace for HMI tools☆164Jul 11, 2024Updated last year
- It is an Android-based application that enables managing hotspot properties through a web interface, providing mobile routing functionali…☆154Dec 19, 2024Updated last year
- Imagine building a whole operating system around just your notes.☆80Feb 5, 2025Updated last year
- ☆142May 8, 2024Updated last year
- A code repository designed to show the best GitHub has to offer.☆165Jun 30, 2024Updated last year
- ☆105Jan 24, 2025Updated last year
- ☆247Nov 24, 2024Updated last year
- [ICLR'24] Democratizing Fine-grained Visual Recognition with Large Language Models☆190Jul 15, 2024Updated last year
- Morgana questionnaire and form editor: low-code rapid form building, AI form generation, and form data collection and statistics☆147Aug 9, 2024Updated last year
- ☆176Feb 21, 2025Updated last year
- Evaluation of Text-to-Video Generation Models: A Dynamics Perspective[NeurIPS 2024].☆274Dec 3, 2024Updated last year
- A curated list of awesome papers related to adversarial attacks and defenses for information retrieval. If I missed any papers, feel free…☆221Jul 11, 2024Updated last year
- 🔗 Serverless blockchain analytics pipeline on AWS - Extract, process and visualize Ethereum data using Kinesis, Lambda, Redshift Serverl…☆103Oct 5, 2023Updated 2 years ago
- Inscriptions on CoreDao, powered by Insdexer.☆148Mar 20, 2024Updated 2 years ago
- AI-powered document summarization engine that transforms lengthy texts into crystallized insights☆146Nov 5, 2024Updated last year
- ☆121Sep 30, 2024Updated last year
- AI solution for Patent Classification☆143Jun 29, 2020Updated 5 years ago
- ☆287Jul 6, 2024Updated last year
- A Python package that integrates algorithms and various machine learning approaches to extract features (genes) effective for classificati…☆252Jan 15, 2026Updated 2 months ago
- An extension for Visual Studio Code that integrates the power of OpenAI's GPT models into VSCode.☆160Mar 24, 2024Updated 2 years ago
- This script allows the server to isolate computational resources through LXD and pre-install PyTorch in order to share GPUs among differe…☆91Apr 13, 2024Updated last year
- ☆142Nov 13, 2024Updated last year
- Advanced Unsupervised Image Enhancement with GAN☆247Nov 11, 2024Updated last year
- Dive into Nature Simulation v1, a dynamic ecosystem game. Experience life's balance with interactive controls and stunning visuals of flo…☆248Dec 23, 2024Updated last year
- ☆252Feb 11, 2025Updated last year
- C++ codes for FDTD Maxwell's equation.☆161Jun 11, 2023Updated 2 years ago
- User Identity Scaffolding for Multiple OIDC Authentications for User☆95Jun 14, 2025Updated 9 months ago
- check☆100Dec 12, 2025Updated 3 months ago
- ☆143Apr 26, 2024Updated last year
- YiTu is an easy-to-use runtime to fully exploit the hybrid parallelism of different hardware (e.g., GPU) to efficiently support the exec…☆254Jan 7, 2026Updated 2 months ago
- Final Fantasy XIV English notes☆96May 25, 2024Updated last year
- ☆167Jul 14, 2024Updated last year
- ML-Bench: Evaluating Large Language Models and Agents for Machine Learning Tasks on Repository-Level Code (https://arxiv.org/abs/2311.098…☆318Jul 31, 2025Updated 7 months ago
- ☆242Jul 5, 2024Updated last year