Qcompiler / MixQ_Tensorrt_LLM
Mixed-precision inference with TensorRT-LLM
☆94Updated last month
Related projects
Alternatives and complementary repositories for MixQ_Tensorrt_LLM
- Support mixed-precision inference with vLLM☆97Updated 2 weeks ago
- [ICME 2024] Official Datasets and example of LLM-SAP: Large Language Model Situational Awareness Based Planning☆42Updated 2 months ago
- High-performance rank executor for advertisement and recommendation systems, implemented in C/C++ and supporting ensembling into Java/Scala ho…☆99Updated 8 months ago
- LinkedIn and Seek job information crawler☆105Updated last month
- ☆84Updated last month
- GAL-DAWN: A Novel High-Performance Computing Library of Graph Algorithms based on DAWN, CUDA/C++☆116Updated 3 months ago
- AutoRLAIF is a cutting-edge framework designed to revolutionize the fine-tuning of large language models through Reinforcement Learning …☆118Updated 3 weeks ago
- Rethinking Video-Text Understanding: Retrieval from Counterfactually Augmented Data☆48Updated 3 months ago
- ☆33Updated 2 years ago
- An Easy-to-use Tool for Comprehensive Response Evaluation of LLMs☆45Updated last month
- ☆42Updated 9 months ago
- ☆106Updated 3 weeks ago
- A Contextual RAG Bot Framework☆107Updated last month
- [EMNLP 2024 Findings] Official PyTorch Implementation of "Adaptive Contrastive Search: Uncertainty-Guided Decoding for Open-Ended Text Ge…☆49Updated last month
- ☆81Updated this week
- Please visit our demonstration website for interactive demonstrations☆42Updated last month
- ☆106Updated last month
- This script allows the server to isolate computational resources through LXD and pre-install PyTorch in order to share GPUs among differe…☆121Updated 7 months ago
- ☆81Updated 2 months ago
- Deep Reinforcement Learning Algorithms for solving Atari 2600 Games☆197Updated last year
- ☆129Updated this week
- The Buddhist Scripture Explanation API is an AI-powered service designed to provide insightful explanations for passages from key Buddhis…☆89Updated 2 months ago
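- 即迅 speech recognition service, supporting automatic speech recognition (ASR), text-to-speech (TTS), voiceprint recognition (VPR), and other functions; adapted to Chinese domestic ARM operating systems, with fast CPU-based speech recognition☆102Updated 4 months ago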
- ☆99Updated 7 months ago
- ☆177Updated last month
- ☆82Updated 3 months ago
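- Final Fantasy XIV English notes☆139Updated 5 months ago
- Morgana questionnaire and form editor: low-code rapid form building, AI form generation, and form data collection and statistics☆206Updated 3 months ago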
- ☆148Updated 6 months ago
- A 3D game with melee combat and a parkour system, built on UE5.☆36Updated 5 months ago