chu-tianxiang / vllm-gptq

A high-throughput and memory-efficient inference and serving engine for LLMs

☆132

Alternatives and similar repositories for vllm-gptq

Users who are interested in vllm-gptq are comparing it to the libraries listed below.

the-seeds / imitater
Imitate OpenAI with Local Models
☆89Updated last year
QwenLM / vllm-gptq
A high-throughput and memory-efficient inference and serving engine for LLMs
☆139Updated 11 months ago
Minami-su / character_AI_open
Generate multi-round conversation roleplay data based on self-instruct and evol-instruct.
☆136Updated 10 months ago
AtomEcho / AtomBulb
Aims to provide an intuitive, concrete, and standardized evaluation of current mainstream LLMs
☆95Updated 2 years ago
yangjianxin1 / LongQLoRA
LongQLoRA: Extend Context Length of LLMs Efficiently
☆167Updated 2 years ago
LowinLi / transformers-stream-generator
This is a text generation method which returns a generator, streaming out each token in real-time during inference, based on Huggingface/…
☆97Updated last year
IEIT-Yuan / Yuan2.0-M32
Mixture-of-Experts (MoE) Language Model
☆192Updated last year
WangRongsheng / Aurora
The official code for "Aurora: Activating Chinese chat capability for Mixtral-8x7B sparse Mixture-of-Experts through Instruction-Tuning"
☆266Updated last year
SUSTech-IDEA / SUS-Chat
SUS-Chat: Instruction tuning done right
☆49Updated last year
xverse-ai / XVERSE-65B
XVERSE-65B: A multilingual large language model developed by XVERSE Technology Inc.
☆141Updated last year
SunLemuria / OpenGPTAndBeyond
Open efforts to implement ChatGPT-like models and beyond.
☆107Updated last year
zzlgreat / smart_agent
☆106Updated 2 years ago
DachengLi1 / LongChat
Official repository for LongChat and LongEval
☆532Updated last year
K024 / chatglm-q
Another ChatGLM2 implementation for GPTQ quantization
☆54Updated 2 years ago
Guanaco-Model / Guanaco-Model.github.io
☆123Updated last year
OFA-Sys / Ditto
A self-alignment method for role-play. Benchmark for role-play. Resources for "Large Language Models are Superpositions of All Characters…
☆206Updated last year
llmapp / openai
Implement OpenAI APIs and plugin-enabled ChatGPT with open source LLM and other models.
☆121Updated last year
cuplv / text-to-sql-wizardcoder
Leveraging large language models for text-to-SQL synthesis, this project fine-tunes WizardLM/WizardCoder-15B-V1.0 with QLoRA on a custom …
☆45Updated last year
beichao1314 / Open-Llama
The complete training code of the open-source high-performance Llama model, including the full process from pre-training to RLHF.
☆67Updated 2 years ago
rag-wtf / open-text-embeddings
Open Source Text Embedding Models with OpenAI Compatible API
☆163Updated last year
CLUEbenchmark / SuperCLUE-RAG
A native Chinese benchmark for evaluating retrieval-augmented generation (RAG)
☆123Updated last year
OpenLMLab / ChatZoo
Light local website for displaying performances from different chat models.
☆87Updated 2 years ago
thu-coai / BPO
☆330Updated last year
vtuber-plan / langport
Langport is a language model inference service
☆95Updated last year
LC1332 / Luotuo-Silk-Road
Silk Road will be the dataset zoo for Luotuo(骆驼). Luotuo is an open sourced Chinese-LLM project founded by 陈启源 @ 华中师范大学 & 李鲁鲁 @ 商汤科技 & 冷子…
☆40Updated 2 years ago
ssbuild / deep_training
deep learning
☆149Updated 6 months ago
taishan1994 / qlora-chinese-LLM
Fine-tune Chinese large language models with QLoRA, including ChatGLM, Chinese-LLaMA-Alpaca, and BELLE
☆89Updated 2 years ago
CrazyBoyM / CodeLLaMA-chat
CodeLLaMA Chinese edition: a code generation assistant with over 20,000 cumulative downloads on Hugging Face
☆45Updated 2 years ago
OFA-Sys / ExpertLLaMA
An open-source chatbot built with ExpertPrompting which achieves 96% of ChatGPT's capability.
☆299Updated 2 years ago
alanshi / charset_mnbvc
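This project performs fast encoding detection and conversion on large numbers of text files to support data cleaning for the MNBVC corpus project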
☆67Updated last month