codefuse-ai / FasterTransformer4CodeFuse

High-performance LLM inference based on our optimized version of FasterTransformer

☆123

Alternatives and similar repositories for FasterTransformer4CodeFuse

Users that are interested in FasterTransformer4CodeFuse are comparing it to the libraries listed below

codefuse-ai / codefuse
Index of the CodeFuse Repositories
☆138Updated 9 months ago
codefuse-ai / MFTCoder
A high-accuracy, high-efficiency multi-task fine-tuning framework for Code LLMs. This work has been accepted by KDD 2024.
☆688Updated 5 months ago
CodeGeeX / codegeex-fastertransformer
fastertransformer for codegeex model
☆63Updated 2 years ago
WisdomShell / llama_cpp_for_codeshell
CodeShell model in C/C++
☆106Updated 11 months ago
WisdomShell / codeshell-intellij
An intelligent coding assistant plugin for IntelliJ, developed based on CodeShell
☆183Updated last year
BaihaiAI / IDPChat
IDPChat is an open Chinese multimodal model
☆56Updated 2 years ago
sophgo / ChatGLM3-TPU
run chatglm3-6b in BM1684X
☆39Updated last year
codefuse-ai / codefuse-evaluation
Industrial-level evaluation benchmarks for coding LLMs across the full life cycle of AI-native software development. An enterprise-grade evaluation suite for code LLMs, with releases ongoing.
☆96Updated 2 months ago
billvsme / my_openai_api
Deploy your own OpenAI API 🤩, based on flask and transformers (uses the Baichuan2-13B-Chat-4bits model, which can run on a single Tesla T4 GPU); implements the OpenAI Chat, Models, and Completions endpoints, including streaming…
☆94Updated last year
IEIT-Yuan / Yuan2.0-M32
Mixture-of-Experts (MoE) Language Model
☆189Updated 9 months ago
llmapp / openai
Implement OpenAI APIs and plugin-enabled ChatGPT with open source LLM and other models.
☆120Updated last year
FlagAI-Open / Aquila2
The official repo of Aquila2 series proposed by BAAI, including pretrained & chat large language models.
☆442Updated 8 months ago
QwenLM / vllm-gptq
A high-throughput and memory-efficient inference and serving engine for LLMs
☆135Updated 6 months ago
chu-tianxiang / vllm-gptq
A high-throughput and memory-efficient inference and serving engine for LLMs
☆131Updated last year
hpcaitech / SwiftInfer
Efficient AI Inference & Serving
☆471Updated last year
IEIT-Yuan / Yuan-2.0
Yuan 2.0 Large Language Model
☆685Updated 11 months ago
QwenLM / qwen.cpp
C++ implementation of Qwen-LM
☆595Updated 6 months ago
kwai / KwaiYii
☆225Updated last year
mindspore-lab / mindformers
☆168Updated this week
ziwang-com / zero-lora
zero: zero-training LLM parameter tuning
☆31Updated last year
sugarforever / spark-api-gateway
☆68Updated last year
Qihoo360 / 360zhinao
360zhinao
☆290Updated last month
eosphoros-ai / DB-GPT-Plugins
Multi-agent and plugin repo for DB-GPT; can complete various tasks around databases.
☆102Updated last year
dataelement / bisheng-rt
bisheng model services backend
☆29Updated 11 months ago
xverse-ai / XVERSE-13B
XVERSE-13B: A multilingual large language model developed by XVERSE Technology Inc.
☆646Updated last year
aixcoder-plugin / nl2code-dataset
Aix-bench, the Java benchmark for code synthesis problem.
☆51Updated 2 years ago
BAAI-WuDao / Model
“WuDao” models
☆122Updated 3 years ago
01-ai / Descartes
☆109Updated last year
AtomEcho / AtomBulb
Aims to provide an intuitive, concrete, and standardized evaluation of current mainstream LLMs
☆94Updated 2 years ago
HFAiLab / hai-platform-studio
An integrated user interface for use with HAI Platform
☆52Updated 2 years ago