codefuse-ai / FasterTransformer4CodeFuse
High-performance LLM inference based on our optimized version of FasterTransformer
☆123Updated last year
Alternatives and similar repositories for FasterTransformer4CodeFuse
Users who are interested in FasterTransformer4CodeFuse are comparing it to the libraries listed below
- Index of the CodeFuse Repositories☆138Updated 9 months ago
- High-accuracy, high-efficiency multi-task fine-tuning framework for Code LLMs. This work has been accepted by KDD 2024.☆688Updated 5 months ago
- FasterTransformer for the CodeGeeX model☆63Updated 2 years ago
- Implement OpenAI APIs and plugin-enabled ChatGPT with open source LLM and other models.☆120Updated 11 months ago
- Run ChatGLM3-6B on BM1684X☆39Updated last year
- A high-throughput and memory-efficient inference and serving engine for LLMs☆136Updated 6 months ago
- A high-throughput and memory-efficient inference and serving engine for LLMs☆131Updated 11 months ago
- 360zhinao☆289Updated 3 weeks ago
- Deploy your own OpenAI API 🤩, based on Flask and transformers (uses the Baichuan2-13B-Chat-4bits model, which can run on a single Tesla T4 GPU); implements the OpenAI Chat, Models, and Completions endpoints, including streaming…☆93Updated last year
- UnitGen is a data framework for code fine-tuning that generates fine-tuning data directly from your codebase: code completion, test generation, documentation generation, and more.☆56Updated 11 months ago
- Industrial-level evaluation benchmarks for coding LLMs across the full life cycle of AI-native software development. An enterprise-grade evaluation suite for code LLMs, with ongoing releases.☆96Updated last month
- A small language model that supports Chinese-language scenarios: llama2.c-zh☆147Updated last year
- The official repo of Aquila2 series proposed by BAAI, including pretrained & chat large language models.☆441Updated 7 months ago
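- A pure C++ cross-platform LLM acceleration library, callable from Python; supports Baichuan, GLM, LLaMA, and MOSS base models; runs smoothly on mobile; ChatGLM-6B-class models can reach 10000+ tokens/s on a single GPU☆45Updated last year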
- Mixture-of-Experts (MoE) Language Model☆189Updated 8 months ago
- bisheng model services backend☆27Updated 10 months ago
- ☆328Updated 11 months ago
- Multi-agent and plugin repo for DB-GPT; can complete various database-related tasks.☆101Updated last year
- ☆39Updated 7 months ago
- The ChatGPT plugin to enhance OpenMLDB.☆51Updated 2 years ago
- ☆69Updated last year
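- The world's first Chinese-optimized version of StableVicuna.☆64Updated last year
- Data processing for code LLM pre-training, fine-tuning, and DPO; a state-of-the-art industry processing pipeline☆40Updated 10 months ago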
- ☆224Updated last year
- llama inference for tencentpretrain☆98Updated 2 years ago
- DashInfer is a native LLM inference engine aiming to deliver industry-leading performance atop various hardware architectures, including …☆254Updated last week
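- Implements a cross-model approach that combines multi-LoRA weight integration and switching with Zero-Finetune enhancement: LLM-Base + LLM-X + Alpaca. Initially, LLM-Base is the ChatGLM-6B base model and LLM-X is a LLaMA-enhanced model. The approach is simple and efficient, and aims to let such language models be deployed widely at low energy cost, and…☆115Updated last year
- Zero-training LLM parameter tuning☆31Updated last year
- OrionStar-Yi-34B-Chat is an open-source Chinese-English chat model, fine-tuned by OrionStar from the open-source Yi-34B model on 150K+ high-quality corpus samples.☆258Updated last year
- share data, prompt data, pretraining data☆36Updated last year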