codefuse-ai / FasterTransformer4CodeFuseLinks
High-performance LLM inference based on our optimized version of FastTransfomer
☆122Updated 2 years ago
Alternatives and similar repositories for FasterTransformer4CodeFuse
Users that are interested in FasterTransformer4CodeFuse are comparing it to the libraries listed below
Sorting:
- Index of the CodeFuse Repositories☆137Updated last year
- High Accuracy and efficiency multi-task fine-tuning framework for Code LLMs. This work has been accepted by KDD 2024.☆704Updated 11 months ago
- fastertransformer for codegeex model☆65Updated 2 years ago
- 360zhinao☆291Updated 7 months ago
- run chatglm3-6b in BM1684X☆40Updated last year
- Implement OpenAI APIs and plugin-enabled ChatGPT with open source LLM and other models.☆121Updated last year
- bisheng model services backend☆33Updated last year
- C++ implementation of Qwen-LM☆611Updated last year
- CodeShell model in C/C++☆105Updated last year
- A high-throughput and memory-efficient inference and serving engine for LLMs☆132Updated last year
- CodeLLaMA 中文版 - 代码生成助手,huggingface累积下载2w+次☆45Updated 2 years ago
- XVERSE-65B: A multilingual large language model developed by XVERSE Technology Inc.☆141Updated last year
- Yuan 2.0 Large Language Model☆689Updated last year
- 部署你自己的OpenAI api🤩, 基于flask, transformers (使用 Baichuan2-13B-Chat-4bits 模型, 可以运行在单张Tesla T4显卡) ,实现了OpenAI中Chat, Models和Completions接口,包含流式响…☆96Updated 2 years ago
- Its an open source LLM based on MOE Structure.☆58Updated last year
- ☆113Updated last year
- ☆348Updated last year
- VS Code extension for CodeGeeX☆325Updated 2 years ago
- 支持中文场景的的小语言模型 llama2.c-zh☆150Updated last year
- The official repo of Aquila2 series proposed by BAAI, including pretrained & chat large language models.☆445Updated last year
- 首个llama2 13b 中文版模型 (Base + 中文对话SFT,实现流畅多轮人机自然语言交互)☆91Updated 2 years ago
- Mixture-of-Experts (MoE) Language Model☆192Updated last year
- llm-inference is a platform for publishing and managing llm inference, providing a wide range of out-of-the-box features for model deploy…☆89Updated last year
- ☆180Updated last week
- Efficient AI Inference & Serving☆478Updated last year
- OrionStar-Yi-34B-Chat 是一款开源中英文Chat模型,由猎户星空基于Yi-34B开源模型、使用15W+高质量语料微调而成。☆262Updated last year
- A high-throughput and memory-efficient inference and serving engine for LLMs☆139Updated last year
- XVERSE-13B: A multilingual large language model developed by XVERSE Technology Inc.☆645Updated last year
- Multi-Agents & Plugins repo for DB-GPT, Can complete various tasks around databases.☆104Updated last year
- Easy, fast, and cheap pretrain,finetune, serving for everyone☆316Updated 4 months ago