codefuse-ai / FasterTransformer4CodeFuseLinks
High-performance LLM inference based on our optimized version of FastTransfomer
☆123Updated last year
Alternatives and similar repositories for FasterTransformer4CodeFuse
Users that are interested in FasterTransformer4CodeFuse are comparing it to the libraries listed below
Sorting:
- Index of the CodeFuse Repositories☆138Updated 10 months ago
- High Accuracy and efficiency multi-task fine-tuning framework for Code LLMs. This work has been accepted by KDD 2024.☆694Updated 6 months ago
- fastertransformer for codegeex model☆63Updated 2 years ago
- 360zhinao☆290Updated 2 months ago
- VS Code extension for CodeGeeX☆315Updated 2 years ago
- OrionStar-Yi-34B-Chat 是一款开源中英文Chat模型,由猎户星空基于Yi-34B开源模型、使用15W+高质量 语料微调而成。☆259Updated last year
- Efficient AI Inference & Serving☆472Updated last year
- 全球首个StableVicuna中文优化版。☆64Updated 2 years ago
- CodeShell model in C/C++☆106Updated 11 months ago
- Implement OpenAI APIs and plugin-enabled ChatGPT with open source LLM and other models.☆120Updated last year
- Easy, fast, and cheap pretrain,finetune, serving for everyone☆310Updated last month
- 首个llama2 13b 中文版模型 (Base + 中文对话SFT,实现流畅多轮人机自然语言交互)☆90Updated last year
- The official repo of Aquila2 series proposed by BAAI, including pretrained & chat large language models.☆444Updated 9 months ago
- ☆169Updated this week
- Yuan 2.0 Large Language Model☆688Updated last year
- 部署你自己的OpenAI api🤩, 基于flask, transformers (使用 Baichuan2-13B-Chat-4bits 模型, 可以运行在单张Tesla T4显卡) ,实现了OpenAI中Chat, Models和Completions接口,包含流式响…☆93Updated last year
- UnitGen 是一个用于生成微调代码的数据框架 —— 直接从你的代码库中生成微调数据:代码补全、测试生成、文档生成等。UnitGen is a code fine-tuning data framework that generates data from your ex…☆57Updated last year
- ☆225Updated last year
- Play LLaMA2 (official / 中文版 / INT4 / llama2.cpp) Together! ONLY 3 STEPS! ( non GPU / 5GB vRAM / 8~14GB vRAM)☆542Updated last year
- IDPChat是开放的中文多模态模型☆56Updated 2 years ago
- AGI模块库架构图☆76Updated last year
- Mixture-of-Experts (MoE) Language Model☆189Updated 10 months ago
- run chatglm3-6b in BM1684X☆39Updated last year
- C++ implementation of Qwen-LM☆600Updated 7 months ago
- XVERSE-65B: A multilingual large language model developed by XVERSE Technology Inc.☆139Updated last year
- ☆349Updated 11 months ago
- ☆110Updated last year
- XVERSE-13B: A multilingual large language model developed by XVERSE Technology Inc.☆646Updated last year
- llm-inference is a platform for publishing and managing llm inference, providing a wide range of out-of-the-box features for model deploy…☆85Updated last year
- bisheng-unstructured library☆54Updated 2 months ago