codefuse-ai / FasterTransformer4CodeFuse
High-performance LLM inference based on our optimized version of FasterTransformer
☆123 Updated last year
Alternatives and similar repositories for FasterTransformer4CodeFuse:
Users who are interested in FasterTransformer4CodeFuse are comparing it to the libraries listed below.
- Index of the CodeFuse Repositories ☆136 Updated 7 months ago
- High-accuracy, efficient multi-task fine-tuning framework for Code LLMs. This work has been accepted by KDD 2024. ☆683 Updated 3 months ago
- Implement OpenAI APIs and plugin-enabled ChatGPT with open source LLM and other models. ☆120 Updated 10 months ago
- FasterTransformer for the CodeGeeX model ☆63 Updated last year
- zero: zero-training LLM parameter tuning ☆31 Updated last year
- CodeShell model in C/C++ ☆105 Updated 9 months ago
- A high-throughput and memory-efficient inference and serving engine for LLMs ☆135 Updated 4 months ago
- XVERSE-65B: A multilingual large language model developed by XVERSE Technology Inc. ☆139 Updated last year
- The official repo of Aquila2 series proposed by BAAI, including pretrained & chat large language models. ☆442 Updated 6 months ago
- Industrial-level evaluation benchmarks for coding LLMs across the full life cycle of AI-native software development. An enterprise-grade evaluation system for code LLMs, with ongoing open releases. ☆92 Updated 3 weeks ago
- llm-inference is a platform for publishing and managing llm inference, providing a wide range of out-of-the-box features for model deploy… ☆80 Updated 11 months ago
- Architecture diagram of the AGI module library ☆75 Updated last year
- share data, prompt data, pretraining data ☆36 Updated last year
- 360zhinao ☆289 Updated 3 months ago
- Multi-agent and plugin repo for DB-GPT; can complete various database-related tasks. ☆99 Updated last year
- Uses an LLM plus a sensitive-word lexicon to automatically detect whether text contains sensitive words. ☆118 Updated last year
- Run ChatGLM3-6B on BM1684X ☆38 Updated last year
- The world's first Chinese-optimized version of StableVicuna. ☆64 Updated last year
- Mixture-of-Experts (MoE) Language Model ☆186 Updated 7 months ago
- Self-hosted ChatGLM-6B API built with FastAPI ☆78 Updated 2 years ago
- XVERSE-7B: A multilingual large language model developed by XVERSE Technology Inc. ☆52 Updated last year
- CodeGPT: A Code-Related Dialogue Dataset Generated by GPT and for GPT ☆113 Updated last year
- ⚡ boost inference speed of GPT models in transformers by onnxruntime ☆53 Updated last year
- ☆19 Updated 3 months ago
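- Humanable Chat Generative-model Fine-tuning | LLM fine-tuning ☆206 Updated last year
- OrionStar-Yi-34B-Chat is an open-source Chinese-English chat model, fine-tuned by OrionStar on the open-source Yi-34B model using more than 150,000 high-quality training samples. ☆258 Updated last year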
- A demo built on Megrez-3B-Instruct, integrating a web search tool to enhance the model's question-and-answer capabilities. ☆37 Updated 4 months ago
- ☆36 Updated 6 months ago
- Efficient AI Inference & Serving ☆471 Updated last year
- ☆108 Updated last year