codefuse-ai / FasterTransformer4CodeFuse
High-performance LLM inference based on our optimized version of FasterTransformer
☆124Updated last year
Alternatives and similar repositories for FasterTransformer4CodeFuse:
Users who are interested in FasterTransformer4CodeFuse compare it to the repositories listed below
- Index of the CodeFuse Repositories☆136Updated 6 months ago
- High-accuracy, high-efficiency multi-task fine-tuning framework for Code LLMs. This work has been accepted by KDD 2024.☆670Updated 2 months ago
- FasterTransformer for the CodeGeeX model☆63Updated last year
- Industrial-level evaluation benchmarks for coding LLMs across the full life cycle of AI-native software development. An enterprise-grade evaluation suite for code LLMs, with releases ongoing.☆88Updated last year
- Implement OpenAI APIs and plugin-enabled ChatGPT with open source LLM and other models.☆120Updated 8 months ago
- A GPT+ power tool: a simple, practical one-stop AGI architecture with built-in localization, LLM models, agents, a vector database, and smart chains☆48Updated last year
- Multi-agent and plugin repo for DB-GPT that can complete various database-related tasks.☆97Updated last year
- Architecture diagram of the AGI module library☆75Updated last year
- 360zhinao☆291Updated last month
- The official repo of Aquila2 series proposed by BAAI, including pretrained & chat large language models.☆440Updated 4 months ago
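- The first Chinese version of the Llama 2 13B model (base + Chinese dialogue SFT, enabling fluent multi-turn natural-language interaction between humans and the model)☆89Updated last year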
- AI Native IDE based on CodeFuse and OpenSumi☆214Updated this week
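- UnitGen is a data framework for code fine-tuning that generates fine-tuning data directly from your codebase: code completion, test generation, documentation generation, and more. UnitGen is a code fine-tuning data framework that generates data from your ex…☆54Updated 8 months ago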
- A demo of vLLM's remarkable effect on Chinese large language models☆31Updated last year
- ☆105Updated last year
- ☆107Updated 11 months ago
- TianMu: A modern AI tool with multi-platform support, markdown support, multimodal, continuous conversation, and customizable commands. 一…☆83Updated last year
- A high-throughput and memory-efficient inference and serving engine for LLMs☆131Updated 3 months ago
- Run chatglm3-6b on BM1684X☆38Updated last year
- ☆17Updated 2 months ago
- XVERSE-65B: A multilingual large language model developed by XVERSE Technology Inc.☆139Updated 11 months ago
- DashInfer is a native LLM inference engine aiming to deliver industry-leading performance atop various hardware architectures, including …☆236Updated 3 weeks ago
- A demo built on Megrez-3B-Instruct, integrating a web search tool to enhance the model's question-and-answer capabilities.☆37Updated 2 months ago
- zero: zero-training LLM parameter tuning☆31Updated last year
- ☆59Updated 4 months ago
- The public account for everyone, "查特查特", is now live! New questions, new methods, new findings. PRs are welcome!☆43Updated last year
- Deploy your own OpenAI API 🤩, built on flask and transformers (uses the Baichuan2-13B-Chat-4bits model and can run on a single Tesla T4 GPU); implements OpenAI's Chat, Models, and Completions endpoints, including streaming…☆89Updated last year
- Imitate OpenAI with Local Models☆87Updated 6 months ago
- IDPChat is an open Chinese multimodal model☆56Updated last year
- llm-inference is a platform for publishing and managing llm inference, providing a wide range of out-of-the-box features for model deploy…☆79Updated 9 months ago