codefuse-ai / MFTCoder
High Accuracy and efficiency multi-task fine-tuning framework for Code LLMs. This work has been accepted by KDD 2024.
☆618Updated 3 months ago
Related projects: ⓘ
- High-performance LLM inference based on our optimized version of FastTransfomer☆123Updated 9 months ago
- 🩹Editing large language models within 10 seconds⚡☆1,268Updated last year
- CMMLU: Measuring massive multitask language understanding in Chinese☆669Updated last week
- Official github repo for C-Eval, a Chinese evaluation suite for foundation models [NeurIPS 2023]☆1,601Updated 10 months ago
- FlagEval is an evaluation toolkit for AI large foundation models.☆290Updated 2 months ago
- Industrial-level evaluation benchmarks for Coding LLMs in the full life-cycle of AI native software developing.企业级代码 大模型评测体系,持续开放中☆72Updated 8 months ago
- AgentTuning: Enabling Generalized Agent Abilities for LLMs☆1,329Updated 10 months ago
- ☆881Updated 3 months ago
- [ACL 2024] LongBench: A Bilingual, Multitask Benchmark for Long Context Understanding☆618Updated last week
- ☆286Updated 2 months ago
- agentUniverse is a LLM multi-agent framework that allows developers to easily build multi-agent applications.☆766Updated this week
- Index of the CodeFuse Repositories☆132Updated 2 weeks ago
- A generalized information-seeking agent system with Large Language Models (LLMs).☆1,074Updated 3 months ago
- Awesome-LLM-Eval: a curated list of tools, datasets/benchmark, demos, leaderboard, papers, docs and models, mainly for Evaluation on LLMs…☆396Updated last month
- Yuan 2.0 Large Language Model☆676Updated 2 months ago
- Easy and Efficient Finetuning LLMs. (Supported LLama, LLama2, LLama3, Qwen, Baichuan, GLM , Falcon) 大模型高效量化训练+部署.☆562Updated 2 months ago
- ToolAlpaca: Generalized Tool Learning for Language Models with 3000 Simulated Cases☆281Updated 2 months ago
- CodeQwen1.5 is the code version of Qwen, the large language model series developed by Qwen team, Alibaba Cloud.☆431Updated 3 months ago
- An LLM-based Agent for the New Automation Paradigm - Agentic Process Automation☆754Updated 8 months ago
- XVERSE-13B: A multilingual large language model developed by XVERSE Technology Inc.☆649Updated 5 months ago
- The official repo of Aquila2 series proposed by BAAI, including pretrained & chat large language models.☆429Updated 7 months ago
- C++ implementation of Qwen-LM☆531Updated 8 months ago
- ☆852Updated last month
- 本项目旨在收集开源的表格智能任务数据集(比如表格问答、表格-文本生成等),将原始数据整理为指令微调格式的数据并微调LLM,进而增强LLM对于表格数据的理解,最终构建出专门面向表格智能任务的大型语言模型。☆411Updated 4 months ago
- DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models☆970Updated 8 months ago
- Unify Efficient Fine-tuning of RAG Retrieval, including Embedding, ColBERT,Cross Encoder☆430Updated last month
- Tuning LLMs with no tears💦; Sample Design Engineering (SDE) for more efficient downstream-tuning.☆957Updated 4 months ago
- 大模型多维度中文对齐评测基准 (ACL 2024)☆293Updated last month
- OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2,GPT-4,LLaMa2, Qwen,GLM, Claude, …☆3,741Updated last week
- ModelScope-Agent: An agent framework connecting models in ModelScope with the world☆2,607Updated this week