IEIT-Yuan / Yuan-2.0
Yuan 2.0 Large Language Model
☆689 · Jul 11, 2024 · Updated last year
Alternatives and similar repositories for Yuan-2.0
Users who are interested in Yuan-2.0 are comparing it to the repositories listed below.
- Skywork series models are pre-trained on 3.2TB of high-quality multilingual (mainly Chinese and English) and code data. We have open-sour… ☆1,485 · Mar 7, 2025 · Updated 11 months ago
- Mixture-of-Experts (MoE) Language Model ☆194 · Sep 9, 2024 · Updated last year
- XVERSE-65B: A multilingual large language model developed by XVERSE Technology Inc. ☆140 · Apr 9, 2024 · Updated last year
- TigerBot: A multi-language multi-task LLM ☆2,262 · Dec 28, 2024 · Updated last year
- Official release of InternLM series (InternLM, InternLM2, InternLM2.5, InternLM3). ☆7,157 · Oct 30, 2025 · Updated 3 months ago
- A series of large language models trained from scratch by developers @01-ai ☆7,844 · Nov 27, 2024 · Updated last year
- A series of large language models developed by Baichuan Intelligent Technology ☆4,118 · Nov 8, 2024 · Updated last year
- A large-scale 7B pretraining language model developed by BaiChuan-Inc. ☆5,686 · Jul 18, 2024 · Updated last year
- Chinese-LLaMA 1&2 and Chinese-Falcon base models; ChatFlow Chinese dialogue model; Chinese OpenLLaMA model; NLP pre-training and instruction fine-tuning datasets ☆3,055 · Apr 14, 2024 · Updated last year
- Xwin-LM: Powerful, Stable, and Reproducible LLM Alignment ☆1,036 · May 31, 2024 · Updated last year
- Firefly: a large-model training tool that supports training Qwen2.5, Qwen2, Yi1.5, Phi-3, Llama3, Gemma, MiniCPM, Yi, Deepseek, Orion, Xverse, Mixtral-8x7B, Zephyr, Mistral, Baichuan2, Llama2, … ☆6,635 · Oct 24, 2024 · Updated last year
- Orion-14B is a family of models that includes a 14B foundation LLM and a series of models: a chat model, a long context model, a quantized mo… ☆810 · Jun 3, 2024 · Updated last year
- The official repo of Aquila2 series proposed by BAAI, including pretrained & chat large language models. ☆444 · Oct 11, 2024 · Updated last year
- XVERSE-13B: A multilingual large language model developed by XVERSE Technology Inc. ☆644 · Apr 9, 2024 · Updated last year
- BELLE: Be Everyone's Large Language model Engine (open-source Chinese conversational large language model) ☆8,281 · Oct 16, 2024 · Updated last year
- DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models ☆1,894 · Jan 16, 2024 · Updated 2 years ago
- AgentTuning: Enabling Generalized Agent Abilities for LLMs ☆1,477 · Oct 31, 2023 · Updated 2 years ago
- A 13B large language model developed by Baichuan Intelligent Technology ☆2,951 · Sep 6, 2023 · Updated 2 years ago
- Official github repo for C-Eval, a Chinese evaluation suite for foundation models [NeurIPS 2023] ☆1,812 · Jul 27, 2025 · Updated 6 months ago
- The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud. ☆20,361 · Jan 30, 2026 · Updated 2 weeks ago
- ChatGLM3 series: Open Bilingual Chat LLMs ☆13,757 · Jan 13, 2025 · Updated last year
- BlueLM (蓝心大模型): Open large language models developed by vivo AI Lab ☆936 · Dec 30, 2024 · Updated last year
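- fastllm is a high-performance large-model inference library with no backend dependencies. It supports tensor-parallel inference for dense models and mixed-mode inference for MoE models; any GPU with 10 GB or more of memory can run the full DeepSeek model. A dual-socket 9004/9005 server with a single GPU can deploy the original full-size, full-precision DeepSeek model at 20 tps with single concurrency; the INT4-quantized model with single concurrency 30tp… ☆4,149 · Updated this week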
- The official code for "Aurora: Activating Chinese chat capability for Mixtral-8x7B sparse Mixture-of-Experts through Instruction-Tuning" ☆263 · May 9, 2024 · Updated last year
- CMMLU: Measuring massive multitask language understanding in Chinese ☆802 · Dec 6, 2024 · Updated last year
- A generalized information-seeking agent system with Large Language Models (LLMs). ☆1,193 · Jun 19, 2024 · Updated last year
- LMDeploy is a toolkit for compressing, deploying, and serving LLMs. ☆7,606 · Updated this week
- FlagAI (Fast LArge-scale General AI models) is a fast, easy-to-use and extensible toolkit for large-scale models. ☆3,881 · Nov 11, 2025 · Updated 3 months ago
- Chinese and English multimodal conversational language model ☆4,177 · Aug 23, 2024 · Updated last year
- Code and documents of LongLoRA and LongAlpaca (ICLR 2024 Oral) ☆2,694 · Aug 14, 2024 · Updated last year
- ⛷️ LLaMA-MoE: Building Mixture-of-Experts from LLaMA with Continual Pre-training (EMNLP 2024) ☆1,003 · Dec 6, 2024 · Updated last year
- A Next-Generation Training Engine Built for Ultra-Large MoE Models ☆5,085 · Updated this week
- Open Multilingual Chatbot for Everyone ☆1,274 · Jun 8, 2025 · Updated 8 months ago
- GLM-130B: An Open Bilingual Pre-Trained Model (ICLR 2023) ☆7,677 · Jul 25, 2023 · Updated 2 years ago
- Chinese LLaMA & Alpaca large language models, with local CPU/GPU training and deployment ☆18,964 · Jul 15, 2025 · Updated 7 months ago
- SOTA Math Opensource LLM ☆334 · Dec 12, 2023 · Updated 2 years ago
- ChatGLM2-6B: An Open Bilingual Chat LLM ☆15,663 · Jun 27, 2024 · Updated last year
- OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2, GPT-4, LLaMa2, Qwen, GLM, Claude, … ☆6,663 · Feb 10, 2026 · Updated last week
- GLM-4 series: Open Multilingual Multimodal Chat LMs ☆7,051 · Jul 4, 2025 · Updated 7 months ago