01-ai / Yi
A series of large language models trained from scratch by developers @01-ai
☆7,830 Updated 5 months ago
Alternatives and similar repositories for Yi:
Users who are interested in Yi are comparing it to the repositories listed below
- The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud. ☆18,214 Updated last week
- Official release of InternLM series (InternLM, InternLM2, InternLM2.5, InternLM3). ☆6,890 Updated 3 months ago
- LMDeploy is a toolkit for compressing, deploying, and serving LLMs. ☆6,303 Updated this week
- Retrieval and Retrieval-augmented LLMs ☆9,558 Updated 3 weeks ago
- GLM-130B: An Open Bilingual Pre-Trained Model (ICLR 2023) ☆7,681 Updated last year
- High-speed Large Language Model Serving for Local Deployment ☆8,191 Updated 2 months ago
- A state-of-the-art-level open visual language model | multimodal pretrained model ☆6,514 Updated 11 months ago
- The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud. ☆5,851 Updated 9 months ago
- Large Language Model Text Generation Inference ☆10,081 Updated last week
- Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024) ☆48,565 Updated this week
- A series of large language models developed by Baichuan Intelligent Technology ☆4,126 Updated 6 months ago
- SGLang is a fast serving framework for large language models and vision language models. ☆13,976 Updated this week
- A high-throughput and memory-efficient inference and serving engine for LLMs ☆46,848 Updated this week
- Chinese LLaMA-2 & Alpaca-2 LLMs (phase-two project) with 64K long-context models ☆7,158 Updated 7 months ago
- Replace OpenAI GPT with another LLM in your app by changing a single line of code. Xinference gives you the freedom to use any LLM you ne… ☆7,770 Updated this week
- 🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning. ☆18,320 Updated this week
- DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective. ☆38,206 Updated last week
- QLoRA: Efficient Finetuning of Quantized LLMs ☆10,417 Updated 11 months ago
- Universal LLM Deployment Engine with ML Compilation ☆20,579 Updated last week
- An easy-to-use LLM quantization package with user-friendly APIs, based on the GPTQ algorithm. ☆4,831 Updated 3 weeks ago
- Firefly: a large-model training tool that supports training Qwen2.5, Qwen2, Yi1.5, Phi-3, Llama3, Gemma, MiniCPM, Yi, Deepseek, Orion, Xverse, Mixtral-8x7B, Zephyr, Mistral, Baichuan2, Llama2, … ☆6,381 Updated 6 months ago
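- Llama Chinese community: a real-time roundup of the latest Llama learning resources, building the best open-source ecosystem for Chinese Llama large models; fully open source and available for commercial use ☆14,570 Updated last month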
- Example models using DeepSpeed ☆6,479 Updated 3 weeks ago
- [NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond. ☆22,398 Updated 8 months ago
- Agent framework and applications built upon Qwen>=3.0, featuring Function Calling, MCP, Code Interpreter, RAG, Chrome extension, etc. ☆7,906 Updated last week
- DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model ☆4,887 Updated 7 months ago
- The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens. ☆8,463 Updated last year
- Chinese LLaMA & Alpaca large language models + local CPU/GPU training and deployment ☆18,824 Updated last year
- Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 500+ LLMs (Qwen3, Qwen3-MoE, Llama4, InternLM3, GLM4, Mistral, Yi1.5, DeepSeek-R1, ...) an… ☆7,450 Updated this week
- A large-scale 7B pretraining language model developed by BaiChuan-Inc. ☆5,691 Updated 9 months ago