01-ai / Yi
A series of large language models trained from scratch by developers @01-ai
☆7,827Updated 3 months ago
Alternatives and similar repositories for Yi:
Users that are interested in Yi are comparing it to the libraries listed below
- Official release of InternLM series (InternLM, InternLM2, InternLM2.5, InternLM3).☆6,826Updated last month
- The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.☆17,579Updated last month
- The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.☆5,670Updated 7 months ago
- a state-of-the-art-level open visual language model | 多模态预训练模型☆6,432Updated 9 months ago
- Agent framework and applications built upon Qwen>=2.0, featuring Function Calling, Code Interpreter, RAG, and Chrome extension.☆6,316Updated this week
- A series of large language models developed by Baichuan Intelligent Technology☆4,128Updated 4 months ago
- Retrieval and Retrieval-augmented LLMs☆9,063Updated this week
- LLMs build upon Evol Insturct: WizardLM, WizardCoder, WizardMath☆9,362Updated 7 months ago
- SGLang is a fast serving framework for large language models and vision language models.☆12,427Updated this week
- [CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型☆7,327Updated this week
- ModelScope: bring the notion of Model-as-a-Service to life.☆7,586Updated this week
- LMDeploy is a toolkit for compressing, deploying, and serving LLMs.☆5,898Updated this week
- A high-throughput and memory-efficient inference and serving engine for LLMs☆42,344Updated this week
- [NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.☆21,922Updated 7 months ago
- An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.☆38,169Updated 3 weeks ago
- QLoRA: Efficient Finetuning of Quantized LLMs☆10,333Updated 9 months ago
- Qwen2.5 is the large language model series developed by Qwen team, Alibaba Cloud.☆16,356Updated last week
- An easy-to-use LLMs quantization package with user-friendly apis, based on GPTQ algorithm.☆4,766Updated last week
- An Extensible Toolkit for Finetuning and Inference of Large Foundation Models. Large Models for All.☆8,384Updated this week
- Universal LLM Deployment Engine with ML Compilation☆20,249Updated this week
- Large Language Model Text Generation Inference☆9,922Updated this week
- GLM-130B: An Open Bilingual Pre-Trained Model (ICLR 2023)☆7,676Updated last year
- Firefly: 大模型训练工具,支持训练Qwen2.5、Qwen2、Yi1.5、Phi-3、Llama3、Gemma、MiniCPM、Yi、Deepseek、Orion、Xverse、Mixtral-8x7B、Zephyr、Mistral、Baichuan2、Llma2、…☆6,270Updated 5 months ago
- OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2,GPT-4,LLaMa2, Qwen,GLM, Claude, …☆4,994Updated this week
- A 13B large language model developed by Baichuan Intelligent Technology☆2,979Updated last year
- 🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.☆17,837Updated this week
- RWKV (pronounced RwaKuv) is an RNN with great LLM performance, which can also be directly trained like a GPT transformer (parallelizable)…☆13,409Updated this week
- Instruction Tuning with GPT-4☆4,284Updated last year
- Fast and memory-efficient exact attention☆16,462Updated this week
- ChatRWKV is like ChatGPT but powered by RWKV (100% RNN) language model, and open source.☆9,465Updated last month