SkyworkAI / SkyworkLinks

Skywork series models are pre-trained on 3.2TB of high-quality multilingual (mainly Chinese and English) and code data. We have open-sourced the model, training data, evaluation data, evaluation methods, etc.

☆1,409

Alternatives and similar repositories for Skywork

Users that are interested in Skywork are comparing it to the libraries listed below

Sorting:

xverse-ai / XVERSE-13B
XVERSE-13B: A multilingual large language model developed by XVERSE Technology Inc.
☆645Updated last year
hkust-nlp / ceval
Official github repo for C-Eval, a Chinese evaluation suite for foundation models [NeurIPS 2023]
☆1,760Updated last week
TigerResearch / TigerBot
TigerBot: A multi-language multi-task LLM
☆2,256Updated 7 months ago
haonan-li / CMMLU
CMMLU: Measuring massive multitask language understanding in Chinese
☆774Updated 7 months ago
ymcui / Chinese-Mixtral
中文Mixtral混合专家大模型（Chinese Mixtral MoE LLMs）
☆604Updated last year
michael-wzhu / Chinese-LlaMA2
Repo for adapting Meta LlaMA2 in Chinese! META最新发布的LlaMA2的汉化版！（完全开源可商用）
☆742Updated last year
IEIT-Yuan / Yuan-2.0
Yuan 2.0 Large Language Model
☆689Updated last year
HIT-SCIR / Chinese-Mixtral-8x7B
中文Mixtral-8x7B（Chinese-Mixtral-8x7B）
☆650Updated 11 months ago
baichuan-inc / Baichuan-13B
A 13B large language model developed by Baichuan Intelligent Technology
☆2,969Updated last year
FlagAI-Open / Aquila2
The official repo of Aquila2 series proposed by BAAI, including pretrained & chat large language models.
☆444Updated 9 months ago
charent / Phi2-mini-Chinese
Phi2-Chinese-0.2B 从0开始训练自己的Phi2中文小模型，支持接入langchain加载本地知识库做检索增强生成RAG。Training your own Phi2 small chat model from scratch.
☆561Updated last year
hikariming / chat-dataset-baseline
人工精调的中文对话数据集和一段chatglm的微调代码
☆1,184Updated 3 months ago
charent / ChatLM-mini-Chinese
中文对话0.2B小模型（ChatLM-Chinese-0.2B），开源所有数据集来源、数据清洗、tokenizer训练、模型预训练、SFT指令微调、RLHF优化等流程的全部代码。支持下游任务sft微调，给出三元组信息抽取微调示例。
☆1,570Updated last year
Duxiaoman-DI / XuanYuan
轩辕：度小满中文金融对话大模型
☆1,246Updated 6 months ago
OpenLMLab / GAOKAO-Bench
GAOKAO-Bench is an evaluation framework that utilizes GAOKAO questions as a dataset to evaluate large language models.
☆673Updated 6 months ago
Tencent / TencentPretrain
Tencent Pre-training framework in PyTorch & Pre-trained Model Zoo
☆1,078Updated last year
beyondguo / LLM-Tuning
Tuning LLMs with no tears💦; Sample Design Engineering (SDE) for more efficient downstream-tuning.
☆1,009Updated last year
THUDM / WebGLM
WebGLM: An Efficient Web-enhanced Question Answering System (KDD 2023)
☆1,602Updated 4 months ago
CVI-SZU / Linly
Chinese-LLaMA 1&2、Chinese-Falcon 基础模型；ChatFlow中文对话模型；中文OpenLLaMA模型；NLP预训练/指令微调数据集
☆3,055Updated last year
AndrewZhe / lawyer-llama
中文法律LLaMA (LLaMA for Chinese legel domain)
☆956Updated 11 months ago
baichuan-inc / Baichuan2
A series of large language models developed by Baichuan Intelligent Technology
☆4,120Updated 8 months ago
git-cloner / aliendao
huggingface mirror download
☆584Updated 4 months ago
PhoebusSi / Alpaca-CoT
We unified the interfaces of instruction-tuning data (e.g., CoT data), multiple LLMs and parameter-efficient methods (e.g., lora, p-tunin…
☆2,761Updated last year
LC1332 / Chinese-alpaca-lora
骆驼:A Chinese finetuned instruction LLaMA. Developed by 陈启源 @ 华中师范大学 & 李鲁鲁 @ 商汤科技 & 冷子昂 @ 商汤科技
☆721Updated 2 years ago
HIT-SCIR / huozi
活字通用大模型
☆393Updated 10 months ago
chaoswork / sft_datasets
开源SFT数据集整理,随时补充
☆530Updated 2 years ago
ssbuild / chatglm_finetuning
chatglm 6b finetuning and alpaca finetuning
☆1,543Updated 4 months ago
vivo-ai-lab / BlueLM
BlueLM(蓝心大模型): Open large language models developed by vivo AI Lab
☆919Updated 7 months ago
airaria / Visual-Chinese-LLaMA-Alpaca
多模态中文LLaMA&Alpaca大语言模型（VisualCLA）
☆450Updated 2 years ago
OrionStarAI / Orion
Orion-14B is a family of models includes a 14B foundation LLM, and a series of models: a chat model, a long context model, a quantized mo…
☆793Updated last year