mindspore-lab / mindrlhfLinks

☆36

Alternatives and similar repositories for mindrlhf

Users that are interested in mindrlhf are comparing it to the libraries listed below

Sorting:

l294265421 / alpaca-rlhf
Finetuning LLaMA with RLHF (Reinforcement Learning with Human Feedback) based on DeepSpeed Chat
☆115Updated 2 years ago
genggui001 / Megatron-DeepSpeed-Llama
☆84Updated 2 years ago
hengjiUSTC / learn-llm
☆115Updated last year
akaihaoshuai / baby-llama2-chinese_cybertron
使用单个24G显卡，从0开始训练LLM
☆55Updated 4 months ago
yanqiangmiffy / how-to-train-tokenizer
怎么训练一个LLM分词器
☆154Updated 2 years ago
alibaba / ChatLearn
A flexible and efficient training framework for large-scale alignment tasks
☆439Updated last month
OpenMOSS / CoLLiE
Collaborative Training of Large Language Models in an Efficient Way
☆415Updated last year
cauyxy / bilivideos
☆51Updated 2 years ago
mindspore-lab / mindformers
☆177Updated this week
THUDM / AlignBench
大模型多维度中文对齐评测基准 (ACL 2024)
☆421Updated 3 weeks ago
OpenBMB / ModelCenter
Efficient, Low-Resource, Distributed transformer implementation based on BMTrain
☆264Updated last year
Glanvery / LLM-Travel
欢迎来到 "LLM-travel" 仓库！探索大语言模型（LLM）的奥秘 🚀。致力于深入理解、探讨以及实现与大模型相关的各种技术、原理和应用。
☆352Updated last year
OpenLLMAI / OpenLLMWiki
OpenLLMWiki: Docs of OpenLLMAI. Survey, reproduction and domain/task adaptation of open source chatgpt alternatives/implementations. PiXi…
☆262Updated 11 months ago
CASIA-LM / ChineseWebText
☆180Updated 2 years ago
firechecking / CleanTransformer
an implementation of transformer, bert, gpt, and diffusion models for learning purposes
☆159Updated last year
sunzeyeah / RLHF
Implementation of Chinese ChatGPT
☆289Updated 2 years ago
HarderThenHarder / RLLoggingBoard
A visuailzation tool to make deep understaning and easier debugging for RLHF training.
☆265Updated 9 months ago
sunkx109 / llama
Inference code for LLaMA models
☆127Updated 2 years ago
CoinCheung / gdGPT
Train llm (bloom, llama, baichuan2-7b, chatglm3-6b) with deepspeed pipeline mode. Faster than zero/zero++/fsdp.
☆98Updated last year
stanleylsx / llms_tool
一个基于HuggingFace开发的大语言模型训练、测试工具。支持各模型的webui、终端预测，低参数量及全参数模型训练(预训练、SFT、RM、PPO、DPO)和融合、量化。
☆220Updated last year
XU-YIJIE / grpo-flat
Train your grpo with zero dataset and low resources, 8bit/4bit/lora/qlora supported, multi-gpu supported ...
☆79Updated 6 months ago
alibaba / Megatron-LLaMA
Best practice for training LLaMA models in Megatron-LM
☆660Updated last year
bojone / NBCE
Naive Bayes-based Context Extension
☆325Updated 11 months ago
xv44586 / Chinese-instruction-datasets
中文 Instruction tuning datasets
☆141Updated last year
OpenBMB / BMTrain
Efficient Training (including pre-training and fine-tuning) for Big Models
☆612Updated 3 weeks ago
OpenBMB / UltraEval
[ACL 2024 Demo] Official GitHub repo for UltraEval: An open source framework for evaluating foundation models.
☆252Updated last year
yangjianxin1 / LLMPruner
☆313Updated 2 years ago
Chinese-Tiny-LLM / Chinese-Tiny-LLM
☆235Updated last year
HuangLK / transpeeder
train llama on a single A100 80G node using 🤗 transformers and 🚀 Deepspeed Pipeline Parallelism
☆225Updated 2 years ago
xubuvd / LLMs
专注于中文领域大语言模型，落地到某个行业某个领域，成为一个行业大模型、公司级别或行业级别领域大模型。
☆126Updated 8 months ago