mindspore-lab / mindrlhf
☆30Updated 2 months ago
Alternatives and similar repositories for mindrlhf:
Users that are interested in mindrlhf are comparing it to the libraries listed below
- 怎么训练一个LLM分词器☆140Updated last year
- ☆84Updated last year
- Finetuning LLaMA with RLHF (Reinforcement Learning with Human Feedback) based on DeepSpeed Chat☆112Updated last year
- A flexible and efficient training framework for large-scale alignment tasks☆304Updated last week
- 文本去重☆68Updated 8 months ago
- 使用单个24G显卡,从0开始训练LLM☆50Updated 3 months ago
- A MoE impl for PyTorch, [ATC'23] SmartMoE☆61Updated last year
- ☆162Updated last year
- ☆104Updated 3 months ago
- ☆52Updated last year
- CLongEval: A Chinese Benchmark for Evaluating Long-Context Large Language Models☆38Updated 11 months ago
- NTK scaled version of ALiBi position encoding in Transformer.☆67Updated last year
- SuperCLUE-Agent: 基于中文原生任务的Agent智能体核心能力测评基准☆81Updated last year
- 大模型多维度中文对齐评测基准 (ACL 2024)☆359Updated 6 months ago
- ☆159Updated last year
- code for Scaling Laws of RoPE-based Extrapolation☆70Updated last year
- A visuailzation tool to make deep understaning and easier debugging for RLHF training.☆148Updated this week
- 欢迎来到 "LLM-travel" 仓库!探索大语言模型(LLM)的奥秘 🚀。致力于深入理解、探讨以及实现与大模型相关的各种技术、原理和应用。☆298Updated 7 months ago
- ☆153Updated this week
- Train llm (bloom, llama, baichuan2-7b, chatglm3-6b) with deepspeed pipeline mode. Faster than zero/zero++/fsdp.☆92Updated last year
- Ongoing research training transformer language models at scale, including: BERT & GPT-2☆69Updated last year
- adds Sequence Parallelism into LLaMA-Factory☆154Updated this week
- train llama on a single A100 80G node using 🤗 transformers and 🚀 Deepspeed Pipeline Parallelism☆215Updated last year
- ☆130Updated 10 months ago
- ☆139Updated 7 months ago
- 大语言模型指令调优工具(支持 FlashAttention)☆169Updated last year
- 对ChatGLM直接使用RLHF提升或降低目标输出概率|Modify ChatGLM output with only RLHF☆192Updated last year
- ☆318Updated 7 months ago
- Collaborative Training of Large Language Models in an Efficient Way☆411Updated 5 months ago
- ☆43Updated last year