OpenBMB / ModelCenter
Efficient, Low-Resource, Distributed transformer implementation based on BMTrain
☆256Updated last year
Alternatives and similar repositories for ModelCenter
Users that are interested in ModelCenter are comparing it to the libraries listed below
Sorting:
- Model Compression for Big Models☆162Updated last year
- Efficient Training (including pre-training and fine-tuning) for Big Models☆589Updated this week
- Finetuning LLaMA with RLHF (Reinforcement Learning with Human Feedback) based on DeepSpeed Chat☆115Updated last year
- A collection of phenomenons observed during the scaling of big foundation models, which may be developed into consensus, principles, or l…☆280Updated last year
- ☆280Updated last year
- Implementation of Chinese ChatGPT☆287Updated last year
- Collaborative Training of Large Language Models in an Efficient Way☆415Updated 8 months ago
- ☆459Updated 11 months ago
- 大模型多维度中文对齐评测基准 (ACL 2024)☆386Updated 9 months ago
- 中文图书语料MD5链接☆218Updated last year
- 欢迎来到 "LLM-travel" 仓库!探索大语言模型(LLM)的奥秘 🚀。致力于深入理解、探讨以及实现与大模型相关的各种技术、原理和应用。☆321Updated 9 months ago
- ☆168Updated last year
- 中文 Instruction tuning datasets☆131Updated last year
- 开源SFT数据集整理,随时补充☆513Updated last year
- ☆128Updated last year
- ☆308Updated 2 years ago
- 对ChatGLM直接使用RLHF提升或降低目标输出概率|Modify ChatGLM output with only RLHF☆192Updated last year
- ☆319Updated 10 months ago
- Naive Bayes-based Context Extension☆326Updated 5 months ago
- OpenLLMWiki: Docs of OpenLLMAI. Survey, reproduction and domain/task adaptation of open source chatgpt alternatives/implementations. PiXi…☆259Updated 5 months ago
- 怎么训练一个LLM分词器☆144Updated last year
- MEASURING MASSIVE MULTITASK CHINESE UNDERSTANDING☆87Updated last year
- train llama on a single A100 80G node using 🤗 transformers and 🚀 Deepspeed Pipeline Parallelism☆219Updated last year
- ☆160Updated 2 years ago
- ☆84Updated last year
- 语言模型中文认知能力分析☆237Updated last year
- Firefly中文LLaMA-2大模型,支持增量预训练Baichuan2、Llama2、Llama、Falcon、Qwen、Baichuan、InternLM、Bloom等大模型☆410Updated last year
- Live Training for Open-source Big Models☆506Updated last year
- Baichuan2代码的逐行解析版本,适合小白☆214Updated last year
- ☆172Updated 2 years ago