OpenBMB / BMTrain
Efficient Training (including pre-training and fine-tuning) for Big Models
☆548 Updated last month
Related projects:
- Model Compression for Big Models ☆151 Updated last year
- Efficient Inference for Big Models ☆573 Updated last year
- Best practice for training LLaMA models in Megatron-LM ☆606 Updated 8 months ago
- Efficient, Low-Resource, Distributed transformer implementation based on BMTrain ☆233 Updated 9 months ago
- Collaborative Training of Large Language Models in an Efficient Way ☆405 Updated 3 weeks ago
- ☆447 Updated 3 months ago
- [ACL 2024] LongBench: A Bilingual, Multitask Benchmark for Long Context Understanding ☆615 Updated last week
- The official repo of Pai-Megatron-Patch for LLM & VLM large scale training developed by Alibaba Cloud. ☆649 Updated last week
- Easy and efficient fine-tuning of LLMs (supports LLaMA, LLaMA 2, LLaMA 3, Qwen, Baichuan, GLM, Falcon). Efficient quantized training and deployment for large models. ☆562 Updated 2 months ago
- [NIPS2023] RRHF & Wombat ☆789 Updated 11 months ago
- Implementation of Chinese ChatGPT ☆282 Updated 9 months ago
- ☆310 Updated 2 months ago
- ☆265 Updated 4 months ago
- Live Training for Open-source Big Models ☆512 Updated last year
- train llama on a single A100 80G node using 🤗 transformers and 🚀 Deepspeed Pipeline Parallelism ☆207 Updated 9 months ago
- FlagEval is an evaluation toolkit for AI large foundation models. ☆290 Updated 2 months ago
- ☆852 Updated last month
- A curated collection of open-source SFT datasets, updated continuously ☆413 Updated last year
- A purer tokenizer with a higher compression rate ☆438 Updated 5 months ago
- A collection of phenomena observed during the scaling of big foundation models, which may be developed into consensus, principles, or l… ☆274 Updated last year
- Naive Bayes-based Context Extension ☆310 Updated last year
- A plug-and-play library for parameter-efficient-tuning (Delta Tuning) ☆978 Updated last year
- ☆686 Updated 3 months ago
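- Firefly Chinese LLaMA-2 large model; supports continued pre-training of Baichuan2, Llama2, Llama, Falcon, Qwen, Baichuan, InternLM, Bloom, and other large models ☆396 Updated 10 months ago
- pCLUE: 1,000,000+ multi-task prompt learning dataset ☆461 Updated last year
- A multi-dimensional Chinese alignment evaluation benchmark for large models (ACL 2024) ☆292 Updated last month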
- ☆131 Updated last week
- ToolAlpaca: Generalized Tool Learning for Language Models with 3000 Simulated Cases ☆281 Updated 2 months ago
- Tencent Pre-training framework in PyTorch & Pre-trained Model Zoo ☆1,013 Updated last month
- A List of Big Models ☆340 Updated last year