OpenBMB / BMList
A List of Big Models
☆343Updated last year
Alternatives and similar repositories for BMList:
Users that are interested in BMList are comparing it to the libraries listed below
- Efficient Training (including pre-training and fine-tuning) for Big Models☆585Updated 2 weeks ago
- Efficient Inference for Big Models☆583Updated 2 years ago
- Efficient, Low-Resource, Distributed transformer implementation based on BMTrain☆253Updated last year
- Model Compression for Big Models☆161Updated last year
- ☆459Updated 10 months ago
- Naive Bayes-based Context Extension☆326Updated 4 months ago
- Collaborative Training of Large Language Models in an Efficient Way☆415Updated 8 months ago
- train llama on a single A100 80G node using 🤗 transformers and 🚀 Deepspeed Pipeline Parallelism☆219Updated last year
- ☆279Updated last year
- Live Training for Open-source Big Models☆506Updated last year
- An opensource ChatBot built with ExpertPrompting which achieves 96% of ChatGPT's capability.☆300Updated last year
- alpaca中文指令微调数据集☆393Updated 2 years ago
- ☆128Updated last year
- Implementation of Chinese ChatGPT☆287Updated last year
- Multi-language Enhanced LLaMA☆301Updated 2 years ago
- Fast Inference Solutions for BLOOM☆561Updated 6 months ago
- Code used for sourcing and cleaning the BigScience ROOTS corpus☆310Updated 2 years ago
- LongQLoRA: Extent Context Length of LLMs Efficiently☆164Updated last year
- [NIPS2023] RRHF & Wombat☆806Updated last year
- A simulation framework for RLHF and alternatives. Develop your RLHF method without collecting human data.☆807Updated 10 months ago
- Rectified Rotary Position Embeddings☆367Updated 11 months ago
- [ACL'24 Outstanding] Data and code for L-Eval, a comprehensive long context language models evaluation benchmark☆376Updated 9 months ago
- BiLLa: A Bilingual LLaMA with Enhanced Reasoning Ability☆420Updated last year
- ☆172Updated 2 years ago
- ☆458Updated last year
- Official repository for LongChat and LongEval☆519Updated 11 months ago
- MultimodalC4 is a multimodal extension of c4 that interleaves millions of images with text.☆928Updated last month
- ☆318Updated 9 months ago
- ☆308Updated 2 years ago
- The complete training code of the open-source high-performance Llama model, including the full process from pre-training to RLHF.☆44Updated last year