OpenBMB / BMListLinks
A List of Big Models
☆344Updated 2 years ago
Alternatives and similar repositories for BMList
Users that are interested in BMList are comparing it to the libraries listed below
Sorting:
- Efficient Inference for Big Models☆588Updated 2 years ago
- Efficient Training (including pre-training and fine-tuning) for Big Models☆608Updated 3 weeks ago
- ☆460Updated last year
- Collaborative Training of Large Language Models in an Efficient Way☆416Updated last year
- Efficient, Low-Resource, Distributed transformer implementation based on BMTrain☆262Updated last year
- Model Compression for Big Models☆164Updated 2 years ago
- The complete training code of the open-source high-performance Llama model, including the full process from pre-training to RLHF.☆50Updated 2 years ago
- Crosslingual Generalization through Multitask Finetuning☆535Updated 11 months ago
- LLM Zoo collects information of various open- and close-sourced LLMs☆271Updated 2 years ago
- Official repository for LongChat and LongEval☆532Updated last year
- An opensource ChatBot built with ExpertPrompting which achieves 96% of ChatGPT's capability.☆300Updated 2 years ago
- Naive Bayes-based Context Extension☆326Updated 9 months ago
- ☆172Updated 2 years ago
- A unified tokenization tool for Images, Chinese and English.☆151Updated 2 years ago
- [NIPS2023] RRHF & Wombat☆812Updated last year
- The aim of this repository is to utilize LLaMA to reproduce and enhance the Stanford Alpaca☆98Updated 2 years ago
- Code used for sourcing and cleaning the BigScience ROOTS corpus☆314Updated 2 years ago
- train llama on a single A100 80G node using 🤗 transformers and 🚀 Deepspeed Pipeline Parallelism☆224Updated last year
- ☆763Updated last year
- Rectified Rotary Position Embeddings☆381Updated last year
- A simulation framework for RLHF and alternatives. Develop your RLHF method without collecting human data.☆825Updated last year
- Live Training for Open-source Big Models☆507Updated 2 years ago
- Multi-language Enhanced LLaMA☆302Updated 2 years ago
- ☆127Updated 2 years ago
- ☆308Updated 2 years ago
- Chinese large language model base generated through incremental pre-training on Chinese datasets☆238Updated 2 years ago
- [ICLR 2023] Codebase for Copy-Generator model, including an implementation of kNN-LM☆187Updated 7 months ago
- MultimodalC4 is a multimodal extension of c4 that interleaves millions of images with text.☆938Updated 5 months ago
- ☆281Updated last year
- ☆273Updated 2 years ago