OpenBMB / BMPrinciples
A collection of phenomena observed during the scaling of big foundation models, which may be developed into consensus, principles, or laws in the future
☆285Updated 2 years ago
Alternatives and similar repositories for BMPrinciples
Users who are interested in BMPrinciples are comparing it to the libraries listed below
- ☆320Updated last year
- The related works and background techniques about OpenAI o1☆221Updated 11 months ago
- Paper List for In-context Learning 🌷☆188Updated last year
- Efficient, Low-Resource, Distributed transformer implementation based on BMTrain☆264Updated 2 years ago
- An Awesome Collection for LLM Survey☆381Updated 7 months ago
- A paper & resource list of large language models, including course, paper, demo, figures☆200Updated 2 years ago
- A repository sharing the literature on long-context large language models, including the methodologies and the evaluation benchmarks☆269Updated last year
- Collaborative Training of Large Language Models in an Efficient Way☆416Updated last year
- Collection of training data management explorations for large language models☆336Updated last year
- Finetuning LLaMA with RLHF (Reinforcement Learning from Human Feedback) based on DeepSpeed Chat☆116Updated 2 years ago
- [ACL 2024] A Survey of Chain of Thought Reasoning: Advances, Frontiers and Future☆481Updated 11 months ago
- [NAACL'24] Self-data filtering of LLM instruction-tuning data using a novel perplexity-based difficulty score, without using any other mo…☆410Updated 6 months ago
- A live reading list for LLM data synthesis (Updated to July, 2025).☆426Updated 4 months ago
- Must-read Papers of Parameter-Efficient Tuning (Delta Tuning) Methods on Pre-trained Models.☆286Updated 2 years ago
- Papers related to LLM agents published at top conferences☆320Updated 8 months ago
- [ICML 2024] LESS: Selecting Influential Data for Targeted Instruction Tuning☆511Updated last year
- InsTag: A Tool for Data Analysis in LLM Supervised Fine-tuning☆284Updated 2 years ago
- Naive Bayes-based Context Extension☆326Updated last year
- Papers & Works for large language models (OpenAI GPT-4, Meta Llama, etc.).☆317Updated last month
- ☆916Updated last year
- Deita: Data-Efficient Instruction Tuning for Alignment [ICLR2024]☆579Updated last year
- [EMNLP 2023] Lion: Adversarial Distillation of Proprietary Large Language Models☆212Updated last year
- GPT-Fathom is an open-source and reproducible LLM evaluation suite, benchmarking 10+ leading open-source and closed-source LLMs as well a…☆346Updated last year
- Paper collections of retrieval-based (augmented) language model.☆232Updated last year
- The repository for the survey paper <<Survey on Large Language Models Factuality: Knowledge, Retrieval and Domain-Specificity>>☆340Updated last year
- Code for a New Loss for Mitigating the Bias of Learning Difficulties in Generative Language Models☆66Updated 10 months ago
- A collection for math word problem (MWP) works, including datasets, algorithms and so on.☆47Updated last year
- A visualization tool for deeper understanding and easier debugging of RLHF training.☆275Updated 10 months ago
- Paper List for a new paradigm of NLP: Interactive NLP (https://arxiv.org/abs/2305.13246)☆217Updated 2 years ago
- ☆550Updated 11 months ago