OpenBMB / BMPrinciplesLinks
A collection of phenomenons observed during the scaling of big foundation models, which may be developed into consensus, principles, or laws in the future
β280Updated last year
Alternatives and similar repositories for BMPrinciples
Users that are interested in BMPrinciples are comparing it to the libraries listed below
Sorting:
- β319Updated 10 months ago
- Paper List for In-context Learning π·β183Updated last year
- The related works and background techniques about Openai o1β221Updated 4 months ago
- [NAACL'24] Self-data filtering of LLM instruction-tuning data using a novel perplexity-based difficulty score, without using any other moβ¦β370Updated 9 months ago
- Efficient, Low-Resource, Distributed transformer implementation based on BMTrainβ255Updated last year
- A repository sharing the literatures about long-context large language models, including the methodologies and the evaluation benchmarksβ263Updated 10 months ago
- Finetuning LLaMA with RLHF (Reinforcement Learning with Human Feedback) based on DeepSpeed Chatβ115Updated 2 years ago
- InsTag: A Tool for Data Analysis in LLM Supervised Fine-tuningβ261Updated last year
- Codes and Data for Scaling Relationship on Learning Mathematical Reasoning with Large Language Modelsβ264Updated 8 months ago
- β141Updated last year
- A visuailzation tool to make deep understaning and easier debugging for RLHF training.β203Updated 3 months ago
- Collection of training data management explorations for large language modelsβ325Updated 10 months ago
- [EMNLP 2023] Lion: Adversarial Distillation of Proprietary Large Language Modelsβ206Updated last year
- Collaborative Training of Large Language Models in an Efficient Wayβ415Updated 9 months ago
- Must-read Papers of Parameter-Efficient Tuning (Delta Tuning) Methods on Pre-trained Models.β284Updated last year
- [ACL 2024] A Survey of Chain of Thought Reasoning: Advances, Frontiers and Futureβ450Updated 4 months ago
- 倧樑εε€η»΄εΊ¦δΈζε―Ήι½θ―ζ΅εΊε (ACL 2024)β389Updated 9 months ago
- Real-time updated, fine-grained reading list on LLM-synthetic-data.π₯β259Updated 4 months ago
- an implementation of transformer, bert, gpt, and diffusion models for learning purposesβ153Updated 7 months ago
- Naive Bayes-based Context Extensionβ326Updated 5 months ago
- β540Updated 5 months ago
- [ICML 2024] LESS: Selecting Influential Data for Targeted Instruction Tuningβ453Updated 7 months ago
- An Awesome Collection for LLM Surveyβ360Updated last week
- [ACL 2024 Demo] Official GitHub repo for UltraEval: An open source framework for evaluating foundation models.β243Updated 7 months ago
- A paper & resource list of large language models, including course, paper, demo, figuresβ199Updated last year
- GPT-Fathom is an open-source and reproducible LLM evaluation suite, benchmarking 10+ leading open-source and closed-source LLMs as well aβ¦β348Updated last year
- Code for a New Loss for Mitigating the Bias of Learning Difficulties in Generative Language Modelsβ62Updated 3 months ago
- papers related to LLM-agent that published on top conferencesβ315Updated last month
- Paper collections of retrieval-based (augmented) language model.β232Updated last year
- β169Updated last year