OpenBMB / BMPrinciples
A collection of phenomena observed during the scaling of big foundation models, which may be developed into consensus, principles, or laws in the future
☆277Updated last year
Alternatives and similar repositories for BMPrinciples:
Users who are interested in BMPrinciples are comparing it to the repositories listed below
- Related work and background techniques for OpenAI o1☆215Updated last month
- ☆318Updated 7 months ago
- Paper List for In-context Learning 🌷☆177Updated last year
- Must-read Papers of Parameter-Efficient Tuning (Delta Tuning) Methods on Pre-trained Models.☆281Updated last year
- A repository sharing the literature on long-context large language models, including methodologies and evaluation benchmarks☆256Updated 7 months ago
- [NAACL'24] Self-data filtering of LLM instruction-tuning data using a novel perplexity-based difficulty score, without using any other mo…☆340Updated 5 months ago
- An Awesome Collection for LLM Survey☆328Updated 5 months ago
- [ACL 2024] A Survey of Chain of Thought Reasoning: Advances, Frontiers and Future☆421Updated last month
- [ICML 2024] LESS: Selecting Influential Data for Targeted Instruction Tuning☆412Updated 4 months ago
- Collection of training data management explorations for large language models☆311Updated 7 months ago
- Collaborative Training of Large Language Models in an Efficient Way☆413Updated 6 months ago
- ☆483Updated 2 months ago
- Efficient, Low-Resource, Distributed transformer implementation based on BMTrain☆246Updated last year
- Naive Bayes-based Context Extension☆320Updated 2 months ago
- InsTag: A Tool for Data Analysis in LLM Supervised Fine-tuning☆242Updated last year
- Finetuning LLaMA with RLHF (Reinforcement Learning from Human Feedback) based on DeepSpeed Chat☆112Updated last year
- Code and Data for Scaling Relationship on Learning Mathematical Reasoning with Large Language Models☆245Updated 5 months ago
- A series of technical reports on Slow Thinking with LLM☆438Updated this week
- ☆891Updated 7 months ago
- Implementation for "Step-DPO: Step-wise Preference Optimization for Long-chain Reasoning of LLMs"☆350Updated last month
- Paper collection on building and evaluating language model agents via executable language grounding☆345Updated 10 months ago
- [EMNLP 2023] Lion: Adversarial Distillation of Proprietary Large Language Models☆204Updated last year
- Papers related to LLM agents published at top conferences☆311Updated last year
- GPT-Fathom is an open-source and reproducible LLM evaluation suite, benchmarking 10+ leading open-source and closed-source LLMs as well a…☆349Updated 10 months ago
- LoRAMoE: Revolutionizing Mixture of Experts for Maintaining World Knowledge in Language Model Alignment☆292Updated 10 months ago
- The repository for the survey paper "Survey on Large Language Models Factuality: Knowledge, Retrieval and Domain-Specificity"☆333Updated 10 months ago
- LongBench v2 and LongBench (ACL 2024)☆788Updated last month
- ☆132Updated 10 months ago
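- Welcome to the "LLM-travel" repository! Explore the mysteries of large language models (LLMs) 🚀. Dedicated to deeply understanding, discussing, and implementing the techniques, principles, and applications related to large models.☆300Updated 7 months ago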
- ☆278Updated 10 months ago