swtheing / LLM-Performance-Improvement-PaperLinks
☆17Updated 2 years ago
Alternatives and similar repositories for LLM-Performance-Improvement-Paper
Users that are interested in LLM-Performance-Improvement-Paper are comparing it to the libraries listed below
Sorting:
- InsTag: A Tool for Data Analysis in LLM Supervised Fine-tuning☆263Updated last year
- Finetuning LLaMA with RLHF (Reinforcement Learning with Human Feedback) based on DeepSpeed Chat☆114Updated 2 years ago
- [NAACL'24] Self-data filtering of LLM instruction-tuning data using a novel perplexity-based difficulty score, without using any other mo…☆379Updated 3 weeks ago
- ☆319Updated last year
- [EMNLP 2023] Lion: Adversarial Distillation of Proprietary Large Language Models☆209Updated last year
- ☆142Updated last year
- Codes and Data for Scaling Relationship on Learning Mathematical Reasoning with Large Language Models☆266Updated 10 months ago
- Naive Bayes-based Context Extension☆326Updated 7 months ago
- ☆128Updated 2 years ago
- ☆172Updated last year
- 大模型多维度中文对齐评测基准 (ACL 2024)☆398Updated 11 months ago
- ☆281Updated last year
- ☆144Updated last year
- A collection of phenomenons observed during the scaling of big foundation models, which may be developed into consensus, principles, or l…☆281Updated last year
- ☆36Updated 7 months ago
- a-m-team's exploration in large language modeling☆173Updated last month
- ☆83Updated last year
- 🐋 An unofficial implementation of Self-Alignment with Instruction Backtranslation.☆140Updated 2 months ago
- The related works and background techniques about Openai o1☆223Updated 6 months ago
- A visuailzation tool to make deep understaning and easier debugging for RLHF training.☆228Updated 4 months ago
- ☆294Updated 11 months ago
- [ACL2024] T-Eval: Evaluating Tool Utilization Capability of Large Language Models Step by Step☆280Updated last year
- A high-throughput and memory-efficient inference and serving engine for LLMs☆18Updated last year
- Clustering and Ranking: Diversity-preserved Instruction Selection through Expert-aligned Quality Estimation☆85Updated 8 months ago
- 怎么训练一个LLM分词器☆151Updated 2 years ago
- ☆63Updated 2 years ago
- 中文 Instruction tuning datasets☆132Updated last year
- Paper List for a new paradigm of NLP: Interactive NLP (https://arxiv.org/abs/2305.13246)☆215Updated 2 years ago
- A One-Stop Reward Model Platform☆45Updated this week
- Collaborative Training of Large Language Models in an Efficient Way☆416Updated 10 months ago