swtheing / LLM-Performance-Improvement-PaperLinks
☆17Updated 2 years ago
Alternatives and similar repositories for LLM-Performance-Improvement-Paper
Users that are interested in LLM-Performance-Improvement-Paper are comparing it to the libraries listed below
Sorting:
- Finetuning LLaMA with RLHF (Reinforcement Learning with Human Feedback) based on DeepSpeed Chat☆117Updated 2 years ago
- InsTag: A Tool for Data Analysis in LLM Supervised Fine-tuning☆284Updated 2 years ago
- ☆147Updated last year
- ☆184Updated 2 years ago
- [EMNLP 2023] Lion: Adversarial Distillation of Proprietary Large Language Models☆212Updated last year
- ☆321Updated last year
- a-m-team's exploration in large language modeling☆195Updated 8 months ago
- ☆84Updated 2 years ago
- 🐋 An unofficial implementation of Self-Alignment with Instruction Backtranslation.☆137Updated 9 months ago
- [NAACL'24] Self-data filtering of LLM instruction-tuning data using a novel perplexity-based difficulty score, without using any other mo…☆416Updated 7 months ago
- Naive Bayes-based Context Extension☆327Updated last year
- Clustering and Ranking: Diversity-preserved Instruction Selection through Expert-aligned Quality Estimation☆90Updated last year
- The related works and background techniques about Openai o1☆220Updated last year
- Codes and Data for Scaling Relationship on Learning Mathematical Reasoning with Large Language Models☆269Updated last year
- [ICLR 2025] 🧬 RegMix: Data Mixture as Regression for Language Model Pre-training (Spotlight)☆185Updated 11 months ago
- A repository sharing the literatures about long-context large language models, including the methodologies and the evaluation benchmarks☆272Updated last year
- Llama-3-SynE: A Significantly Enhanced Version of Llama-3 with Advanced Scientific Reasoning and Chinese Language Capabilities | 继续预训练提升 …☆37Updated 8 months ago
- Efficient, Low-Resource, Distributed transformer implementation based on BMTrain☆266Updated 2 years ago
- ☆322Updated last year
- ☆147Updated last year
- 怎么训练一个LLM分词器☆153Updated 2 years ago
- [ACL2024] T-Eval: Evaluating Tool Utilization Capability of Large Language Models Step by Step☆304Updated last year
- A high-throughput and memory-efficient inference and serving engine for LLMs☆18Updated 2 years ago
- [ACL'24] Superfiltering: Weak-to-Strong Data Filtering for Fast Instruction-Tuning☆187Updated 7 months ago
- A collection for math word problem (MWP) works, including datasets, algorithms and so on.☆47Updated last year
- ☆129Updated 2 years ago
- 大模型多维度中文对齐评测基准 (ACL 2024)☆421Updated 3 months ago
- ☆164Updated 2 years ago
- ☆98Updated last year
- ☆70Updated 2 years ago